Author:
Evans Lauren, Wu Yiyuan, Xi Wenna, Ghosh Arnab K., Kim Min-hyung, Alexopoulos George S., Pathak Jyotishman, Banerjee Samprit
Abstract
Background
A significant number of late middle-aged adults with depression have a high illness burden resulting from chronic conditions which put them at high risk of hospitalization. Many late middle-aged adults are covered by commercial health insurance, but such insurance claims have not been used to identify the risk of hospitalization in individuals with depression. In the present study, we developed and validated a non-proprietary model to identify late middle-aged adults with depression at risk for hospitalization, using machine learning methods.
Methods
This retrospective cohort study involved 71,682 commercially insured late middle-aged adults (aged 55–64 years) diagnosed with depression. National health insurance claims were used to capture demographics, health care utilization, and health status during the base year. Health status was captured using 70 chronic health conditions and 46 mental health conditions. The outcomes were 1- and 2-year preventable hospitalization. For each of the two outcomes, we evaluated seven modelling approaches. Four prediction models used logistic regression with different combinations of predictors to evaluate the relative contribution of each group of variables. Three prediction models used machine learning approaches: logistic regression with LASSO penalty, random forests (RF), and gradient boosting machine (GBM).
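As a rough illustration of how the three machine learning approaches named above could be compared on held-out AUC, the sketch below fits each one to synthetic data. Everything in it is an assumption made for illustration: the scikit-learn estimators, the hyperparameters (for example C=0.1), the synthetic dataset, and the hypothetical helper name compare_models. It does not reproduce the study's data, predictors, or tuning procedure.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split


def compare_models(seed: int = 0) -> dict:
    """Fit LASSO logistic regression, RF, and GBM; return held-out AUC per model."""
    # Synthetic stand-in for claims-derived predictors (NOT the study data):
    # 20 features, 5 of them informative, with a rare positive outcome.
    X, y = make_classification(
        n_samples=2000, n_features=20, n_informative=5,
        weights=[0.9], random_state=seed,
    )
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed
    )

    models = {
        # L1 (LASSO) penalty; C is the inverse penalty strength (assumed value).
        "lasso_logistic": LogisticRegression(penalty="l1", solver="liblinear", C=0.1),
        "random_forest": RandomForestClassifier(n_estimators=100, random_state=seed),
        "gbm": GradientBoostingClassifier(random_state=seed),
    }

    aucs = {}
    for name, model in models.items():
        model.fit(X_tr, y_tr)
        # Predicted probability of the positive class on the held-out split.
        prob = model.predict_proba(X_te)[:, 1]
        aucs[name] = roc_auc_score(y_te, prob)
    return aucs


if __name__ == "__main__":
    for name, value in compare_models().items():
        print(f"{name}: AUC = {value:.3f}")
```

In practice the penalty strength and tree hyperparameters would be chosen by cross-validation rather than fixed as they are here.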
Results
Our predictive model for 1-year hospitalization achieved an AUC of 0.803, with a sensitivity of 72% and a specificity of 76% at the optimum threshold of 0.463. Our predictive model for 2-year hospitalization achieved an AUC of 0.793, with a sensitivity of 76% and a specificity of 71% at the optimum threshold of 0.452. For predicting both 1-year and 2-year risk of preventable hospitalization, our best-performing models used logistic regression with LASSO penalty, which outperformed more black-box machine learning models such as RF and GBM.
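The metrics reported above (AUC, and sensitivity and specificity at a chosen probability threshold) can be computed as in the following minimal sketch. The abstract does not state how the optimum threshold was selected; maximizing Youden's J (sensitivity + specificity - 1) is assumed here as one common choice. The function names and the toy labels and scores are hypothetical.

```python
from typing import Sequence, Tuple


def sens_spec(y_true: Sequence[int], scores: Sequence[float],
              threshold: float) -> Tuple[float, float]:
    """Sensitivity and specificity when predicting positive if score >= threshold."""
    pairs = list(zip(y_true, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    # Ties between a positive and a negative score count as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def youden_threshold(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Threshold among the observed scores that maximizes Youden's J (assumed criterion)."""
    best_t, best_j = None, float("-inf")
    for t in sorted(set(scores)):
        sens, spec = sens_spec(y_true, scores, t)
        j = sens + spec - 1.0
        if j > best_j:
            best_t, best_j = t, j
    return best_t


# Toy data for illustration only (not from the study).
y_true = [0, 0, 0, 0, 1, 0, 1, 1, 0, 1]
scores = [0.10, 0.20, 0.30, 0.35, 0.40, 0.45, 0.60, 0.70, 0.80, 0.90]

if __name__ == "__main__":
    t = youden_threshold(y_true, scores)
    sens, spec = sens_spec(y_true, scores, t)
    print(f"AUC={auc(y_true, scores):.3f} threshold={t} "
          f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

Other selection criteria, such as fixing a minimum sensitivity, would generally give a different threshold and a different sensitivity and specificity trade-off.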
Conclusions
Our study demonstrates the feasibility of identifying late middle-aged adults with depression at higher risk of future hospitalization due to the burden of chronic illnesses, using basic demographic information and diagnosis codes recorded in health insurance claims. Identifying this population may help health care planners develop effective screening strategies and management approaches and allocate public healthcare resources efficiently as this population transitions to publicly funded healthcare programs, e.g., Medicare in the US.
Funder
National Institute of Mental Health, United States
Publisher
Springer Science and Business Media LLC