Predicting Depression Risk in Patients With Cancer Using Multimodal Data: Algorithm Development Study

Published: 2024-01-18; Volume: 12; Page: e51925
ISSN: 2291-9694
Container-title: JMIR Medical Informatics
Language: en
Short-container-title: JMIR Med Inform

Author:

de Hond Anne, van Buchem Marieke, Fanconi Claudio, Roy Mohana, Blayney Douglas, Kant Ilse, Steyerberg Ewout, Hernandez-Boussard Tina

Abstract

Background: Patients with cancer starting systemic treatment programs, such as chemotherapy, often develop depression. A prediction model may assist physicians and health care workers in the early identification of these vulnerable patients.

Objective: This study aimed to develop a prediction model for depression risk within the first month of cancer treatment.

Methods: We included 16,159 patients diagnosed with cancer starting chemo- or radiotherapy treatment between 2008 and 2021. Machine learning models (eg, least absolute shrinkage and selection operator [LASSO] logistic regression) and natural language processing models (Bidirectional Encoder Representations from Transformers [BERT]) were used to develop multimodal prediction models using both electronic health record data and unstructured text (patient emails and clinician notes). Model performance was assessed in an independent test set (n=5387, 33%) using area under the receiver operating characteristic curve (AUROC), calibration curves, and decision curve analysis to assess potential clinical usefulness.

Results: Among 16,159 patients, 437 (2.7%) received a depression diagnosis within the first month of treatment. The LASSO logistic regression models based on the structured data (AUROC 0.74, 95% CI 0.71-0.78) and structured data with email classification scores (AUROC 0.74, 95% CI 0.71-0.78) had the best discriminative performance. The BERT models based on clinician notes and structured data with email classification scores had AUROCs around 0.71. The logistic regression model based on email classification scores alone performed poorly (AUROC 0.54, 95% CI 0.52-0.56), and the model based solely on clinician notes had the worst performance (AUROC 0.50, 95% CI 0.49-0.52). Calibration was good for the logistic regression models, whereas the BERT models produced overly extreme risk estimates even after recalibration. The best-performing model showed promising clinical usefulness only within a small range of decision thresholds. The risks were underestimated for female and Black patients.

Conclusions: The results demonstrated the potential and limitations of machine learning and multimodal models for predicting depression risk in patients with cancer. Future research is needed to further validate these models, refine the outcome label and predictors related to mental health, and address biases across subgroups.

Publisher

JMIR Publications Inc.

Cited by 4 articles.

1. Applying natural language processing to patient messages to identify depression concerns in cancer patients;Journal of the American Medical Informatics Association;2024-07-17

2. Exploring the Nexus of Inflammation, Depression and Pancreatic Cancer Through Machine Learning (Preprint);2024-07-17

3. COMORBIDITY IN ONCOLOGY: MODERN CHALLENGES AND THE SEARCH FOR WAYS TO SOLVE THE PROBLEM;Clinical and Preventive Medicine;2024-05-08

4. Navigating the Intersection of Technology and Depression Precision Medicine;Advances in Experimental Medicine and Biology;2024