Evolution of Breast Cancer Recurrence Risk Prediction: A Systematic Review of Statistical and Machine Learning

Evolution of Breast Cancer Recurrence Risk Prediction: A Systematic Review of Statistical and Machine Learning–Based Models

Published:2023-08 Issue:7 Volume: Page:
ISSN:2473-4276
Container-title:JCO Clinical Cancer Informatics
language:en
Short-container-title:JCO Clinical Cancer Informatics

Author:

El Haji Hasna¹²³^ORCID,Souadka Amine⁴^ORCID,Patel Bhavik N.¹²,Sbihi Nada³,Ramasamy Gokul¹²^ORCID,Patel Bhavika K.¹^ORCID,Ghogho Mounir³⁵^ORCID,Banerjee Imon¹²^ORCID

Affiliation:

1. Department of Radiology, Mayo Clinic, Phoenix, AZ

2. School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ

3. International University of Rabat, TICLab, Rabat, Morocco

4. Surgical Oncology Department, National Institute of Oncology, Mohammed V University in Rabat, Rabat, Morocco

5. University of Leeds, Faculty of Engineering, Leeds, United Kingdom

Abstract

PURPOSE Selection of appropriate adjuvant therapy to ultimately reduce the risk of breast cancer (BC) recurrence is a challenge for medical oncologists. Several automated risk prediction models have been developed using retrospective clinical data and have evolved significantly over the years in terms of predictors of recurrence, data usage, and predictive techniques (statistical/machine learning [ML]). METHODS Following PRISMA guidelines, we performed a systematic literature review of the aforementioned statistical and ML models published between January 2008 and December 2022 through searching five digital databases—PubMed, ScienceDirect, Scopus, Cochrane, and Web of Science. The comprehensive search yielded a total of 163 papers and after a screening process focusing on papers that dealt exclusively with statistical/ML methods, only 23 papers were deemed appropriate for further analysis. We benchmarked the studies on the basis of development, evaluation metrics, and validation strategy with an added emphasis on racial diversity of patients included in the studies. RESULTS In total, 30.4% of the included studies use statistical techniques, while 69.6% are ML-based. Among these, traditional ML models (support vector machines, decision tree, logistic regression, and naïve Bayes) are the most frequently used (26.1%) along with deep learning (26.1%). Deep learning and ensemble learning provide the most accurate predictions (AUC = 0.94 each). CONCLUSION ML-based prediction models exhibit outstanding performance, yet their practical applicability might be hindered by limited interpretability and reduced generalization. Moreover, predictive models for BC recurrence often focus on limited variables related to tumor, treatment, molecular, and clinical features. Imbalanced classes and the lack of open-source data sets impede model development and validation. Furthermore, existing models predominantly overlook African and Middle Eastern populations, as they are trained and validated mainly on Caucasian and Asian patients.

Publisher

American Society of Clinical Oncology (ASCO)

Subject

General Medicine

Link

https://ascopubs.org/doi/pdfdirect/10.1200/CCI.23.00049

Reference47 articles.

1. Overall Mortality After Diagnosis of Breast Cancer in Men vs Women

2. Breast Cancer Prevention: Time for Change

3. Multidisciplinary team meeting as a highly recommended EUSOMA criteria evaluating the quality of breast cancer management between centers

4. Prediction of BRCA Gene Mutation in Breast Cancer Based on Deep Learning and Histopathology Images

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unveiling breast cancer risk profiles: a survival clustering analysis empowered by an online web application;Future Oncology;2023-12

2. Machine learning for risk stratification of thyroid cancer patients: a 15-year cohort study;European Archives of Oto-Rhino-Laryngology;2023-10-30