Cancer Metastasis Prediction and Genomic Biomarker Identification through Machine Learning and eXplainable Artificial Intelligence in Breast Cancer Research
-
Published:2023-10-26
Issue:21
Volume:13
Page:3314
-
ISSN:2075-4418
-
Container-title:Diagnostics
-
language:en
-
Short-container-title:Diagnostics
Author:
Yagin Burak1ORCID, Yagin Fatma1ORCID, Colak Cemil1ORCID, Inceoglu Feyza2, Kadry Seifedine345ORCID, Kim Jungeun6
Affiliation:
1. Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya 44280, Turkey 2. Department of Biostatistics, Faculty of Medicine, Malatya Turgut Ozal University, Malatya 44090, Turkey 3. Department of applied Data science, Noroff University College, 4612 Kristiansand, Norway 4. Artificial Intelligence Research Center (AIRC), Ajman University, Ajman 346, United Arab Emirates 5. Department of Electrical and Computer Engineering, Lebanese American University, Byblos 36, Lebanon 6. Department of Software, Kongju National University, Cheonan 31080, Republic of Korea
Abstract
Aim: Method: This research presents a model combining machine learning (ML) techniques and eXplainable artificial intelligence (XAI) to predict breast cancer (BC) metastasis and reveal important genomic biomarkers in metastasis patients. Method: A total of 98 primary BC samples was analyzed, comprising 34 samples from patients who developed distant metastases within a 5-year follow-up period and 44 samples from patients who remained disease-free for at least 5 years after diagnosis. Genomic data were then subjected to biostatistical analysis, followed by the application of the elastic net feature selection method. This technique identified a restricted number of genomic biomarkers associated with BC metastasis. A light gradient boosting machine (LightGBM), categorical boosting (CatBoost), Extreme Gradient Boosting (XGBoost), Gradient Boosting Trees (GBT), and Ada boosting (AdaBoost) algorithms were utilized for prediction. To assess the models’ predictive abilities, the accuracy, F1 score, precision, recall, area under the ROC curve (AUC), and Brier score were calculated as performance evaluation metrics. To promote interpretability and overcome the “black box” problem of ML models, a SHapley Additive exPlanations (SHAP) method was employed. Results: The LightGBM model outperformed other models, yielding remarkable accuracy of 96% and an AUC of 99.3%. In addition to biostatistical evaluation, in XAI-based SHAP results, increased expression levels of TSPYL5, ATP5E, CA9, NUP210, SLC37A1, ARIH1, PSMD7, UBQLN1, PRAME, and UBE2T (p ≤ 0.05) were found to be associated with an increased incidence of BC metastasis. Finally, decreased levels of expression of CACTIN, TGFB3, SCUBE2, ARL4D, OR1F1, ALDH4A1, PHF1, and CROCC (p ≤ 0.05) genes were also determined to increase the risk of metastasis in BC. Conclusion: The findings of this study may prevent disease progression and metastases and potentially improve clinical outcomes by recommending customized treatment approaches for BC patients.
Funder
Technology Development Program of MSS research grant of the Kongju National University
Subject
Clinical Biochemistry
Reference47 articles.
1. Hamood, H. (2020). Adverse Conditions in Breast Cancer Survivors: Incidence, Determinants, and Effect on Quality of Life, University of Haifa. 2. Breast cancer early detection: A phased approach to implementation;Ginsburg;Cancer,2020 3. Mohammadi, M., Mohammadi, S., Hadizadeh, H., Olfati, M., Moradi, F., Tanzifi, G., and Ghaderi, S. (2023). Brain metastases from breast cancer using magnetic resonance imaging: A systematic review. J. Med. Radiat. Sci., 1–8. 4. Bioinformatics identified 17 immune genes as prognostic biomarkers for breast cancer: Application study based on artificial intelligence algorithms;Zhang;Front. Oncol.,2020 5. Kudela, E., Samec, M., Kubatka, P., Nachajova, M., Laucekova, Z., Liskova, A., Dokus, K., Biringer, K., Simova, D., and Gabonova, E. (2019). Breast cancer in young women: Status quo and advanced disease management by a predictive, preventive, and personalized approach. Cancers, 11.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|