Digital medicine and the curse of dimensionality-Reference-Cited by-同舟云学术

Digital medicine and the curse of dimensionality

Published:2021-10-28 Issue:1 Volume:4 Page:
ISSN:2398-6352
Container-title:npj Digital Medicine
language:en
Short-container-title:npj Digit. Med.

Author:

Berisha Visar^ORCID,Krantsevich Chelsea^ORCID,Hahn P. Richard,Hahn Shira,Dasarathy Gautam,Turaga Pavan,Liss Julie

Abstract

AbstractDigital health data are multimodal and high-dimensional. A patient’s health state can be characterized by a multitude of signals including medical imaging, clinical variables, genome sequencing, conversations between clinicians and patients, and continuous signals from wearables, among others. This high volume, personalized data stream aggregated over patients’ lives has spurred interest in developing new artificial intelligence (AI) models for higher-precision diagnosis, prognosis, and tracking. While the promise of these algorithms is undeniable, their dissemination and adoption have been slow, owing partially to unpredictable AI model performance once deployed in the real world. We posit that one of the rate-limiting factors in developing algorithms that generalize to real-world scenarios is the very attribute that makes the data exciting—their high-dimensional nature. This paper considers how the large number of features in vast digital health data can challenge the development of robust AI models—a phenomenon known as “the curse of dimensionality” in statistical learning theory. We provide an overview of the curse of dimensionality in the context of digital health, demonstrate how it can negatively impact out-of-sample performance, and highlight important considerations for researchers and algorithm designers.

Funder

U.S. Department of Health & Human Services | National Institutes of Health

United States Department of Defense | United States Navy | Office of Naval Research

U.S. Department of Health & Human Services | NIH | National Institute on Deafness and Other Communication Disorders

Publisher

Springer Science and Business Media LLC

Subject

Health Information Management,Health Informatics,Computer Science Applications,Medicine (miscellaneous)

Link

https://www.nature.com/articles/s41746-021-00521-5.pdf

Reference51 articles.

1. Food and Drug Administration. Proposed regulatory framework for modifications to artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD). https://www.regulations.gov/document/FDA-2019-N-1185-0001 (2019).

2. Topol, E. J. High-performance medicine: the convergence of human and artificial intelligence. Nat. Med. 25, 44–56 (2019).

3. Ross, C. & Swetlitz, I. IBM’s Watson supercomputer recommended ‘unsafe and incorrect’ cancer treatments, internal documents show. Stat News. https://www.statnews.com/2018/07/25/ibm-watson-recommended-unsafe-incorrect-treatments/ (2018).

4. Koutroumbas, K. & Theodoridis, S. Pattern Recognition (4th Ed.). (Elsevier Inc., Burlington, 2009).

5. Verma, M., Hontecillas, R., Tubau-Juni, N., Abedi, V. & Bassaganya-Riera, J. Challenges in personalized nutrition and health. Front. Nutr. 5, 117 (2018).

Cited by 144 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Why consider quantum instead classical pattern recognition techniques?;Applied Soft Computing;2024-11

2. The challenges of using machine learning models in psychiatric research and clinical practice;European Neuropsychopharmacology;2024-11

3. Negligible effect of brain MRI data preprocessing for tumor segmentation;Biomedical Signal Processing and Control;2024-10

4. Machine learning applications in preventive healthcare: A systematic literature review on predictive analytics of disease comorbidity from multiple perspectives;Artificial Intelligence in Medicine;2024-10

5. Artificial intelligence in metabolomics: a current review;TrAC Trends in Analytical Chemistry;2024-09