Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects

Published: 2024-04-23, Volume: 132, Issue: 9, Pages: 3753-3769
ISSN: 0920-5691
Container title: International Journal of Computer Vision
Language: en
Short container title: Int J Comput Vis

Author:

Warner Elisa, Lee Joonsang, Hsu William, Syeda-Mahmood Tanveer, Kahn Charles E., Gevaert Olivier, Rao Arvind

Abstract

Machine learning (ML) applications in medical artificial intelligence (AI) systems have shifted from traditional and statistical methods to increasing application of deep learning models. This survey navigates the current landscape of multimodal ML, focusing on its profound impact on medical image analysis and clinical decision support systems. Emphasizing challenges and innovations in addressing multimodal representation, fusion, translation, alignment, and co-learning, the paper explores the transformative potential of multimodal models for clinical predictions. It also highlights the need for principled assessments and practical implementation of such models, bringing attention to the dynamics between decision support systems and healthcare providers and personnel. Despite advancements, challenges such as data biases and the scarcity of “big data” in many biomedical domains persist. We conclude with a discussion on principled innovation and collaborative efforts to further the mission of seamless integration of multimodal ML models into biomedical practice.

Funder

Foundation for the National Institutes of Health

Center for Strategic Scientific Initiatives, National Cancer Institute

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11263-024-02032-8.pdf

References (97 articles)

1. Abdar, M., Samami, M., Mahmoodabad, S. D., Doan, T., Mazoure, B., Hashemifesharaki, R., Liu, L., Khosravi, A., Acharya, U. R., Makarenkov, V., & Nahavandi, S. (2021). Uncertainty quantification in skin cancer classification using three-way decision-based Bayesian deep learning. Computers in Biology and Medicine, 135, 104418. https://doi.org/10.1016/j.compbiomed.2021.104418

2. Adamson, A. S., & Welch, H. G. (2019). Machine learning and the cancer-diagnosis problem—No gold standard. New England Journal of Medicine, 381(24), 2285–2287. https://doi.org/10.1056/nejmp1907407

3. Ancker, J. S., Edwards, A., Nosal, S., Hauser, D., Mauer, E., & Kaushal, R. (2017). Effects of workload, work complexity, and repeated alerts on alert fatigue in a clinical decision support system. BMC Medical Informatics and Decision Making. https://doi.org/10.1186/s12911-017-0430-8

4. Azcona, E. A., Besson, P., Wu, Y., Punjabi, A., Martersteck, A., Dravid, A., Parrish, T. B., Bandt, S. K., & Katsaggelos, A. K. (2020). Interpretation of brain morphology in association to Alzheimer’s disease dementia classification using graph convolutional networks on triangulated meshes. In Shape in medical imaging (pp. 95–107). Springer. https://doi.org/10.1007/978-3-030-61056-2_8

5. Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. In Y. Bengio & Y. LeCun (Eds.), 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference track proceedings. arXiv:1409.0473.