Reporting quality of studies using machine learning models for medical diagnosis: a systematic review

Published: 2020-03 Issue: 3 Volume: 10 Page: e034568
ISSN: 2044-6055
Container-title: BMJ Open
Language: en
Short-container-title: BMJ Open

Authors:

Yusuf Mohamed, Atal Ignacio, Li Jacques, Smith Philip, Ravaud Philippe, Fergie Martin, Callaghan Michael, Selfe James

Abstract

Aims: We conducted a systematic review assessing the reporting quality of studies validating models based on machine learning (ML) for clinical diagnosis, with a specific focus on the reporting of information about the participants on whom the diagnostic task was evaluated.

Method: Medline Core Clinical Journals were searched for studies published between July 2015 and July 2018. Two reviewers independently screened the retrieved articles, and a third reviewer resolved any discrepancies. An extraction list was developed from the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) guideline. Two reviewers independently extracted the data from the eligible articles. Third and fourth reviewers checked and verified the extracted data and resolved any discrepancies between the reviewers.

Results: The search yielded 161 papers, of which 28 met the eligibility criteria. Details of the data source were reported in 24 of the 28 papers. For all of the papers, the set of patients on which the ML-based diagnostic system was evaluated was partitioned from a larger dataset, and the method for deriving this set was always reported. Information on the diagnostic/non-diagnostic classification was reported well (23/28). The least reported items were the use of a reporting guideline (0/28), distribution of disease severity (8/28), patient flow diagram (10/28) and distribution of alternative diagnosis (10/28). A large proportion of studies (23/28) had a delay between the conduct of the reference standard and ML tests, while one study did not and four studies were unclear. For 15 studies, it was unclear whether the evaluation group corresponded to the setting in which the ML test will be applied.

Conclusion: All studies in this review failed to use reporting guidelines, and a large proportion of them lacked adequate detail on participants, making it difficult to replicate, assess and interpret study findings.

PROSPERO registration number: CRD42018099167.

Publisher

BMJ

Subject

General Medicine

References (34 articles)

1. Neural networks and statistical techniques: A review of applications

2. Cleophas TJ, Zwinderman AH. Machine Learning in Medicine - a Complete Overview. Springer International Publishing, 2015.

3. High-performance medicine: the convergence of human and artificial intelligence

4. Dermatologist-level classification of skin cancer with deep neural networks

5. Electrodiagnosis support system for localizing neural injury in an upper limb

Cited by 76 articles.

1. Machine learning-based prognostic model for 30-day mortality prediction in Sepsis-3; BMC Medical Informatics and Decision Making; 2024-09-09

2. Using machine learning methods to predict all-cause somatic hospitalizations in adults: A systematic review; PLOS ONE; 2024-08-23

3. A Novel Machine-Learning Algorithm to Predict the Early Termination of Nutrition Support Team Follow-Up in Hospitalized Adults: A Retrospective Cohort Study; Nutrients; 2024-07-31

4. Evaluating artificial intelligence for medical imaging: a primer for clinicians; British Journal of Hospital Medicine; 2024-07-30

5. A Responsible Framework for Applying Artificial Intelligence on Medical Images and Signals at the Point of Care: The PACS-AI Platform; Canadian Journal of Cardiology; 2024-06