Development of machine learning models for predicting acute respiratory distress syndrome:evidence from the MIMIC-III and MIMIC-IV

Author:

Yang MingKun1,Hu WeiHang2,Yan Jing2

Affiliation:

1. Department of Second Clinical Medical College, Zhejiang Chinese Medicine University

2. Department of Critical Care Medicine, Zhejiang Hospital

Abstract

Abstract Background Acute Respiratory Distress Syndrome (ARDS) is a prevalent condition in the ICU with a mortality rate of 27% to 45%. Despite the Berlin definition being the current diagnostic standard, it has significant limitations. This study aims to establish and validate a novel machine learning-based prediction model for ARDS in ICU patients. Methods The data of suspected ARDS patients was extracted from the Medical Information Mart for Intensive Care (MIMIC)-III and MIMIC-IV databases. Ten-fold cross-validation was employed, utilizing machine learning algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), Decision Tree Classifier (DTC), random forest (RF), extreme gradient boosting (XGBoost), light gradient boosting (LightGB), and categorical boosting (CatBoost) and logistic regression (LR) for model construction. Finally, the performance of these models was evaluated based on metrics including area under the ROC curve, calibration curve, and clinical decision curve. Results A total of 2,852 patients who met the exclusion criteria were included in the final study, of which 2078 patients developed ARDS.We established scoring models, such as LR, KNN, SVM, DTC, RF, XGBoost, LightGB, and CatBoost. The area under the receiver operating characteristic curve (AUC) values for each model were as follows: LR - 0.664, KNN - 0.692, SVM - 0.567, DTC - 0.709, RF - 0.732, XGBoost - 0.793, LightGB - 0.793, and CatBoost - 0.817. Notably, CatBoost exhibited superior predictive performance in discrimination, calibration, and clinical applicability compared to all other models. Conclusions The application of machine learning models has showcased their robustness in predicting ARDS. Notably, the CatBoost algorithm emerges as the most promising in terms of predictive performance.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3