A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19

Author:

Murri Rita,Lenkowicz Jacopo,Masciocchi Carlotta,Iacomini Chiara,Fantoni Massimo,Damiani Andrea,Marchetti Antonio,Sergi Paolo Domenico Angelo,Arcuri Giovanni,Cesario Alfredo,Patarnello Stefano,Antonelli Massimo,Bellantone Rocco,Bernabei Roberto,Boccia Stefania,Calabresi Paolo,Cambieri Andrea,Cauda Roberto,Colosimo Cesare,Crea Filippo,De Maria Ruggero,De Stefano Valerio,Franceschi Francesco,Gasbarrini Antonio,Parolini Ornella,Richeldi Luca,Sanguinetti Maurizio,Urbani Andrea,Zega Maurizio,Scambia Giovanni,Valentini Vincenzo,Armuzzi Alessandro,Barba Marta,Baroni Silvia,Bellesi Silvia,Bentivoglio Annarita,Biasucci Luigi Marzio,Biscetti Federico,Candelli Marcello,Capalbo Gennaro,Cattani Paola,Chiusolo Patrizia,Cingolani Antonella,Corbo Giuseppe,Covino Marcello,Cozzolino Angela Maria,D’Alfonso Marilena,De Angelis Giulia,De Pascale Gennaro,Frisullo Giovanni,Gabrielli Maurizio,Gambassi Giovanni,Garcovich Matteo,Gremese Elisa,Grieco Domenico Luca,Iaconelli Amerigo,Iorio Raffaele,Landi Francesco,Larici Annarita,Liuzzo Giovanna,Maviglia Riccardo,Miele Luca,Montalto Massimo,Natale Luigi,Nicolotti Nicola,Ojetti Veronica,Pompili Maurizio,Posteraro Brunella,Rapaccini Gianni,Rinaldi Riccardo,Rossi Elena,Santoliquido Angelo,Sica Simona,Tamburrini Enrica,Teofili Luciana,Testa Antonia,Tosoni Alberto,Trani Carlo,Varone Francesco,Verme Lorenzo Zileri Dal,

Abstract

AbstractThe COVID-19 pandemic is impressively challenging the healthcare system. Several prognostic models have been validated but few of them are implemented in daily practice. The objective of the study was to validate a machine-learning risk prediction model using easy-to-obtain parameters to help to identify patients with COVID-19 who are at higher risk of death. The training cohort included all patients admitted to Fondazione Policlinico Gemelli with COVID-19 from March 5, 2020, to November 5, 2020. Afterward, the model was tested on all patients admitted to the same hospital with COVID-19 from November 6, 2020, to February 5, 2021. The primary outcome was in-hospital case-fatality risk. The out-of-sample performance of the model was estimated from the training set in terms of Area under the Receiving Operator Curve (AUROC) and classification matrix statistics by averaging the results of fivefold cross validation repeated 3-times and comparing the results with those obtained on the test set. An explanation analysis of the model, based on the SHapley Additive exPlanations (SHAP), is also presented. To assess the subsequent time evolution, the change in paO2/FiO2 (P/F) at 48 h after the baseline measurement was plotted against its baseline value. Among the 921 patients included in the training cohort, 120 died (13%). Variables selected for the model were age, platelet count, SpO2, blood urea nitrogen (BUN), hemoglobin, C-reactive protein, neutrophil count, and sodium. The results of the fivefold cross-validation repeated 3-times gave AUROC of 0.87, and statistics of the classification matrix to the Youden index as follows: sensitivity 0.840, specificity 0.774, negative predictive value 0.971. Then, the model was tested on a new population (n = 1463) in which the case-fatality rate was 22.6%. The test model showed AUROC 0.818, sensitivity 0.813, specificity 0.650, negative predictive value 0.922. Considering the first quartile of the predicted risk score (low-risk score group), the case-fatality rate was 1.6%, 17.8% in the second and third quartile (high-risk score group) and 53.5% in the fourth quartile (very high-risk score group). The three risk score groups showed good discrimination for the P/F value at admission, and a positive correlation was found for the low-risk class to P/F at 48 h after admission (adjusted R-squared = 0.48). We developed a predictive model of death for people with SARS-CoV-2 infection by including only easy-to-obtain variables (abnormal blood count, BUN, C-reactive protein, sodium and lower SpO2). It demonstrated good accuracy and high power of discrimination. The simplicity of the model makes the risk prediction applicable for patients in the Emergency Department, or during hospitalization. Although it is reasonable to assume that the model is also applicable in not-hospitalized persons, only appropriate studies can assess the accuracy of the model also for persons at home.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Cited by 15 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3