Machine learning driven identification of gene-expression signatures correlated with multiple organ dysfunction trajectories and complex sub-endotypes of pediatric septic shock

Author:

Atreya Mihir R.1,Banerjee Shayantan2,Lautz Andrew J.1,Alder Matthew N.1,Varisco Brian M.1,wong hector1,Muszynski Jennifer A.3,Hall Mark W.3,Sanchez-Pinto L. Nelson4,Kamaleswaran Rishikesan5

Affiliation:

1. Cincinnati Children's Hospital Medical Center, Cincinnati Children's Research Foundation

2. Indian Institute of Technology Madras

3. Nationwide Children’s Hospital

4. Northwestern University Feinberg School of Medicine

5. Emory University School of Medicine

Abstract

Abstract Background Multiple organ dysfunction syndrome (MODS) disproportionately drives sepsis morbidity and mortality among children. The biology of this heterogeneous syndrome is complex, dynamic, and incompletely understood. Gene expression signatures correlated with MODS trajectories may facilitate identification of molecular targets and predictive enrichment. Methods Secondary analyses of publicly available datasets. (1) Supervised machine learning (ML) was used to identify genes correlated with persistent MODS relative to those without in the derivation cohort. Model performances were tested across 4 validation cohorts, among children and adults with differing inciting cause for organ dysfunctions, to identify a stable set of genes and fixed classification model to reliably estimate the risk of MODS. Clinical propensity scores, where available, were used to enhance model performance. (2) We identified organ-specific dysfunction signatures by eliminating redundancies between the shared MODS signature and those of individual organ dysfunctions. (3) Finally, novel patient subclasses were identified through unsupervised hierarchical clustering of genes correlated with persistent MODS and compared with previously established pediatric septic shock endotypes. Results 568 genes were differentially expressed, among which ML identified 109 genes that were consistently correlated with persistent MODS. The AUROC of a model that incorporated the stable features chosen from repeated cross-validation experiments to estimate risk of MODS was 0.87 (95% CI: 0.85–0.88). Model performance using the top 20 genes and an ExtraTree classification model yielded AUROCs ranging 0.77–0.96 among validation cohorts. Genes correlated with day 3 and 7 cardiovascular, respiratory, and renal dysfunctions were identified. Finally, the top 50 genes were used to discover four novel subclasses, of which patients belonging to M1 and M2 had the worst clinical outcomes. Reactome pathway analyses revealed a potential role of transcription factor RUNX1 in distinguishing subclasses. Interaction with receipt of adjuvant steroids suggested that newly derived M1 and M2 endotypes were biologically distinct relative to established endotypes. Conclusions Our data suggest the existence of complex sub-endotypes among children with septic shock wherein overlapping biological pathways may be linked to differential response to therapies. Future studies in cohorts enriched for patients with MODS may facilitate discovery and development of disease modifying therapies for subsets of critically ill children with sepsis.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3