Use of feature importance statistics to accurately predict asthma attacks using machine learning: A cross-sectional cohort study of the US population

Author:

Huang Alexander A.,Huang Samuel Y.ORCID

Abstract

Background Asthma attacks are a major cause of morbidity and mortality in vulnerable populations, and identification of associations with asthma attacks is necessary to improve public awareness and the timely delivery of medical interventions. Objective The study aimed to identify feature importance of factors associated with asthma in a representative population of US adults. Methods A cross-sectional analysis was conducted using a modern, nationally representative cohort, the National Health and Nutrition Examination Surveys (NHANES 2017–2020). All adult patients greater than 18 years of age (total of 7,922 individuals) with information on asthma attacks were included in the study. Univariable regression was used to identify significant nutritional covariates to be included in a machine learning model and feature importance was reported. The acquisition and analysis of the data were authorized by the National Center for Health Statistics Ethics Review Board. Results 7,922 patients met the inclusion criteria in this study. The machine learning model had 55 out of a total of 680 features that were found to be significant on univariate analysis (P<0.0001 used). In the XGBoost model the model had an Area Under the Receiver Operator Characteristic Curve (AUROC) = 0.737, Sensitivity = 0.960, NPV = 0.967. The top five highest ranked features by gain, a measure of the percentage contribution of the covariate to the overall model prediction, were Octanoic Acid intake as a Saturated Fatty Acid (SFA) (gm) (Gain = 8.8%), Eosinophil percent (Gain = 7.9%), BMXHIP–Hip Circumference (cm) (Gain = 7.2%), BMXHT–standing height (cm) (Gain = 6.2%) and HS C-Reactive Protein (mg/L) (Gain 6.1%). Conclusion Machine Learning models can additionally offer feature importance and additional statistics to help identify associations with asthma attacks.

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference53 articles.

1. Is the prevalence of asthma declining? Systematic review of epidemiological studies;C Anandan;Allergy,2010

2. Predicting frequent asthma exacerbations using blood eosinophil count and other patient data routinely available in clinical practice;D Price;J Asthma Allergy,2016

3. Identifying Risk of Future Asthma Attacks Using UK Medical Record Data: A Respiratory Effectiveness Group Initiative;JD Blakey;J Allergy Clin Immunol Pract,2017

4. The use of administrative data to risk-stratify asthmatic patients;J Grana;Am J Med Qual,1997

5. The epidemiology, healthcare and societal burden and costs of asthma in the UK and its member nations: analyses of standalone and linked national databases.;M Mukherjee;BMC Med.,2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3