Classification and prediction of spinal disease based on the SMOTE-RFE-XGBoost model

Author:

Zhang Biao1,Dong Xinyan2,Hu Yuwei2,Jiang Xuchu2,Li Gongchi3

Affiliation:

1. School of Computer Science, Liaocheng University, Liaocheng, Shandong, China

2. School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, Hubei, China

3. Union Hospital Affiliated to Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

Abstract

Spinal diseases are killers that cause long-term disturbance to people with complex and diverse symptoms and may cause other conditions. At present, the diagnosis and treatment of the main diseases mainly depend on the professional level and clinical experience of doctors, which is a breakthrough problem in the field of medicine. This article proposes the SMOTE-RFE-XGBoost model, which takes the physical angle of human bone as the research index for feature selection and classification model construction to predict spinal diseases. The research process is as follows: two groups of people with normal and abnormal spine conditions are taken as the research objects of this article, and the synthetic minority oversampling technique (SMOTE) algorithm is used to address category imbalance. Three methods, least absolute shrinkage and selection operator (LASSO), tree-based feature selection, and recursive feature elimination (RFE), are used for feature selection. Logistic regression (LR), support vector machine (SVM), parsimonious Bayes, decision tree (DT), random forest (RF), gradient boosting tree (GBT), extreme gradient boosting (XGBoost), and ridge regression models are used to classify the samples, construct single classification models and combine classification models and rank the feature importance. According to the accuracy and mean square error (MSE) values, the SMOTE-RFE-XGBoost combined model has the best classification, with accuracy, MSE and F1 values of 97.56%, 0.1111 and 0.8696, respectively. The importance of four indicators, lumbar slippage, cervical tilt, pelvic radius and pelvic tilt, was higher.

Funder

Natural Science Foundations of Shandong Province

“Guangyue Young Scholar Innovation Team” of Liaocheng University

Publisher

PeerJ

Subject

General Computer Science

Reference27 articles.

1. Natural history of the aging spine: a cross-sectional analysis of spinopelvic parameters in the asymptomatic population;Attiah;Journal of Neurosurgery: Spine,2019

2. Diseases of The Spine and Spinal Cord

3. Machine learning in orthopedics: a literature review;Cabitza;Frontiers in Bioengineering and Biotechnology,2018

4. User online purchase behavior prediction based on fusion model of CatBoost and Logit;Cao;Journal of Physics: Conference Series. IOP Publishing,2021

5. SMOTE: synthetic minority over-sampling technique;Chawla;Journal of Artificial Intelligence Research,2002

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3