Predictive model and risk analysis for diabetic retinopathy using machine learning: a retrospective cohort study in China

Authors:

Li Wanyue, Song Yanan, Chen Kang, Ying Jun, Zheng Zhong, Qiao Shen, Yang Ming, Zhang Maonian, Zhang Ying

Abstract

Objective: To investigate diabetic retinopathy (DR) risk factors and predictive models using machine learning on a large-sample dataset.

Design: Retrospective study based on a large-sample, high-dimensional database.

Setting: A Chinese central tertiary hospital in Beijing.

Participants: Information on 32 452 inpatients with type 2 diabetes mellitus (T2DM) was retrieved from the electronic medical record system for the period from 1 January 2013 to 31 December 2017.

Methods: Sixty variables (including demographic information, physical and laboratory measurements, systemic diseases and insulin treatments) were retained for baseline analysis. The optimal 17 variables were selected by recursive feature elimination. The prediction model was built with the XGBoost algorithm and compared with three other popular machine learning techniques: logistic regression, random forest and support vector machine. To explain the results of the XGBoost model more visually, the SHapley Additive exPlanations (SHAP) method was used.

Results: DR occurred in 2038 (6.28%) T2DM patients. The XGBoost model was identified as the best prediction model, with the highest area under the curve (AUC, 0.90), and showed that an HbA1c value greater than 8%, nephropathy, a serum creatinine value greater than 100 µmol/L, insulin treatment and diabetic lower extremity arterial disease were associated with an increased risk of DR. Age over 65 was associated with a decreased risk of DR.

Conclusions: With better overall performance, the XGBoost model was highly reliable for assessing risk indicators of DR. The SHAP method can identify the most critical risk factors for DR and their cut-off values, making the output of the XGBoost model clinically interpretable.
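The abstract ranks models by AUC, so a small worked example of that metric may help. The sketch below is an illustration only, not the authors' code: it computes AUC as the fraction of (DR, non-DR) patient pairs in which the DR patient receives the higher predicted risk, counting ties as half. The function name `roc_auc` and the toy labels and scores are hypothetical and are not taken from the study.

```python
from itertools import product


def roc_auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    A pair counts 1.0 if the positive case has the higher score,
    0.5 if the two scores tie, and 0.0 otherwise.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = DR, 0 = no DR; scores are made-up predicted risks.
labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.8, 0.5, 0.3, 0.3, 0.2, 0.1, 0.1]

auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

Read this way, an AUC of 0.90 means that a randomly chosen patient with DR would be scored above a randomly chosen patient without DR about 90% of the time. The pairwise form is quadratic in the number of patients, so it suits small illustrations; a rank-based computation is the usual choice for a cohort of this size.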

Funder

Chinese PLA general hospital medical big data program

Publisher

BMJ

Subject

General Medicine

Cited by 19 articles.