Prediction Model for Lymph Node Metastasis in Papillary Thyroid Carcinoma Based on Electronic Medical Records

Author:

Zhang JingWen1,Zhang XiaoWen2,Xia ShuJun1,Dong YiJie1,Zhou Wei1,Liu ZhenHua1,Zhang Lu1,Zhan WeiWei1,Sun YuZhong2,Zhou JianQiao1

Affiliation:

1. Ruijin Hospital, Shanghai Jiaotong University School of Medicine

2. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080

Abstract

Abstract Purpose This study aimed to establish a novel machine learning model for predicting lymph node metastasis(LNM)of patients with papillary thyroid carcinoma (PTC) by utilizing personal electronic medical records (EMR) data. Methods The study included 5076 PTC patients underwent total thyroidectomy or lobectomy with lymph node dissection. Based on the integrated learning approach, this study designed a predictive model for LNM. The predictive model employs deep neural network (DNN) models to identify features within cases and vectorize clinical data from electronic medical records into feature matrices. Subsequently, a classifier based on machine learning algorithms is designed to analyse the feature matrices for prediction LNM in PTC. To mitigate the risk of overfitting commonly associated with machine learning algorithms processing high-dimensional matrices, multiple DNNS are utilized to distribute the overfitting risk. Five mainstream machine learning algorithms (NB, DT, XGB, GBM, RDF) are tested as classifier algorithms in the predictive model. Model performance is assessed using precision, recall, F1, and AUC. Results Among the patients, 2,261 had lymph node metastasis (LNM), with 2,196 displaying central lymph node metastasis (CLNM) and 472 exhibiting lateral cervical lymph node metastasis (LLNM). The RDF model showcased superior predictive performance compared to other models, achieving a testing AUC of 0.98, precision of 0.98, recall of 0.95, and F1 value of 0.97 in predicting LNM. Moreover, it attained an AUC of 0.98, precision of 0.98, recall of 0.94, and an F1 value of 0.96 in predicting CLNM. Regarding the weighting of the feature matrix for various case data types, gender and multi-focus held higher weights, at 1.24 and 1.23 respectively. Conclusion The LNM predictive model proposed in this study could be used as a cost-effective tool for predicting LNM in PTC patients, by utilizing easily available personal electronic medical data, which can provide valuable support to surgeons in devising a personalized treatment plan.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3