MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records-Reference-Cited by-同舟云学术

MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records

Published:2024-04-05 Issue:2 Volume:16 Page:489-502
ISSN:1913-2751
Container-title:Interdisciplinary Sciences: Computational Life Sciences
language:en
Short-container-title:Interdiscip Sci Comput Life Sci

Author:

Du Haoze,Xu Jiahao,Du Zhiyong,Chen Lihui,Ma Shaohui,Wei Dongqing,Wang Xianfang^ORCID

Abstract

AbstractTo address the problem of poor entity recognition performance caused by the lack of Chinese annotation in clinical electronic medical records, this paper proposes a multi-medical entity recognition method F-MNER using a fusion technique combining BART, Bi-LSTM, and CRF. First, after cleaning, encoding, and segmenting the electronic medical records, the obtained semantic representations are dynamically fused using a bidirectional autoregressive transformer (BART) model. Then, sequential information is captured using a bidirectional long short-term memory (Bi-LSTM) network. Finally, the conditional random field (CRF) is used to decode and output multi-task entity recognition. Experiments are performed on the CCKS2019 dataset, with micro avg Precision, macro avg Recall, weighted avg Precision reaching 0.880, 0.887, and 0.883, and micro avg F1-score, macro avg F1-score, weighted avg F1-score reaching 0.875, 0.876, and 0.876 respectively. Compared with existing models, our method outperforms the existing literature in three evaluation metrics (micro average, macro average, weighted average) under the same dataset conditions. In the case of weighted average, the Precision, Recall, and F1-score are 19.64%, 15.67%, and 17.58% higher than the existing BERT-BiLSTM-CRF model respectively. Experiments are performed on the actual clinical dataset with our MF-MNER, the Precision, Recall, and F1-score are 0.638, 0.825, and 0.719 under the micro-avg evaluation mechanism. The Precision, Recall, and F1-score are 0.685, 0.800, and 0.733 under the macro-avg evaluation mechanism. The Precision, Recall, and F1-score are 0.647, 0.825, and 0.722 under the weighted avg evaluation mechanism. The above results show that our method MF-MNER can integrate the advantages of BART, Bi-LSTM, and CRF layers, significantly improving the performance of downstream named entity recognition tasks with a small amount of annotation, and achieving excellent performance in terms of recall score, which has certain practical significance. Source code and datasets to reproduce the results in this paper are available at https://github.com/xfwang1969/MF-MNER. Graphical Abstract Illustration of the proposed MF-MNER. The method mainly includes four steps: (1) medical electronic medical records need to be cleared, coded, and segmented. (2) The semantic representation obtained by dynamic fusion of the bidirectional autoregressive converter (BART) model. (3) The sequence information is captured by a bi-directional short-term memory (Bi-LSTM) network. (4) the multi-task entity recognition is decoded and output by conditional random field (CRF).

Funder

National Natural Science Foundation of China

Intergovernmental International Scientific and Technological Innovation and Cooperation Program of The National Key R&D Program

Joint Research Funds for Medical and Engineering and Scientific Research at Shanghai Jiao Tong University

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s12539-024-00624-z.pdf

Reference37 articles.

1. Janett RS, Yeracaris PP (2020) Electronic Medical Records in the American Health System: challenges and lessons learned. Ciencia Saude Coletiva 25(4):1293–1304. https://doi.org/10.1590/1413-81232020254.28922019

2. Cerchione R, Centobelli P, Riccio E et al (2023) Blockchain’s coming to hospital to digitalize healthcare services: Designing a distributed electronic health record ecosystem. Technovation 120:102480. https://doi.org/10.1016/j.technovation.2022.102480

3. Edara DC, Vanukuri LP, Sistla V et al (2023) Sentiment analysis and text categorization of cancer medical records with LSTM. J Ambient Intell Humaniz Comput 14(5):5309–5325. https://doi.org/10.1007/s12652-019-01399-8

4. Sutton RT, Pincock D, Baumgart DC et al (2020) An overview of clinical decision support systems: benefits, risks, and strategies for success. NPJ Digit Med 3(1):17. https://doi.org/10.1038/s41746-020-0221-y

5. Desai RJ, Wang SV, Vaduganathan M et al (2020) Comparison of machine learning methods with traditional models for use of administrative claims with electronic medical records to predict heart failure outcomes. JAMA Netw Open 3(1):e1918962. https://doi.org/10.1001/jamanetworkopen.2019.18962

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Recognition of Chinese Electronic Medical Records for Rehabilitation Robots: Information Fusion Classification Strategy;Sensors;2024-08-30