Comparing neural language models for medical concept representation and patient trajectory prediction

Published: 2023-06-05

Author:

Bornet Alban, Proios Dimitrios, Yazdani Anthony, Jaume-Santero Fernando, Haller Guy, Choi Edward, Teodoro Douglas

Abstract

Effective representation of medical concepts is crucial for secondary analyses of electronic health records. Neural language models have shown promise in automatically deriving medical concept representations from clinical data. However, the comparative performance of different language models for creating these empirical representations, and the extent to which they encode medical semantics, has not been extensively studied. This study aims to address this gap by evaluating the effectiveness of three popular language models (word2vec, fastText, and GloVe) in creating medical concept embeddings. Using a large dataset of digital health records, we created patient trajectories and used them to train the language models. We then assessed the ability of the learned embeddings to encode semantics explicitly, through a comparison with biomedical terminologies, and implicitly, by predicting patient outcomes and trajectories with different degrees of information. Our qualitative analysis shows that empirical clusters of embeddings learned by fastText exhibit the highest similarity with theoretical clustering patterns obtained from biomedical terminologies, with a similarity score between empirical and theoretical clusters of 0.88, 0.80, and 0.92 for diagnosis, procedure, and medication codes, respectively. Conversely, for outcome prediction, word2vec and GloVe tend to outperform fastText, with the former achieving AUROC as high as 0.80, 0.63, and 0.88 for length-of-stay, readmission, and mortality prediction, respectively. In predicting the next steps in patient trajectories, GloVe achieves the highest performance for diagnosis and medication codes (AUPRC of 0.46 and 0.82, respectively) at the highest level of the semantic hierarchy, while fastText outperforms the other models for procedure codes (AUPRC of 0.67).
Our study demonstrates that subword information is crucial for learning medical concept representations, but global embedding vectors are better suited for downstream tasks, such as trajectory prediction. Thus, these models can be harnessed to learn representations that convey clinical meaning, and our insights highlight the potential of using machine learning techniques to semantically encode medical data.
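
To make the idea of training language models on patient trajectories more concrete, here is a minimal sketch that treats each trajectory as a "sentence" of medical codes and counts how often codes co-occur within a sliding window, which is the kind of global statistic that GloVe-style models are fit on. The abstract does not describe the study's actual preprocessing, so the code tokens, the window size, and the helper name `cooccurrence_counts` are illustrative assumptions, and the unweighted counting is a simplification (GloVe usually down-weights distant pairs).

```python
from collections import Counter

# Hypothetical toy trajectories: each patient is an ordered list of medical codes.
# These tokens are placeholders for illustration, not data from the study.
trajectories = [
    ["DIAG_A", "PROC_X", "MED_1"],
    ["DIAG_A", "MED_1", "DIAG_B"],
    ["DIAG_B", "PROC_X", "MED_2"],
]


def cooccurrence_counts(sequences, window=2):
    """Count symmetric co-occurrences of codes at most `window` positions apart."""
    counts = Counter()
    for seq in sequences:
        for i, center in enumerate(seq):
            # Look only forward; sorting the pair makes the count symmetric.
            for j in range(i + 1, min(i + 1 + window, len(seq))):
                pair = tuple(sorted((center, seq[j])))
                counts[pair] += 1
    return counts


counts = cooccurrence_counts(trajectories, window=2)
for pair, n in sorted(counts.items()):
    print(pair, n)
```

Under these assumptions, codes that tend to appear in the same trajectories (here `DIAG_A` and `MED_1`) accumulate higher counts, and an embedding model trained on such statistics would place them closer together.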

Publisher

Cold Spring Harbor Laboratory

Cited by 1 article.

1. Zero Shot Health Trajectory Prediction Using Transformer; 2024-03-04