Deep Learning Transformer Models for Building a Comprehensive and Real-time Trauma Observatory: Development and Validation Study-Reference-Cited by-同舟云学术

Deep Learning Transformer Models for Building a Comprehensive and Real-time Trauma Observatory: Development and Validation Study

Published:2023-01-12 Issue: Volume:2 Page:e40843
ISSN:2817-1705
Container-title:JMIR AI
language:en
Short-container-title:JMIR AI

Author:

Chenais Gabrielle^ORCID,Gil-Jardiné Cédric^ORCID,Touchais Hélène^ORCID,Avalos Fernandez Marta^ORCID,Contrand Benjamin^ORCID,Tellier Eric^ORCID,Combes Xavier^ORCID,Bourdois Loick^ORCID,Revel Philippe^ORCID,Lagarde Emmanuel^ORCID

Abstract

Background Public health surveillance relies on the collection of data, often in near-real time. Recent advances in natural language processing make it possible to envisage an automated system for extracting information from electronic health records. Objective To study the feasibility of setting up a national trauma observatory in France, we compared the performance of several automatic language processing methods in a multiclass classification task of unstructured clinical notes. Methods A total of 69,110 free-text clinical notes related to visits to the emergency departments of the University Hospital of Bordeaux, France, between 2012 and 2019 were manually annotated. Among these clinical notes, 32.5% (22,481/69,110) were traumas. We trained 4 transformer models (deep learning models that encompass attention mechanism) and compared them with the term frequency–inverse document frequency associated with the support vector machine method. Results The transformer models consistently performed better than the term frequency–inverse document frequency and a support vector machine. Among the transformers, the GPTanam model pretrained with a French corpus with an additional autosupervised learning step on 306,368 unlabeled clinical notes showed the best performance with a micro F1-score of 0.969. Conclusions The transformers proved efficient at the multiclass classification of narrative and medical data. Further steps for improvement should focus on the expansion of abbreviations and multioutput multiclass classification.

Publisher

JMIR Publications Inc.

Reference48 articles.

1. SurSaUD® Software: A Tool to Support the Data Management, the Analysis and the Dissemination of Results from the French Syndromic Surveillance System

2. Assessment of a Syndromic Surveillance System Based on Morbidity Data: Results from the Oscour® Network during a Heat Wave

3. Mapping influenza activity in emergency departments in France using Bayesian model-based geostatistics

4. Retrospective observational study of emergency department syndromic surveillance data during air pollution episodes across London and Paris in 2014

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluating the Capabilities of Generative AI Tools in Understanding Medical Papers: Qualitative Study;JMIR Medical Informatics;2024-09-04

2. The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review;JMIR Medical Informatics;2024-05-10

3. Harnessing Moderate-Sized Language Models for Reliable Patient Data De-identification in Emergency Department Records: An Evaluation of Strategies and Performance (Preprint);2024-02-28