Algorithmic Identification of Treatment‐Emergent Adverse Events From Clinical Notes Using Large Language Models: A Pilot Study in Inflammatory Bowel Disease

Author:

Silverman Anna L.12ORCID,Sushil Madhumita3ORCID,Bhasuran Balu3ORCID,Ludwig Dana3ORCID,Buchanan James3ORCID,Racz Rebecca4ORCID,Parakala Mahalakshmi5,El‐Kamary Samer4,Ahima Ohenewaa4,Belov Artur4ORCID,Choi Lauren4,Billings Monisha4ORCID,Li Yan4ORCID,Habal Nadia4,Liu Qi4ORCID,Tiwari Jawahar4,Butte Atul J.36ORCID,Rudrapatna Vivek A.37ORCID

Affiliation:

1. Division of Gastroenterology and Hepatology, Department of Medicine Mayo Clinic Phoenix Arizona USA

2. Department of Medicine University of California, San Diego La Jolla California USA

3. Bakar Computational Health Sciences Institute San Francisco California USA

4. United States Food and Drug Administration Silver Spring Maryland USA

5. Department of Public Health University of California, Berkeley Berkeley California USA

6. Center for Data‐Driven Insights and Innovation University of California Health Oakland California USA

7. Division of Gastroenterology and Hepatology, Department of Medicine University of California, San Francisco San Francisco California USA

Abstract

Outpatient clinical notes are a rich source of information regarding drug safety. However, data in these notes are currently underutilized for pharmacovigilance due to methodological limitations in text mining. Large language models (LLMs) like Bidirectional Encoder Representations from Transformers (BERT) have shown progress in a range of natural language processing tasks but have not yet been evaluated on adverse event (AE) detection. We adapted a new clinical LLM, University of California – San Francisco (UCSF)‐BERT, to identify serious AEs (SAEs) occurring after treatment with a non‐steroid immunosuppressant for inflammatory bowel disease (IBD). We compared this model to other language models that have previously been applied to AE detection. We annotated 928 outpatient IBD notes corresponding to 928 individual patients with IBD for all SAE‐associated hospitalizations occurring after treatment with a non‐steroid immunosuppressant. These notes contained 703 SAEs in total, the most common of which was failure of intended efficacy. Out of eight candidate models, UCSF‐BERT achieved the highest numerical performance on identifying drug‐SAE pairs from this corpus (accuracy 88–92%, macro F1 61–68%), with 5–10% greater accuracy than previously published models. UCSF‐BERT was significantly superior at identifying hospitalization events emergent to medication use (P < 0.01). LLMs like UCSF‐BERT achieve numerically superior accuracy on the challenging task of SAE detection from clinical notes compared with prior methods. Future work is needed to adapt this methodology to improve model performance and evaluation using multicenter data and newer architectures like Generative pre‐trained transformer (GPT). Our findings support the potential value of using large language models to enhance pharmacovigilance.

Funder

U.S. Department of Health and Human Services

National Institutes of Health

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3