Utilizing Open-Source Language Models and ChatGPT for Zero-Shot Identification of Drug Discontinuation Events in Online Forums: Development and Validation Study (Preprint)

Author:

Trevena William, Zhong Xiang, Alvarado Michelle, Semenov Alexander, Oktay Alp, Devlin Devin, Gohil Aarya, Chittimouju Sai Harsha

Abstract

BACKGROUND

The implementation of Transformer-based natural language processing (NLP) systems, such as BERT and GPT-4, has revolutionized the extraction of insights from unstructured text. These advancements have expanded into healthcare, where such systems are used to analyze social media for public health insights. Yet the detection of drug discontinuation events (DDEs) remains underexplored, even though identifying DDEs is crucial for understanding medication adherence and patient outcomes.

OBJECTIVE

The objective of this study is to provide a flexible framework for investigating various clinical research questions in data-sparse environments. We exemplify the utility of this framework by identifying DDEs in a publicly accessible online forum, medhelp.org, and by releasing the first open-access DDE datasets to aid further research in this domain.

METHODS

We used pre-trained Transformer-based NLP models, including ChatGPT, DeBERTa, BART, RoBERTa, DistilRoBERTa, and DistilBERT, to classify, in a zero-shot setting, whether user comments from medhelp.org describe DDEs.
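As a rough illustration of what zero-shot classification of a forum comment can look like, the sketch below uses the Hugging Face `transformers` zero-shot pipeline. The checkpoint name `facebook/bart-large-mnli`, the label wording, and the helper names `build_bart_classifier` and `is_dde` are assumptions made for this example; the abstract does not state which checkpoints, labels, or prompts the study used.

```python
from typing import Callable, Dict, List

# Hypothetical label wording; the abstract does not give the labels or prompts used.
CANDIDATE_LABELS: List[str] = [
    "the author stopped taking a medication",
    "the author did not stop taking a medication",
]


def build_bart_classifier() -> Callable[..., Dict]:
    """Load a zero-shot classification pipeline (downloads the model on first use)."""
    from transformers import pipeline  # imported lazily because it is a heavy dependency

    # Assumed checkpoint: a BART model fine-tuned on MNLI, not confirmed by the source.
    return pipeline("zero-shot-classification", model="facebook/bart-large-mnli")


def is_dde(comment: str, classifier: Callable[..., Dict]) -> bool:
    """Return True if the top-ranked label is the discontinuation label."""
    result = classifier(comment, candidate_labels=CANDIDATE_LABELS)
    # The pipeline returns the labels sorted by descending score.
    return result["labels"][0] == CANDIDATE_LABELS[0]
```

With the `transformers` package installed, a call such as `is_dde("I quit the medication after two weeks.", build_bart_classifier())` would label a single comment without any task-specific fine-tuning. Passing the classifier in as an argument keeps the labeling logic independent of the particular model.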

RESULTS

Among the selected models, BART performed best, achieving an F1 score of 0.86207, a false positive rate of 2.8%, and a false negative rate of 6.5% without any fine-tuning. Only 10.7% of the comments in the dataset described DDEs, which underscores the models' robustness on imbalanced data.
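To make the relationship between these metrics concrete, the following sketch computes the F1 score, false positive rate, and false negative rate from confusion-matrix counts. The counts are hypothetical, scaled to 1,000 comments and chosen only because they are consistent with the rates reported above; the abstract does not report the study's actual confusion matrix.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall."""
    return 2 * tp / (2 * tp + fp + fn)


def false_positive_rate(fp: int, tn: int) -> float:
    """Share of true non-DDE comments that were flagged as DDEs."""
    return fp / (fp + tn)


def false_negative_rate(fn: int, tp: int) -> float:
    """Share of true DDE comments that were missed."""
    return fn / (fn + tp)


# Hypothetical counts per 1,000 comments (107 DDEs, i.e. 10.7% prevalence).
# These are illustrative assumptions, not figures taken from the study.
TP, FN, FP, TN = 100, 7, 25, 868

f1 = f1_score(TP, FP, FN)
fpr = false_positive_rate(FP, TN)
fnr = false_negative_rate(FN, TP)
print(f"F1={f1:.5f}  FPR={fpr:.1%}  FNR={fnr:.1%}")
```

Because F1 ignores true negatives, it stays informative when the positive class is rare, which is why it is a natural headline metric for a dataset in which DDEs are a small minority.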

CONCLUSIONS

Our study demonstrates the effectiveness of Transformer-based NLP models, such as ChatGPT and BART, for detecting DDEs from publicly accessible data through zero-shot classification. The robust and scalable framework we propose can aid researchers in addressing data-sparse clinical research questions. The release of open-access DDE datasets stands to stimulate further research and novel discoveries in this area.

Publisher

JMIR Publications Inc.
