Disparities in seizure outcomes revealed by large language models

Author:

Xie KevinORCID,Ojemann William K.S.ORCID,Gallagher Ryan S.,Lucas AlfredoORCID,Hill Chloé E.,Hamilton Roy H.,Johnson Kevin B.,Roth DanORCID,Litt BrianORCID,Ellis Colin A.ORCID

Abstract

AbstractObjectiveLarge-language models (LLMs) in healthcare have the potential to propagate existing biases or introduce new ones. For people with epilepsy, social determinants of health are associated with disparities in access to care, but their impact on seizure outcomes among those with access to specialty care remains unclear. Here we (1) evaluated our validated, epilepsy-specific LLM for intrinsic bias, and (2) used LLM-extracted seizure outcomes to test the hypothesis that different demographic groups have different seizure outcomes.MethodsFirst, we tested our LLM for intrinsic bias in the form of differential performance in demographic groups by race, ethnicity, sex, income, and health insurance in manually annotated notes. Next, we used LLM-classified seizure freedom at each office visit to test for outcome disparities in the same demographic groups, using univariable and multivariable analyses.ResultsWe analyzed 84,675 clinic visits from 25,612 patients seen at our epilepsy center 2005-2022. We found no differences in the accuracy, or positive or negative class balance of outcome classifications across demographic groups. Multivariable analysis indicated worse seizure outcomes for female patients (OR 1.33, p = 3x10-8), those with public insurance (OR 1.53, p = 2x10-13), and those from lower-income zip codes (OR ≥ 1.22, p ≤ 6.6x10-3). Black patients had worse outcomes than White patients in univariable but not multivariable analysis (OR 1.03, p = 0.66).SignificanceWe found no evidence that our LLM was intrinsically biased against any demographic group. Seizure freedom extracted by LLM revealed disparities in seizure outcomes across several demographic groups. These findings highlight the critical need to reduce disparities in the care of people with epilepsy.Key PointsWe used large language models (LLMs) and natural language processing to extract seizure outcomes from clinical note text.We found no evidence of intrinsic bias in the LLM algorithm, in that it performed similarly across all demographic groups.Using LLM-extracted seizure outcomes, female sex, public insurance, and lower income zip- codes were associated with higher likelihood of seizures at each visit.Black race was associated with higher likelihood of seizures in univariable but not multivariable analysis.These findings highlight the critical need to reduce disparities in the care of people with epilepsy.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3