Gender Bias in Artificial Intelligence: Severity Prediction at an Early Stage of COVID-19-Reference-Cited by-同舟云学术

Gender Bias in Artificial Intelligence: Severity Prediction at an Early Stage of COVID-19

Published:2021-11-29 Issue: Volume:12 Page:
ISSN:1664-042X
Container-title:Frontiers in Physiology
language:
Short-container-title:Front. Physiol.

Author:

Chung Heewon,Park Chul,Kang Wu Seong,Lee Jinseok

Abstract

Artificial intelligence (AI) technologies have been applied in various medical domains to predict patient outcomes with high accuracy. As AI becomes more widely adopted, the problem of model bias is increasingly apparent. In this study, we investigate the model bias that can occur when training a model using datasets for only one particular gender and aim to present new insights into the bias issue. For the investigation, we considered an AI model that predicts severity at an early stage based on the medical records of coronavirus disease (COVID-19) patients. For 5,601 confirmed COVID-19 patients, we used 37 medical records, namely, basic patient information, physical index, initial examination findings, clinical findings, comorbidity diseases, and general blood test results at an early stage. To investigate the gender-based AI model bias, we trained and evaluated two separate models—one that was trained using only the male group, and the other using only the female group. When the model trained by the male-group data was applied to the female testing data, the overall accuracy decreased—sensitivity from 0.93 to 0.86, specificity from 0.92 to 0.86, accuracy from 0.92 to 0.86, balanced accuracy from 0.93 to 0.86, and area under the curve (AUC) from 0.97 to 0.94. Similarly, when the model trained by the female-group data was applied to the male testing data, once again, the overall accuracy decreased—sensitivity from 0.97 to 0.90, specificity from 0.96 to 0.91, accuracy from 0.96 to 0.91, balanced accuracy from 0.96 to 0.90, and AUC from 0.97 to 0.95. Furthermore, when we evaluated each gender-dependent model with the test data from the same gender used for training, the resultant accuracy was also lower than that from the unbiased model.

Funder

National Research Foundation of Korea

Publisher

Frontiers Media SA

Subject

Physiology (medical),Physiology

Reference20 articles.

1. Measuring the gender and ethnicity bias in deep models for face recognition;Acien;Proceedings of the Congress on Pattern Recognition,2018

2. A novel severity score to predict inpatient mortality in COVID-19 patients.;Altschul;Sci. Rep.,2020

3. Random forests.;Breiman;Mach. Learn.,2001

4. Xgboost: a scalable tree boosting system;Chen;Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2016

5. Prediction and feature importance analysis for severity of COVID-19 in south korea using artificial intelligence: model development and validation.;Chung;J. Med. Internet Res.,2021

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploración del sesgo de género en la clasificación de ocupaciones de Colombia utilizando aprendizaje automático;REVISTA COLOMBIANA DE TECNOLOGIAS DE AVANZADA (RCTA);2024-07-19

2. Assessing the Accuracy of Artificial Intelligence Models in Scoliosis Classification and Suggested Therapeutic Approaches;Journal of Clinical Medicine;2024-07-09

3. Good machine learning practices: Learnings from the modern pharmaceutical discovery enterprise;Computers in Biology and Medicine;2024-07

4. Factors for Customers’ AI Use Readiness in Physical Retail Stores: The Interplay of Consumer Attitudes and Gender Differences;Information;2024-06-12

5. Sociodemographic reporting in videomics research: a review of practices in otolaryngology - head and neck surgery;European Archives of Oto-Rhino-Laryngology;2024-05-05