The Consistency and Quality of ChatGPT Responses Compared to Clinical Guidelines for Ovarian Cancer: A Delphi Approach

Author:

Piazza Dario1ORCID,Martorana Federica2ORCID,Curaba Annabella1,Sambataro Daniela3,Valerio Maria Rosaria4,Firenze Alberto5,Pecorino Basilio67,Scollo Paolo67ORCID,Chiantera Vito8,Scibilia Giuseppe9ORCID,Vigneri Paolo1011,Gebbia Vittorio112,Scandurra Giuseppa13

Affiliation:

1. Medical Oncology Unit, Casa di Cura Torina, 90145 Palermo, Italy

2. Department of Clinical and Experimental Medicine, University of Catania, 95124 Catania, Italy

3. Medical Oncology Unit, Ospedale Umberto I, 94100 Enna, Italy

4. Medical Oncology Unit, Policlinico P. Giaccone, University of Palermo, 90133 Palermo, Italy

5. Occupational Health Section, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties, University of Palermo, 90133 Palermo, Italy

6. Gynecology Unit, Ospedale Cannizzaro, 95126 Catania, Italy

7. Gynecology, Faculty of Medicine and Surgery, University of Enna Kore, 94100 Enna, Italy

8. Gynecology, University of Palermo, 90133 Palermo, Italy

9. Gynecology Unit, Ospedale Paternò Arezzo, 97100 Ragusa, Italy

10. Medical Oncology, University of Catania, 95124 Catania, Italy

11. Medical Oncology, Istituto Clinico Humanitas, 95045 Catania, Italy

12. Medical Oncology, Faculty of Medicine and Surgery, University of Enna Kore, 94100 Enna, Italy

13. Medical Oncology Unit, Ospedale Cannizzaro, 95126 Catania, Italy

Abstract

Introduction: In recent years, generative Artificial Intelligence models, such as ChatGPT, have increasingly been utilized in healthcare. Despite acknowledging the high potential of AI models in terms of quick access to sources and formulating responses to a clinical question, the results obtained using these models still require validation through comparison with established clinical guidelines. This study compares the responses of the AI model to eight clinical questions with the Italian Association of Medical Oncology (AIOM) guidelines for ovarian cancer. Materials and Methods: The authors used the Delphi method to evaluate responses from ChatGPT and the AIOM guidelines. An expert panel of healthcare professionals assessed responses based on clarity, consistency, comprehensiveness, usability, and quality using a five-point Likert scale. The GRADE methodology assessed the evidence quality and the recommendations’ strength. Results: A survey involving 14 physicians revealed that the AIOM guidelines consistently scored higher averages compared to the AI models, with a statistically significant difference. Post hoc tests showed that AIOM guidelines significantly differed from all AI models, with no significant difference among the AI models. Conclusions: While AI models can provide rapid responses, they must match established clinical guidelines regarding clarity, consistency, comprehensiveness, usability, and quality. These findings underscore the importance of relying on expert-developed guidelines in clinical decision-making and highlight potential areas for AI model improvement.

Publisher

MDPI AG

Reference28 articles.

1. Ovarian Cancer, Version 2.2020, NCCN Clinical Practice Guidelines in Oncology;Armstrong;J. Natl. Compr. Cancer Netw.,2021

2. (2024, February 26). I Numeri Del Cancro 2023. Associazione Italiana Registri Tumori. Available online: https://www.registri-tumori.it/cms/notizie/i-numeri-del-cancro-2023.

3. (2024, February 07). National Comprehensive Cancer Network—Home. Available online: https://www.nccn.org.

4. ESMO–ESGO Consensus Conference Recommendations on Ovarian Cancer: Pathology and Molecular Biology, Early and Advanced Stages, Borderline Tumours and Recurrent Disease;Colombo;Ann. Oncol.,2019

5. (2024, February 07). Linee Guida Carcinoma Dell’ovaio. Available online: https://www.aiom.it/linee-guida-aiom-2021-carcinoma-dellovaio/.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3