Accuracy of ChatGPT‐Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

Author:

Vaira Luigi Angelo (1, 2), Lechien Jerome R. (3, 4), Abbate Vincenzo (5), Allevi Fabiana (6), Audino Giovanni (5), Beltramini Giada Anna (7, 8), Bergonzani Michela (9), Bolzoni Alessandro (7), Committeri Umberto (5), Crimi Salvatore (10), Gabriele Guido (11), Lonardi Fabio (12), Maglitto Fabio (13), Petrocelli Marzia (14), Pucci Resi (15), Saponaro Gianmarco (16), Tel Alessandro (17), Vellone Valentino (18), Chiesa‐Estomba Carlos Miguel (19), Boscolo‐Rizzo Paolo (20), Salzano Giovanni (5), De Riu Giacomo (1)

Affiliation:

1. Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Sassari, Italy

2. Biomedical Sciences Department, PhD School of Biomedical Science, University of Sassari, Sassari, Italy

3. Department of Anatomy and Experimental Oncology, Mons School of Medicine, UMONS, Research Institute for Health Sciences and Technology, University of Mons (UMons), Mons, Belgium

4. Department of Otolaryngology–Head Neck Surgery, Elsan Polyclinic of Poitiers, Poitiers, France

5. Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy

6. Maxillofacial Surgery Department, ASST Santi Paolo e Carlo, University of Milan, Milan, Italy

7. Department of Biomedical, Surgical and Dental Sciences, University of Milan, Milan, Italy

8. Maxillofacial and Dental Unit, Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico, Milan, Italy

9. Maxillo‐Facial Surgery Division, Head and Neck Department, University Hospital of Parma, Parma, Italy

10. Operative Unit of Maxillofacial Surgery, Policlinico San Marco, University of Catania, Catania, Italy

11. Department of Maxillofacial Surgery, University of Siena, Siena, Italy

12. Department of Maxillofacial Surgery, University of Verona, Verona, Italy

13. Maxillo‐Facial Surgery Unit, University of Bari “Aldo Moro”, Bari, Italy

14. Maxillofacial Surgery Operative Unit, Bellaria and Maggiore Hospital, Bologna, Italy

15. Maxillofacial Surgery Unit, San Camillo‐Forlanini Hospital, Rome, Italy

16. Maxillo‐Facial Surgery Unit, IRCCS “A. Gemelli” Foundation, Catholic University of the Sacred Heart, Rome, Italy

17. Department of Head and Neck Surgery and Neuroscience, Clinic of Maxillofacial Surgery, University Hospital of Udine, Udine, Italy

18. Maxillofacial Surgery Unit, “S. Maria” Hospital, Terni, Italy

19. Department of Otorhinolaryngology–Head and Neck Surgery, Hospital Universitario Donostia, San Sebastian, Spain

20. Department of Medical, Surgical and Health Sciences, Section of Otolaryngology, University of Trieste, Trieste, Italy

Abstract

Objective: To investigate the accuracy of Chat‐Based Generative Pre‐trained Transformer (ChatGPT) in answering questions and solving clinical scenarios of head and neck surgery.

Study Design: Observational and valuative study.

Setting: Eighteen surgeons from 14 Italian head and neck surgery units.

Methods: A total of 144 clinical questions encompassing different subspecialities of head and neck surgery and 15 comprehensive clinical scenarios were developed. Questions and scenarios were inputted into ChatGPT4, and the resulting answers were evaluated by the researchers using accuracy (range 1‐6), completeness (range 1‐3), and references' quality Likert scales.

Results: The overall median score of open‐ended questions was 6 (interquartile range [IQR]: 5‐6) for accuracy and 3 (IQR: 2‐3) for completeness. Overall, the reviewers rated the answer as entirely or nearly entirely correct in 87.2% of cases and as comprehensive and covering all aspects of the question in 73% of cases. The artificial intelligence (AI) model achieved a correct response in 84.7% of the closed‐ended questions (11 wrong answers). As for the clinical scenarios, ChatGPT provided a fully or nearly fully correct diagnosis in 81.7% of cases. The proposed diagnostic or therapeutic procedure was judged to be complete in 56.7% of cases. The overall quality of the bibliographic references was poor, and sources were nonexistent in 46.4% of the cases.

Conclusion: The results generally demonstrate a good level of accuracy in the AI's answers. The AI's ability to resolve complex clinical scenarios is promising, but it still falls short of being considered a reliable support for the decision‐making process of specialists in head‐neck surgery.

Publisher

Wiley

Subject

Otorhinolaryngology, Surgery

