Revolutionizing Radiological Analysis: The Future of French Language Automatic Speech Recognition in Healthcare

Author:

Jelassi Mariem12ORCID,Jemai Oumaima23,Demongeot Jacques4ORCID

Affiliation:

1. RIADI Laboratory, Ecole Nationale des Sciences de l’Informatique (ENSI), Manouba University, La Manouba 2010, Tunisia

2. Health Tech Innovation Systems Inc., ENSI Innovation Hub, La Manouba 2010, Tunisia

3. Ecole Supérieure des Communications de Tunis (SUP’COM), Carthage University, Ariana 2083, Tunisia

4. AGEIS Laboratory, Faculté de Médecine, Université Grenoble Alpes (UGA), 38700 La Tronche, France

Abstract

This study introduces a specialized Automatic Speech Recognition (ASR) system, leveraging the Whisper Large-v2 model, specifically adapted for radiological applications in the French language. The methodology focused on adapting the model to accurately transcribe medical terminology and diverse accents within the French language context, achieving a notable Word Error Rate (WER) of 17.121%. This research involved extensive data collection and preprocessing, utilizing a wide range of French medical audio content. The results demonstrate the system’s effectiveness in transcribing complex radiological data, underscoring its potential to enhance medical documentation efficiency in French-speaking clinical settings. The discussion extends to the broader implications of this technology in healthcare, including its potential integration with electronic health records (EHRs) and its utility in medical education. This study also explores future research directions, such as tailoring ASR systems to specific medical specialties and languages. Overall, this research contributes significantly to the field of medical ASR systems, presenting a robust tool for radiological transcription in the French language and paving the way for advanced technology-enhanced healthcare solutions.

Publisher

MDPI AG

Reference35 articles.

1. Zapata, J., and Kirkedal, A.S. (2015, January 11–13). Assessing the performance of automatic speech recognition systems when used by native and non-native speakers of three major languages in dictation workflows. Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015), Vilnius, Lithuania. Available online: https://aclanthology.org/W15-1825.pdf.

2. Jelassi, M., Matteli, K., Khalfallah, H.B., and Demongeot, J. (2024). Enhancing Mental Health Support through Artificial Intelligence: Advances in Speech and Text Analysis within Online Therapy Platforms. Preprints, 2024021585.

3. Jiang, Y., and Poellabauer, C. (2021, January 9–12). A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition. Proceedings of the 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Houston, TX, USA. Available online: https://ieeexplore.ieee.org/abstract/document/9669554/.

4. Essaid, B., Kheddar, H., Batel, N., Lakas, A., and Chowdhury, M.E. (2024). Advanced Artificial Intelligence Algorithms in Cochlear Implants: Review of Healthcare Strategies, Challenges, and Perspectives. arXiv.

5. Adedeji, A., Joshi, S., and Doohan, B. (2024). The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models. arXiv.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3