Affiliation:
1. Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, Lithuania
2. Department of Otorhinolaryngology, Academy of Medicine, Lithuanian University of Health Sciences, 44240 Kaunas, Lithuania
Abstract
The problem of cleaning impaired speech is crucial for various applications such as speech recognition, telecommunication, and assistive technologies. In this paper, we propose a novel approach that combines Pareto-optimized deep learning with non-negative matrix factorization (NMF) to effectively reduce noise in impaired speech signals while preserving the quality of the desired speech. Our method begins by calculating the spectrogram of a noisy voice clip and extracting frequency statistics. A threshold is then determined based on the desired noise sensitivity, and a noise-to-signal mask is computed. This mask is smoothed to avoid abrupt transitions in noise levels, and the modified spectrogram is obtained by applying the smoothed mask to the signal spectrogram. We then employ a Pareto-optimized NMF to decompose the modified spectrogram into basis functions and corresponding weights, which are used to reconstruct the clean speech spectrogram. The final noise-reduced waveform is obtained by inverting the clean speech spectrogram. Our proposed method achieves a balance between various objectives, such as noise suppression, speech quality preservation, and computational efficiency, by leveraging Pareto optimization in the deep learning model. The experimental results demonstrate the effectiveness of our approach in cleaning alaryngeal speech signals, making it a promising solution for various real-world applications.
Funder
European Regional Development Fund under grant agreement with the Research Council of Lithuania (LMTLT). 531 Funded as European Union’s measure in response to COVID-19 pandemic
Reference98 articles.
1. An update on larynx cancer;Steuer;CA Cancer J. Clin.,2016
2. Management and Outcome Differences in Supraglottic Cancer Between Ontario, Canada, and the Surveillance, Epidemiology, and End Results Areas of the United States;Groome;J. Clin. Oncol.,2003
3. Explaining Socioeconomic Status Effects in Laryngeal Cancer;Groome;Clin. Oncol.,2006
4. Laryngeal Cancer in the United States: Changes in Demographics, Patterns of Care, and Survival;Hoffman;Laryngoscope,2006
5. NCCN Guidelines® Insights: Head and Neck Cancers, Version 1.2022;Caudell;J. Natl. Compr. Cancer Netw.,2022
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献