A Review on Speech Recognition for Under-Resourced Languages

Published:2023-10-27 Issue:1 Volume:15 Page:1-16
ISSN:1947-8208
Container-title:International Journal of Knowledge and Systems Science
language:ng

Author:

Phung Trung-Nghia¹, Nguyen Duc-Binh¹, Pham Ngoc-Phuong²

Affiliation:

1. Thai Nguyen University of Information and Communication Technology, Vietnam

2. Thai Nguyen University, Vietnam

Abstract

Deep learning models now make it possible to build high-quality speech recognition applications for high-resourced languages. However, “borrowing” these technologies for under-resourced languages such as Vietnamese remains challenging. This study reviews fundamental work on speech recognition in general and on speech recognition for Vietnamese, an under-resourced language, in particular. It then identifies the urgent issues that research must address before Vietnamese speech recognition applications can be built in practice, especially the need for a large, open, sentence-labeled speech corpus and an open platform for related research, which would mostly benefit individuals and small organizations that lack sufficient resources.

Publisher

IGI Global

Subject

Artificial Intelligence, Management of Technology and Innovation, Information Systems and Management, Organizational Behavior and Human Resource Management, Strategy and Management, Information Systems

References: 68 articles.

1. Adams, O. (2016). Learning a Lexicon and Translation Model from Phoneme Lattices. EMNLP, 2016.

2. Anastasakos, T. A. (1997). Speaker adaptive training: a maximum likelihood approach to speaker normalization. In Acoustics, Speech, and Signal Processing (ICASSP; pp. 1043–1046), Munich.

3. Bashir, M. F., Javed, A. R., Arshad, M. U., Gadekallu, T. R., Shahzad, W., & Beg, M. O. (2021). Context aware emotion detection from low resource URDU language using deep neural network. Transactions on Asian and Low-Resource Language Information Processing, 2021.

4. Chong, J. N. (2011). Prosody Dependent Mandarin Speech Recognition. International Joint Conference on Neural Networks.

5. Deng, L. (2012). Scalable stacking and learning for building deep architectures. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Cited by 1 article.

1. Constructing an Interpretive Structural Model to Unravel the Interconnected Drivers of Teaching Quality in Higher Education. International Journal of Knowledge and Systems Science, 2024-02-26.