Affiliation:
1. School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea
Abstract
A multichannel speech enhancement system usually consists of spatial filters such as adaptive beamformers followed by postfilters, which suppress remaining noise. Accurate estimation of the power spectral density (PSD) of the residual noise is crucial for successful noise reduction in the postfilters. In this paper, we propose a postfilter utilizing proposed a posteriori speech presence probability (SPP) and noise PSD estimators, which are based on both the coherence and the statistical models. We model the coherence-based a posteriori SPP as a simple function of the magnitude of coherence between two microphone signals and combine it with a single-channel SPP based on statistical models. The coherence-based estimator for the PSD of the noise remaining in the beamformer output in the presence of speech is derived using the pseudo-coherence considering the effect of the beamformers, which is used to construct the coherence-based noise PSD estimator. Then, the final noise PSD estimator is obtained by combining the coherence-based and statistical model-based noise PSD estimators with the proposed SPP. The spectral gain function is also modified, incorporating the proposed SPP. Experimental results demonstrate that the proposed method led to more accurate noise PSD estimation and perceptual evaluation of speech quality scores in various diffuse noise environments, and did not degrade the speech quality under the presence of directional interference, although the proposed method utilizes the coherence information.
Funder
Institute for Information and Communications Technology Promotion
Reference49 articles.
1. Vary, P., and Martin, R. (2006). Digital Speech Transmission: Enhancement, Coding and Error Concealment, John Wiley & Sons.
2. Benesty, J., Chen, J., and Huang, Y. (2008). Microphone Array Signal Processing, Springer Science & Business Media.
3. Kates, J.M. (2008). Digital Hearing Aids, Plural Publishing.
4. Rabiner, L. (1993). Fundamentals of Speech Recognition, PTR Prentice Hall.
5. Decision-directed speech power spectral density matrix estimation for multichannel speech enhancement;Jin;J. Acoust. Soc. Am.,2017
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献