Sudden Noise Reduction Based on GMM with Noise Power Estimation

Nobuyuki Miyake; Tetsuya Takiguchi; Yasuo Ariki

doi:10.4236/jsea.2010.34039

Journal of Software Engineering and Applications > Vol.3 No.4, April 2010

Sudden Noise Reduction Based on GMM with Noise Power Estimation

Nobuyuki Miyake, Tetsuya Takiguchi, Yasuo Ariki
Graduate School of Engineering, Kobe University, Kobe, Japan..
DOI: 10.4236/jsea.2010.34039 PDF HTML 5,897 Downloads 10,210 Views Citations

Abstract

This paper describes a method for reducing sudden noise using noise detection and classification methods, and noise power estimation. Sudden noise detection and classification have been dealt with in our previous study. In this paper, GMM-based noise reduction is performed using the detection and classification results. As a result of classification, we can determine the kind of noise we are dealing with, but the power is unknown. In this paper, this problem is solved by combining an estimation of noise power with the noise reduction method. In our experiments, the proposed method achieved good performance for recognition of utterances overlapped by sudden noises.

Keywords

Sudden Noise, Model-Based Noise Reduction, Speech Recognition

Share and Cite:

Miyake, N. , Takiguchi, T. and Ariki, Y. (2010) Sudden Noise Reduction Based on GMM with Noise Power Estimation. Journal of Software Engineering and Applications, 3, 341-346. doi: 10.4236/jsea.2010.34039.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1]	M. Fujimoto, et al., “Particle Filter Based Non-Stationary Noise Tracking for Robust Speech Recognition,” Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005, pp. 257-260.
[2]	M. Kotta, et al., “Speech Enhancement in Non-Stationary Noise Environments Using Noise Properties,” Speech Communication, Vol. 48, No. 11, 2006, pp. 96-109.
[3]	T. Jitsuhiro, et al., “Robust Speech Recognition Using Noise Suppression Based on Multiple Composite Models and Multi-Pass Search,” Proceedings of Automatic Speech Recognition and Understanding (ASRU), 2007, pp. 53-58.
[4]	T. Hirai, S. Kuroiwa, S. Tsuge, F. Ren, M. A. Fattah, “A Speech Emphasis Method for Noise-Robust Speech Recognition by Using Repetitive Phrase,” Proceedings of International Conference on Chemical Thermodynamics (ICCT), 2006, pp. 1-4.
[5]	P. J. Moreno, B. Raj and R. M. Stern, “A Vector Taylor Series Approach for Environment Independent Speech Recognition,” Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1996, pp. 733-736.
[6]	J. C. Segura, et al., “Model-Based Compensation of the Additive Noise for Continuous Speech Recognition. Experiments Using the AURORA II Database and Tasks,” Proceedings of Eurospeech, 2001, pp. 221-224.
[7]	L. Deng, et al., “Enhancement of Log Mel Power Spectra of Speech Using a Phase-Sensitive Model of the Acoustic Environment and Sequential Estimation of the Corrupting Noise,” IEEE Transactions on Speech and Audio Pro- cessing, Vol. 12, 2004, pp. 133-143.
[8]	N. Miyake, T. Takiguchi and Y. Ariki, “Noise Detection and Classification in Speech Signals with Boosting,” IEEE Workshop on Statistical Signal Processing (SSP), 2007, pp. 778-782.
[9]	Y. Freund, et al., “A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting,” Journal of Computer and System Sciences, Vol. 55, 1997, pp. 119-139.
[10]	S. Nakamura, et al., “Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition,” Proceedings of 2nd ICLRE, 2000, pp. 965-968.

Journals Menu

Follow SCIRP

	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies