Identification of Noisy Utterance Speech Signal using GA-Based Optimized 2D-MFCC Method and a Bispectrum Analysis

Benyamin Kusumoputro; Agus Buono; Li Na

doi:10.4236/jsea.2012.512B037

Journal of Software Engineering and Applications > Vol.5 No.12B, December 2012

Identification of Noisy Utterance Speech Signal using GA-Based Optimized 2D-MFCC Method and a Bispectrum Analysis

Benyamin Kusumoputro, Agus Buono, Li Na
Department of Computer Science, Bogor Agricultural University, Bogor, Indonesia.
Department of Computer Science, Tarumanagara University, Jakarta, Indonesia.
Department of Electrical Engineering, Universitas Indonesia, Jakarta, Indonesia.
DOI: 10.4236/jsea.2012.512B037 PDF HTML 4,297 Downloads 6,171 Views

Abstract

One-dimensional Mel-Frequency Cepstrum Coefficients (1D-MFCC) in conjunction with a power spectrum analysis method is usually used as a feature extraction in a speaker identification system. However, as this one dimensional feature extraction subsystem shows low recognition rate for identifying an utterance speech signal under harsh noise conditions, we have developed a speaker identification system based on two-dimensional Bispectrum data that was theoretically more robust to the addition of Gaussian noise. As the processing sequence of ID-MFCC method could not be directly used for processing the two-dimensional Bispectrum data, in this paper we proposed a 2D-MFCC method as an extension of the 1D-MFCC method and the optimization of the 2D filter design using Genetic Algorithms. By using the 2D-MFCC method with the Bispectrum analysis method as the feature extraction technique, we then used Hidden Markov Model as the pattern classifier. In this paper, we have experimentally shows our developed methods for identifying an utterance speech signal buried with various levels of noise. Experimental result shows that the 2D-MFCC method without GA optimization has a comparable high recognition rate with that of 1D-MFCC method for utterance signal without noise addition. However, when the utterance signal is buried with Gaussian noises, the developed 2D-MFCC shows higher recognition capability, especially, when the 2D-MFCC optimized by Genetics Algorithms is utilized.

Keywords

2D Mel-Frequency Cepstrum Coefficients, Bispectrum, Hidden Markov Model, Genetics Algorithms

Share and Cite:

B. Kusumoputro, A. Buono and L. Na, "Identification of Noisy Utterance Speech Signal using GA-Based Optimized 2D-MFCC Method and a Bispectrum Analysis," Journal of Software Engineering and Applications, Vol. 5 No. 12B, 2012, pp. 193-199. doi: 10.4236/jsea.2012.512B037.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1]	Z. Li, J. Sun, J. Han, f. Chu and Y. He, Parametric bispectrum analysis of cracked rotor based on blind identification of time series models, IEEE Proceeding of Intelligent Control and Automation, Vol. 2, 2006, pp.5729-5733.
[2]	I. Jouny, E.D. Garber and R.L. Moses, Radar target identification using the bispec-trum: a comparative study, IEEE Trans. Aerospace and Electronic Systems, Vol. 31, No. 1, 1995, pp. 69-77.
[3]	E.S. Fonseca, R.C. Guido, A.C. Silvestre and J.C. Pereira, Discrete wavelet transform and support vector machine applied to pathological voice signals identification, IEEE Proceeding of Interna-tional Symposium on Multimedia, 2005
[4]	Z. Wang and H. Wang, Voice identification system based on server, IEEE Proceeding of Intern. Conf. on Computer Application and System Modeling, Vol. 9, 2010, pp. 384-387.
[5]	M. Abdollahi, E. Valavi and H.A. Noubari, Voice-based gender identification via multiresolution frame classification of spec-tro-temporal maps, IEEE Proceeding of Intern. Joint Conf. on Neural Networks, 2009, pp. 1-4.
[6]	T.D. Ganchev, Speaker Recognition, PhD Dissertation, Wire Communications Laboratory, Department of Computer and Electrical Engineering, University of Patras Greece, 2005
[7]	B. Kusumoputro, A. Tri-yanto, M.I. Fanany and W. Jatmiko, Speaker identi-fication in noisy environment using bispectrum anal-ysis and probabilistic neural networks, IEEE Pro-ceeding of Intern. Conf. on Computational Intelligence and Multimedia Application, 2001, pp.118-123.
[8]	C.L. Nikeas and A.P. Petropulu, Higher order spectra analysis: A Nonlinear Signal Processing Framework, Prentice-Hall, Inc. New Jersey, 1993.
[9]	T.E. Ozkurt and T. Akgul, Robust text-independent speaker identification using bispectrum slice, IEEE Proceeding of Signal Processing and Communications Applications, 2004, pp. 418-421.
[10]	L. Luo and L.F. Chaparro, Para-metric identification of systems using a frequency slice of the bispectrum, IEEE Proceeding of Intern. Conf. on Acoustic, Speech and Signal Processing, Vol. 3, 1991, pp. 3481-3484
[11]	L. Rabiner. A Tutorial on Hidden Markov Model and Selected Applications in Speech Recognition. Proceeding IEEE, Vol 77 No. 2. February 1989.
[12]	Cornaz, C. dan U. Hunkeler. An Automatic Speaker Recognition System. Mini-Project. http://www.ifp.uiuc.edu/~minhdo/teaching/speaker_recognition, access : August, 15, 2008.
[13]	Zbigniew M. Genetic Algorithms + Data structures = Evolution Programs, 3th Edition, Springer, 1996.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies