Cepstral and linear prediction techniques for improving intelligibility and audibility of impaired speech

G. Ravindran; S. Shenbagadevi; V. Salai Selvam

doi:10.4236/jbise.2010.31013

Journal of Biomedical Science and Engineering > Vol.3 No.1, January 2010


Abstract

Human speech can become impaired, i.e., unintelligible, for a variety of reasons that may be neurological or anatomical. The objective of this research was to improve the intelligibility and audibility of impaired speech resulting from a disabled speech mechanism in which the impairment lies in the acoustic system, the supra-laryngeal vocal tract. Three methods are presented for this purpose. Method 1 develops an inverse model of the speech degradation using the cepstral technique. Method 2 replaces the degraded vocal tract response with a normal vocal tract response using the cepstral technique. Method 3 replaces the degraded vocal tract response with a normal vocal tract response using the linear prediction technique.
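As a rough illustration of the idea behind Method 2, the sketch below swaps the low-quefrency part of the real cepstrum (commonly taken to approximate the vocal tract response) of an impaired frame for that of a normal frame, while keeping the impaired frame's excitation and phase. This is only a minimal sketch under stated assumptions, not the authors' implementation: the function names, the lifter cutoff `n_low`, the single-frame processing, and the reuse of the impaired phase are all hypothetical choices made for the example.

```python
import numpy as np

def real_cepstrum(frame, eps=1e-10):
    """Return the real cepstrum and the spectral phase of one frame."""
    spectrum = np.fft.fft(frame)
    # eps avoids log(0) in empty frequency bins (assumed value)
    log_mag = np.log(np.abs(spectrum) + eps)
    return np.fft.ifft(log_mag).real, np.angle(spectrum)

def split_cepstrum(cep, n_low):
    """Split a cepstrum into low-quefrency (envelope) and
    high-quefrency (excitation) parts with a symmetric lifter."""
    lifter = np.zeros(len(cep))
    lifter[:n_low] = 1.0
    if n_low > 1:
        lifter[-(n_low - 1):] = 1.0  # mirror image of the low quefrencies
    return cep * lifter, cep * (1.0 - lifter)

def replace_vocal_tract(impaired, normal, n_low=30):
    """Combine the envelope of `normal` with the excitation of `impaired`.

    Both frames must have the same length; n_low is an assumed cutoff.
    """
    cep_imp, phase_imp = real_cepstrum(impaired)
    cep_nor, _ = real_cepstrum(normal)
    _, excitation_imp = split_cepstrum(cep_imp, n_low)
    envelope_nor, _ = split_cepstrum(cep_nor, n_low)
    # Back to the log-magnitude spectrum, then to the time domain
    log_mag = np.fft.fft(envelope_nor + excitation_imp).real
    spectrum = np.exp(log_mag) * np.exp(1j * phase_imp)
    return np.fft.ifft(spectrum).real

# Usage with synthetic stand-in frames (not real speech data)
rng = np.random.default_rng(0)
impaired_frame = rng.standard_normal(256)
normal_frame = rng.standard_normal(256)
modified_frame = replace_vocal_tract(impaired_frame, normal_frame)
```

In practice the signal would be processed frame by frame with windowing and overlap-add, and the two utterances would need to be time-aligned first; those steps are left out here to keep the sketch short.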

Keywords

Impaired Speech; Speech Disability; Cepstrum; LPC; Vocal Tract

Share and Cite:

Ravindran, G., Shenbagadevi, S. and Selvam, V. (2010) Cepstral and linear prediction techniques for improving intelligibility and audibility of impaired speech. Journal of Biomedical Science and Engineering, 3, 85-94. doi: 10.4236/jbise.2010.31013.

Conflicts of Interest

The authors declare no conflicts of interest.
