Cepstral and linear prediction techniques for improving intelligibility and audibility of impaired speech
G. Ravindran, S. Shenbagadevi, V. Salai Selvam
.
DOI: 10.4236/jbise.2010.31013   PDF    HTML     4,638 Downloads   8,869 Views   Citations

Abstract

Human speech becomes impaired i.e., unintelligible due to a variety of reasons that can be either neurological or anatomical. The objective of the research was to improve the intelligibility and audibility of the impaired speech that resulted from a disabled human speech mechanism with impairment in the acoustic system-the supra-laryngeal vocal tract. For this purpose three methods are presented in this paper. Method 1 was to develop an inverse model of the speech degradation using the Cepstral technique. Method 2 was to replace the degraded vocal tract response by a normal vocal tract response using the Cepstral technique. Method 3 was to replace the degraded vocal tract response by a normal vocal tract response using the Linear Prediction technique.

Share and Cite:

Ravindran, G. , Shenbagadevi, S. and Selvam, V. (2010) Cepstral and linear prediction techniques for improving intelligibility and audibility of impaired speech. Journal of Biomedical Science and Engineering, 3, 85-94. doi: 10.4236/jbise.2010.31013.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] (2004) NICHCY disability fact sheet., Speech & Language Impairments. NICHCY. 11.
[2] (2002) Department of Education, Special education programs and services guide, State of Michigan State.
[3] Shuzo, S. and Kazuo, N. (1985) Fundamental of Speech Signal Processing. Academic Press, London.
[4] Rabiner, L.R. and Schafer, R.W. (1978) Digital processing of speech signal, Prentice-Hall, Engliwood Cliffs, NJ.
[5] Rabiner, L.R. and Juang, B.H. (1993) Fundamentals of speech recognition, Prentice-Hall, Engliwood Cliffs, NJ.
[6] Rabiner, L.R. and Bernard, G. (1992) Theory and application of digital signal processing, Prentice-Hall of India, New Delhi, Chapter 12.
[7] Thomas, F.Q. (2004) Discrete-time speech signal processing. Pearson Education, Singapore.
[8] Oppenheim, A.V. and Schafer, R.W. (1992) discrete-time signal processing, Prentice-Hall of India, New Delhi.
[9] Oppenheim, A.V. (1969) Speech analysis-synthesis based on homomorphic filtering, Journal of Acoustic Society of America, 45, 458-465.
[10] Oppenheim, A.V. (1976) Signal analysis by homomorphic prediction. Proc. IEEE, ASSP, 24, 327.
[11] Proakis, J. G. and Manolakis, D. G. (2000) Digital Signal Processing, Prentice-Hall of India, New Delhi.
[12] Tony, R. (1998) Speech Analysis Lent Term.
[13] Nipul, B, Sara, M., Slavinskym J.P. and Aamirm V. (2000) A project on speaker recognition’ rice university.
[14] Makhoul, J. (1975) Linear prediction: a tutorial review, Proc. IEEE, 63, 561-580.
[15] Jean, L. and Mark, D. (1999) New phase-vocoder techniques for pitch-shifting, harmonizing and other exotic effects, Proc. IEEE WASPAA.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.