Validation of Optimum Algorithm Parameters Required to Estimate Vocal Tract Shape for Children Using LPC Analysis

Abstract

Severe or profound deafness in hearing impaired children, can curb their ability to speak due to the lack of auditory feedback. There has been a considerable attempt in developing commercial speech training aids for such children which give feedback of acoustic and articulatory parameters. Speech training aids based on visual feedback of vocal tract shape (VTS) are reported to be useful for the improvement in speech production. Since realistic VTS estimation for adult speakers and their validation has already been done successfully, VTS estimation is now necessarily required in case of children too, so that they get trained in speech at an early age. The investigation on vocal tract shape estimation based on LPC analysis of speech by appropriately selecting some of the algorithm parameters such as vocal tract length, LPC order, and speech sampling rate has been done in our previous work. This paper attempts to validate the obtained results for vocal tract shapes corresponding to certain recorded vowels from children belonging to specific age groups. Since MRI images of VTS are unavailable for articulating children, validation of our results is based on the results from researchers who have used other indirect techniques to obtain VTS.

Share and Cite:

Wankhede, N. and Shah, M. (2014) Validation of Optimum Algorithm Parameters Required to Estimate Vocal Tract Shape for Children Using LPC Analysis. Open Access Library Journal, 1, 1-10. doi: 10.4236/oalib.1100690.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Nickerson, R.S. and Stevens, K.N. (1973) Teaching Speech to the Deaf: Can a Computer Help? IEEE Transactions on Audio and Electroacoustics, 21, 445-455.
http://dx.doi.org/10.1109/TAU.1973.1162508
[2] Bernstein, L., Goldstein, J. and Mahshie, J.J. (1988) Speech Training Aids for Hearing-Impaired Individuals: Overview and Aims. Journal of Rehabilitation Research and Development, 25, 53-62.
[3] Park S.H., Kim, D.J., Lee J.H. and Yoon, T.S. (1994) Integrated Speech Training System for Hearing Impaired. Transactions on Neural Systems Rehabilitation Engineering, 2, 189-196.
[4] Bernstein, L.E., Ferguson, J.B. and Goldstein, M.H. (1986) Speech Training Devices for Profoundly Deaf Children. IEEE International Conference on Acoustics, Speech and Signal Processing, 11, 633-636.
[5] Watson, C.S., Elbert, M. and DeVane, G. (1987) The Indiana Speech Training Aid (ISTRA). The Journal of the Acoustical Society of America, 81, 95.
[6] Boothroyd, A., Hanin, L., Yeung, E. and Chen, Q. (1992) Video-Game for Speech Perception Testing and Training of Young Hearing-Impaired Children. Proceedings of the Johns Hopkins National Search for Computing Applications to Assist Persons with Disabilities, Laurel, 1-5 February 1992, 25-28.
[7] Mahdi, A.E. (2008) Visualization of the Vocal-Tract Shape for a Computer-Based Speech Training System for the Hearing-Impaired. The Open Electrical and Electronic Engineering Journal, 2, 27-32.
http://dx.doi.org/10.2174/1874129000802010027
[8] Shah, M.S. and Pandey, P.C. (2005) Estimation of Vocal Tract Shape for VCV Syllables for a Speech Training Aid. Proceedings of 27th Annual Conference of the IEEE Engineering in Medicine and Biology Society, Shanghai, 2005, 6642-6645.
[9] Pandey, P.C. and Nagesh, N. (2009) Estimation of Lip Opening for Scaling of Vocal Tract Area Function for Speech Training Aids. National Conference on Communications (NCC), Kharagpur, 3-5 February 2012, 3-5.
[10] Denby, B. and Stone, M. (2004) Speech Synthesis by Real-Time Ultrasound Images of the Tongue. Proceedings of IEEE International Conference Acoustics, Speech, Signal Process, I, 685-688.
[11] Westbury, J.R. (2014) X-Ray Microbeam Speech Production Database User’s Handbook. Version 1.0.
[12] Ziad, A., Lorenzo, T., Richard, M.S. and Bhiksha, R. (2009) Deriving Vocal Tract Shapes from Electromagnetic Articulograph Data via Geometric Adaptation and Matching. INTERSPEECH’09, 2051-2054.
[13] Story, B.H., Titze, I.R. and Hoffman, E.A. (1996) Vocal Tract Area Functions from Magnetic Resonance Imaging. The Journal of the Acoustical Society of America, 100, 537-554.
http://dx.doi.org/10.1121/1.415960
[14] Bresch, E., Kim, Y., Nayak, K., Byrd, D. and Narayanan, S. (2008) Seeing Speech: Capturing Vocal Tract Shaping Using Real-Time Magnetic Resonance Imaging. IEEE Signal Processing Magazine, 25, 123-132.
http://dx.doi.org/10.1109/MSP.2008.918034
[15] Schroeter, J. and Sondhi, M. (1994) Techniques for Estimating Vocal-Tract Shapes from the Speech Signal. IEEE Transaction on Speech and Audio Processing, 2, 133-150.
[16] Mermelstein, P. (1967) Determination of the Vocal-Tract Shape from Measured Formant Frequencies. Journal of the Acoustical Society of America, 41, 1283-1294.
http://dx.doi.org/10.1121/1.1910470
[17] Ladefoged, P., Harshman, R., Goldstein, L. and Rice, L. (1978) Generating Vocal Tract Shapes from Formant Frequencies. Journal of the Acoustical Society of America, 64, 1027-1035.
http://dx.doi.org/10.1121/1.382086
[18] Calum, D. (2005) Acoustic Pulse Reflectometry for Measurement of the Vocal Tract. PhD Thesis, University of Edinburgh, Edinburgh.
[19] Wakita, H. (1973) Direct Estimation of the Vocal Tract Shape by Inverse Filtering of Acoustic Speech Waveforms. IEEE Transactions on Audio and Electroacoustics, 21, 417-427.
http://dx.doi.org/10.1109/TAU.1973.1162506
[20] Wakita, H. (1979) Estimation of Vocal Tract Shapes from Acoustical Analysis of the Speech Wave: The State of the Art. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP, 27, 281-285.
http://dx.doi.org/10.1109/TASSP.1979.1163242
[21] Wankhede, N.S. and Shah, M.S. (2013) Investigation on Optimum Parameters for LPC Based Vocal Tract Shape Estimation. 2013 International Conference on Emerging Trends in Communication, Control, Signal Processing & Computing Applications (C2SPCA), Bangalore, 10-11 October 2013, 1-6.
[22] Fitch, W. and Giedd, J. (1999) Morphology and Development of the Human Vocal Tract: A Study Using Magnetic Resonance Imaging. Journal of the Acoustical Society of America, 106, 1511-1522.
http://dx.doi.org/10.1121/1.427148
[23] Bunton, K., Story, B.H. and Titze, I. (2013) Estimation of Vocal Tract Area Functions in Children Based on Measurement of Lip Termination Area and Inverse Acoustic Mapping. ICA 2013 Montreal, Proceedings of Meetings on Acoustics, 19, Article ID: 060054, 1-8.
[24] Rabiner, L.R. and Schafer, R.W. (1978) Digital Processing of Speech Signals. Prentice Hall, Englewood Cliffs.
[25] O’Shaughnessy, D. (1987) Speech Communication: Human and Machine. Addison-Wesley, Reading.
[26] Vorperian, H.K., Wang, S.B., Chung, M., Schimek, E.M., Durtschi, R.B., Kent, R.D., Ziegert, A.J. and Gentry, L.R. (2009) Anatomic Development of the Oral and Pharyngeal Portions of the Vocal Tract: An Imaging Study. Journal of the Acoustical Society of America, 125, 1666-1678.
http://dx.doi.org/10.1121/1.3075589
[27] Vorperian, H.K., Kent, R., Gentry, L. and Yandell, B. (1999) Magnetic Resonance Imaging Procedures to Study the Concurrent Anatomic Development of Vocal Tract Structures: Preliminary Results. International Journal of Pediatric Otorhinolaryngology, 49, 197-206.
http://dx.doi.org/10.1016/S0165-5876(99)00208-6
[28] Vorperian, H.K., Kent, R., Lindstrom, M.J., Kalina, C.M., Gentry, L. and Yandell, B. (2005) Development of Vocal Tract Length during Early Childhood: A Magnetic Resonance Imaging Study. Journal of the Acoustical Society of America, 117, 338-350.
http://dx.doi.org/10.1121/1.1835958

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.