A Multi-Band Speech Enhancement Algorithm Exploiting Iterative Processing for Enhancement of Single Channel Speech

DOI: 10.4236/jsip.2013.42027   PDF   HTML     21,304 Downloads   59,313 Views   Citations


This paper proposes a multi-band speech enhancement algorithm exploiting iterative processing for enhancement of single channel speech. In the proposed algorithm, the output of the multi-band spectral subtraction (MBSS) algorithm is used as the input signal again for next iteration process. As after the first MBSS processing step, the additive noise transforms to the remnant noise, the remnant noise needs to be further re-estimated. The proposed algorithm reduces the remnant musical noise further by iterating the enhanced output signal to the input again and performing the operation repeatedly. The newly estimated remnant noise is further used to process the next MBSS step. This procedure is iterated a small number of times. The proposed algorithm estimates noise in each iteration and spectral over-subtraction is executed independently in each band. The experiments are conducted for various types of noises. The performance of the proposed enhancement algorithm is evaluated for various types of noises at different level of SNRs using, 1) objective quality measures: signal-to-noise ratio (SNR), segmental SNR, perceptual evaluation of speech quality (PESQ); and 2) subjective quality measure: mean opinion score (MOS). The results of proposed enhancement algorithm are compared with the popular MBSS algorithm. Experimental results as well as the objective and subjective quality measurement test results confirm that the enhanced speech obtained from the proposed algorithm is more pleasant to listeners than speech enhanced by classical MBSS algorithm.

Share and Cite:

N. Upadhyay and A. Karmakar, "A Multi-Band Speech Enhancement Algorithm Exploiting Iterative Processing for Enhancement of Single Channel Speech," Journal of Signal and Information Processing, Vol. 4 No. 2, 2013, pp. 197-211. doi: 10.4236/jsip.2013.42027.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] D. O’Shaughnessy, “Speech Communications: Human and Machine,” 2nd Edition, University Press (India) Pvt. Ltd., Hyderabad, 2007.
[2] Y. Ephraim, “Statistical-Model-Based Speech Enhancement Systems,” Proceedings IEEE, Vol. 80, No. 10, 1992, pp. 1526-1555. doi:10.1109/5.168664
[3] P. C. Loizou, “Speech Enhancement: Theory and Practice,” Taylor and Francis, London, 2007.
[4] Y. Ephraim, H. L. Ari and W. Roberts, “A Brief Survey of Speech Enhancement,” 3rd Edition, Electrical Engineering Handbook, CRC, Boca Raton, 2006.
[5] Y. Ephraim and I. Cohen, “Recent Advancements in Speech Enhancement,” The Electrical Engineering Handbook, CRC Press, Boca Raton, 2006, pp. 12-26.
[6] J. S. Lim and A. V. Oppenheim, “Enhancement and Bandwidth Compression of Noisy Speech,” Proceedings IEEE, Vol. 67, No. 12, 1979, pp. 1586-1604. doi:10.1109/PROC.1979.11540
[7] S. F. Boll, “Suppression of Acoustic Noise in Speech Using Spectral Subtraction,” IEEE Transaction Acoustic, Speech, Signal Processing, Vol. 27, No. 2, 1979, pp. 113-120. doi:10.1109/TA SSP.1979.1163209
[8] M. Berouti, R. Schwartz and J. Makhoul, “Enhancement of Speech Corrupted by Acoustic Noise,” Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Washington DC, April 1979, pp. 208-211.
[9] S. Kamath and P. Loizou, “A Multi-Band Spectral SubTraction Method for Enhancing Speech Corrupted by Colored Noise,” Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Orlando, 13-17 May 2002, pp. IV-4164. doi:10.1109/ICASSP.2002.5745591
[10] N. Upadhyay and A. Karmakar, “The Spectral Subtractive-Type Algorithms for Enhancement of Noisy Speech: A Review,” International Journal of Research and Reviews in Signal Acquisition and Processing, Vol. 1, No. 3, 2011, pp. 43-49.
[11] N. Upadhyay and A. Karmakar, “Single Channel Speech Enhancement Utilizing Iterative Processing of MultiBand Spectral Subtraction Algorithm,” Proceedings of International Conference on Power, Control and Embedded Systems, Allahabad, 17-19 December 2012, pp. 196-201.
[12] S. Ogata and T. Shimamura, “Reinforced Spectral Subtraction Method to Enhance Speech Signal,” Proceedings of International Conference on Electrical and Electronic Technology, Vol. 1, 2011, pp. 242-245.
[13] S. Li, J.-Q. Wang, M. Niu, X.-J. Jing and T. Liu, “Iterative Spectral Subtraction Method for Mil-Limeter Wave Conducted Speech Enhancement,” Journal Biomedical Science and Engineering, Vol. 3, No. 2, 2010, pp. 187-192. doi:10.4236/jbise.2010.32024
[14] “A Noisy Speech Corpus for Assessment of Speech Enhancement Algorithms,” http://www.utdallas. edu/~loizou/speech/noizeus
[15] “Perceptual Evaluation of Speech Quality (PESQ), and Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs,” ITU, 2000, p. 862.

comments powered by Disqus

Copyright © 2020 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.