Këpuska, V.Z. and Klein, T.B. (2009) A novel wake-up-word speech recognition system, wake-up-word recognition task, technology and evaluation. Nonlinear Analysis, Theory, Methods & Applications, 71, e2772-e2789. - References

Journals by Subject

Publish with us

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Article citationsMore>>

Këpuska, V.Z. and Klein, T.B. (2009) A novel wake-up-word speech recognition system, wake-up-word recognition task, technology and evaluation. Nonlinear Analysis, Theory, Methods & Applications, 71, e2772-e2789.

has been cited by the following article:

TITLE: Wake-Up-Word Feature Extraction on FPGA

AUTHORS: Veton Z. Këpuska, Mohamed M. Eljhani, Brian H. Hight

KEYWORDS: Speech Recognition System; Feature Extraction; Mel-Frequency Cepstral Coefficients; Linear Predictive Coding Coefficients; Enhanced Mel-Frequency Cepstral Coefficients; Hidden Markov Models; Field-Programmable Gate Arrays

JOURNAL NAME: World Journal of Engineering and Technology, Vol.2 No.1, January 29, 2014

ABSTRACT: Wake-Up-Word Speech Recognition task (WUW-SR) is a computationally very demand, particularly the stage of feature extraction which is decoded with corresponding Hidden Markov Models (HMMs) in the back-end stage of the WUW-SR. The state of the art WUW-SR system is based on three different sets of features: Mel-Frequency Cepstral Coefficients (MFCC), Linear Predictive Coding Coefficients (LPC), and Enhanced Mel-Frequency Cepstral Coefficients (ENH_MFCC). In (front-end of Wake-Up-Word Speech Recognition System Design on FPGA) [1], we presented an experimental FPGA design and implementation of a novel architecture of a real-time spectrogram extraction processor that generates MFCC, LPC, and ENH_MFCC spectrograms simultaneously. In this paper, the details of converting the three sets of spectrograms 1) Mel-Frequency Cepstral Coefficients (MFCC), 2) Linear Predictive Coding Coefficients (LPC), and 3) Enhanced Mel-Frequency Cepstral Coefficients (ENH_MFCC) to their equivalent features are presented. In the WUW- SR system, the recognizer’s frontend is located at the terminal which is typically connected over a data network to remote back-end recognition (e.g., server). The WUW-SR is shown in Figure 1. The three sets of speech features are extracted at the front-end. These extracted features are then compressed and transmitted to the server via a dedicated channel, where subsequently they are decoded.

Open Access

Articles

Development of Application Specific Continuous Speech Recognition System in Hindi

Gaurav Gaurav, Devanesamoni Shakina Deiv, Gopal Krishna Sharma, Mahua Bhattacharya

Journal of Signal and Information Processing Vol.3 No.3, August 31, 2012

DOI: 10.4236/jsip.2012.33052
Open Access

Articles

Arabic Speech Recognition System Based on MFCC and HMMs

Hussien A. Elharati, Mohamed Alshaari, Veton Z. Këpuska

Journal of Computer and Communications Vol.8 No.3, March 5, 2020

DOI: 10.4236/jcc.2020.83003
Open Access

Articles

A Prototype of a Semantic Platform with a Speech Recognition System for Visual Impaired People

Jimmy Rosales-Huamaní, José Castillo-Sequera, Fabricio Puente-Mansilla, Gustavo Boza-Quispe

Journal of Intelligent Learning Systems and Applications Vol.7 No.4, September 29, 2015

DOI: 10.4236/jilsa.2015.74008
Open Access

Articles

Feature Optimization of Speech Emotion Recognition

Chunxia Yu, Ling Xie, Weiping Hu

Journal of Biomedical Science and Engineering Vol.9 No.10B, September 23, 2016

DOI: 10.4236/jbise.2016.910B005
Open Access

Articles

A Combination of Feature Selection and Co-occurrence Matrix Methods for Leukocyte Recognition System

Li Na, Arlends Chris, Bagus Mulyawan

Journal of Software Engineering and Applications Vol.5 No.12B, January 23, 2013

DOI: 10.4236/jsea.2012.512B020

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals by Subject

Publish with us

Article citationsMore>>

Home

About SCIRP

Service

Policies