SciELO - Scientific Electronic Library Online

 
vol.40 issue3Iris Recognition in the Visible Spectrum Based on Eye Image Quality EvaluationProfile Face Image Frontalization based on landmark points and 3D Generic Elastic Model author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Article

Indicators

  • Have no cited articlesCited by SciELO

Related links

  • Have no similar articlesSimilars in SciELO

Share


Ingeniería Electrónica, Automática y Comunicaciones

On-line version ISSN 1815-5928

Abstract

RAMIREZ SANCHEZ, José Manuel; MONTALVO BEREAU, Ana Rosa  and  CALVO DE LARA, José Ramón. Evaluation of Acoustic Features for the Automatic Speech Recognition in Noise Scenarios using Kaldi. EAC [online]. 2019, vol.40, n.3, pp. 51-71.  Epub Sep 08, 2019. ISSN 1815-5928.

The present investigation will evaluate the impact of Mel Frequency Cepstral Coefficients (MFCC) and the Perceptual Linear Predictors (PLP) coefficients, in the word error rate (WER) of systems dedicated to Automatic Speech Recognition (ASR). The experimentation will be done with voice signals in Spanish language, in scenarios with unknown noise levels and using the Kaldi state of the art tool. The article concludes by providing evidence in favor of the MFCC as acoustic feature more robust in the task of ASR in noisy scenarios with respect to the PLP; also both features behave similarly in low noise scenarios and the impact of PLP in reducing the time spent by systems dedicated to ASR.

Keywords : Automatic Speech Recognition; Acoustic Features; Kaldi.

        · abstract in Spanish     · text in Spanish     · Spanish ( pdf )