Detalhes do Documento

Spectral normalization MFCC derived features for robust speech recognition

Autor(es): Lima, C. S. cv logo 1 ; Tavares, Adriano cv logo 2 ; Silva, Carlos A. cv logo 3 ; Oliveira, Jorge F. cv logo 4

Data: 2004

Identificador Persistente: http://hdl.handle.net/1822/2047

Origem: RepositóriUM - Universidade do Minho

Assunto(s): Robust speech recognition; Features mapping


Descrição
This paper presents a method for extracting MFCC parameters from a normalised power spectrum density. The underlined spectral normalisation method is based on the fact that the speech regions with less energy need more robustness, since in these regions the noise is more dominant, thus the speech is more corrupted. Less energy speech regions contain usually sounds of unvoiced nature where are included nearly half of the consonants, and are by nature the least reliable ones due to the effective noise presence even when the speech is acquired under controlled conditions. This spectral normalisation was tested under additive artificial white noise in an Isolated Speech Recogniser and showed very promising results [1]. It is well known that concerned to speech representation, MFCC parameters appear to be more effective than power spectrum based features. This paper shows how the cepstral speech representation can take advantage of the above-referred spectral normalisation and shows some results in the continuous speech recognition paradigm in clean and artificial noise conditions.
Tipo de Documento Documento de conferência
Idioma Inglês
delicious logo  facebook logo  linkedin logo  twitter logo 
degois logo
mendeley logo

Documentos Relacionados



    Financiadores do RCAAP

Fundação para a Ciência e a Tecnologia Universidade do Minho   Governo Português Ministério da Educação e Ciência Programa Operacional da Sociedade do Conhecimento União Europeia