Descripción
|
|
---|---|
Nowadays, several computational techniques for speech recognition have been proposed. These techniques suppose an important improvement in real time applications where speaker interacts with speech recognition systems. Although researchers proposed many methods, none of them solve the high false alarm problem when far-field speakers interfere in a human-machine conversation. This paper presents a two-class (speech and non-speech classes) decision-tree based approach for combining new speech pulse features in a VAD (Voice Activity Detector) for rejecting far-field speech in speech recognition systems. This decision tree is applied over the speech pulses obtained by a baseline VAD composed of a frame feature extractor, a HMM-based (Hidden Markov Model) segmentation module and a pulse detector. The paper also presents a detailed analysis of a great amount of features for discriminating between close and far-field speech. The detection error obtained with the proposed VAD is the lowest compared to other well-known VADs. | |
Internacional
|
Si |
JCR del ISI
|
Si |
Título de la revista
|
Computers & Electrical Engineering |
ISSN
|
0045-7906 |
Factor de impacto JCR
|
0,484 |
Información de impacto
|
|
Volumen
|
37 |
DOI
|
10.1016/j.compeleceng.2011.04.005 |
Número de revista
|
4 |
Desde la página
|
589 |
Hasta la página
|
600 |
Mes
|
SIN MES |
Ranking
|
Q4 |