Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
Extended Phone Log-Likelihood Ratio Features and Acoustic-Based I-Vectors for Language Recognition
Año:2014
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
Datos
Descripción
This paper presents new techniques with relevant improvements added to the primary system presented by our group to the Albayzin 2012 LRE competition, where the use of any additional corpora for training or optimizing the models was forbidden. In this work, we present the incorporation of an additional phonotactic subsystem based on the use of phone log-likelihood ratio features (PLLR) extracted from different phonotactic recognizers that contributes to improve the accuracy of the system in a 21.4% in terms of Cavg (we also present results for the official metric during the evaluation, Fact). We will present how using these features at the phone state level provides significant improvements, when used together with dimensionality reduction techniques, especially PCA. We have also experimented with applying alternative SDC-like configurations on these PLLR features with additional improvements. Also, we will describe some modifications to the MFCC-based acoustic i-vector system which have also contributed to additional improvements. The final fused system outperformed the baseline in 27.4% in Cavg.
Internacional
Si
Nombre congreso
IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2014
Tipo de participación
960
Lugar del congreso
Florencia, Italia
Revisores
Si
ISBN o ISSN
9781479928941
DOI
Fecha inicio congreso
04/05/2014
Fecha fin congreso
09/05/2014
Desde la página
5342
Hasta la página
5346
Título de las actas
Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2014
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Luis Fernando D'Haro Enriquez (UPM)
  • Autor: Ricardo de Cordoba Herralde (UPM)
  • Autor: Christian Raúl Salamea Palacios (UPM)
  • Autor: Julian David Echeverry Correa (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)