Memorias de investigación
Ponencias en congresos:
LOW-RESOURCE LANGUAGE RECOGNITION USING A FUSION OF PHONEME POSTERIORGRAM COUNTS, ACOUSTIC AND GLOTTAL-BASED I-VECTORS
Año:2013

Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática

Datos
Descripción
This paper presents a description of our system for the Albayzin 2012 LRE competition. One of the main characteristics of this evaluation was the reduced number of available files for training the system, especially for the empty condition where no training data set was provided but only a development set. In addition, the whole database was created from online videos and around one third of the training data was labeled as noisy files. Our primary system was the fusion of three different i-vector based systems: one acoustic system based on MFCCs, a phonotactic system using trigrams of phone-posteriorgram counts, and another acoustic system based on RPLPs that improved robustness against noise. A contrastive system that included new features based on the glottal source was also presented. Official and postevaluation results for all the conditions using the proposed metrics for the evaluation and the Cavg metric are presented in the paper.
Internacional
Si
Nombre congreso
IEEE (ICASSP) International Conference on acoustics, Speech and Signal Processing 2013
Tipo de participación
960
Lugar del congreso
Vancouver, Canada
Revisores
Si
ISBN o ISSN
978-1-4799-0356-6
DOI
Fecha inicio congreso
26/05/2013
Fecha fin congreso
31/05/2014
Desde la página
6852
Hasta la página
6856
Título de las actas
Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013

Esta actividad pertenece a memorias de investigación

Participantes

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica