Memorias de investigación
Ponencias en congresos:
On the use of phone-gram units in recurrent neural networks for language identification
Año:2016

Áreas de investigación
  • Ingeniería eléctrica, electrónica y automática

Datos
Descripción
In this paper we present our results on using RNN-based LM scores trained on different phone-gram orders and using different phonetic ASR recognizers. In order to avoid data sparseness problems and to reduce the vocabulary of all possible n-gram combinations, a K-means clustering procedure was performed using phone-vector embeddings as a pre-processing step. Additional experiments to optimize the amount of classes, batch-size, hidden neurons, state-unfolding, are also presented. We have worked with the KALAKA-3 database for the plenty-closed condition [1]. Thanks to our clustering technique and the combination of high level phonegrams, our phonotactic system performs ~13% better than the unigram-based RNNLM system. Also, the obtained RNNLM scores are calibrated and fused with other scores from an acoustic-based i-vector system and a traditional PPRLM system. This fusion provides additional improvements showing that they provide complementary information to the LID system.
Internacional
Si
Nombre congreso
Odyssey 2016 - The Speaker and Language Recognition Workshop
Tipo de participación
960
Lugar del congreso
Bilbao - España
Revisores
Si
ISBN o ISSN
2312-2846
DOI
Fecha inicio congreso
21/06/2016
Fecha fin congreso
24/06/2017
Desde la página
117
Hasta la página
123
Título de las actas
Odyssey 2016: The Speaker and Language Recognition Workshop

Esta actividad pertenece a memorias de investigación

Participantes

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Centro o Instituto I+D+i: Centro de I+d+i en Procesado de la Información y Telecomunicaciones
  • Departamento: Ingeniería Electrónica