Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
On the use of Phone-based Embeddings for Language Recognition
Año:2018
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
Datos
Descripción
Language Identification (LID) can be defined as the process for automatically identifying the language of a given spoken utterance. We have focused in a phonotactic approach in which the system input is the phonemes sequence generated by a speech recognizer (ASR), but instead phonemes we have used phonetic units that contain context information ?phone-grams?. In this context, we propose the use of Neural Embeddings (NEs) as features for those phone-grams sequences, which are used as entries in a classical i-Vectors framework to train a multi class logistic classifier. These NEs incorporate information from the neighbouring phone-grams in the sequence and model implicitly longer-context information. The NEs have been trained using both a Skip-Gram and a Glove Model. Experiments have been carried out on the KALAKA-3 database and we have used Cavg as metric to compare the systems. We propose as baseline the Cavg obtained using the NEs as features in the LID task, 24,69%. Our strategy to incorporate information from the neighbouring phone-grams to define the final sequences contributes to obtain up to 24,3% relative improvement over the baseline using Skip-Gram model and up to 32,4% using Glove model. Finally, the fusion of our best system with a MFCC-based acoustic i-Vectors system provides up to 34,1% improvement over the acoustic system alone.
Internacional
Si
Nombre congreso
IberSpeech 2018
Tipo de participación
970
Lugar del congreso
Barcelona - España
Revisores
Si
ISBN o ISSN
DOI
10.21437/IberSPEECH.2018-12
Fecha inicio congreso
21/11/2018
Fecha fin congreso
23/11/2018
Desde la página
55
Hasta la página
59
Título de las actas
Proceedings IberSPEECH 2018
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Christian Raúl Salamea Palacios (UPM)
  • Autor: Ricardo de Cordoba Herralde (UPM)
  • Autor: Luis Fernando D'Haro Enriquez (UPM)
  • Autor: Ruben San Segundo Hernandez (UPM)
  • Autor: Javier Ferreiros Lopez (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Centro o Instituto I+D+i: Centro de I+d+i en Procesado de la Información y Telecomunicaciones
  • Departamento: Ingeniería Electrónica
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)