Research Reports
Conference papers:
Language Recognition using Phonotactic-based Shifted Delta Coefficients and Multiple Phone Recognizers
Year: 2014

Research Areas
  • Electronic and communications technology
  • Electrical, electronic and automatic engineering (eil)

Information
Abstract
This paper describes a new language recognition technique that applies the Shifted Delta Coefficients (SDC) approach to phone log-likelihood ratio (PLLR) features. The new methodology incorporates long-span phonetic information at the frame level while accounting for the temporal length of each phone unit. The proposed features are used to train an i-vector based system, which is tested on the Albayzin LRE 2012 dataset. The results show a relative improvement of 33.3% in Cavg compared with different state-of-the-art acoustic i-vector based systems. The paper also presents the integration of parallel phone ASR systems, each of which generates PLLR coefficients that are stacked together and then projected into a reduced dimension. Finally, the paper shows that incorporating state information from the phone ASR provides additional improvements, and that fusion with the other acoustic and phonotactic systems yields an improvement of 25.8% over the system presented during the competition.
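The core idea in the abstract, applying shifted deltas to PLLR features, can be illustrated with a minimal sketch. This is not the authors' implementation: the PLLR formula below assumes flat phone priors, the SDC uses the conventional d-P-k parameterization, and the function names, parameter values and edge-padding strategy are all assumptions chosen for illustration, since the abstract does not state the configuration used.

```python
import numpy as np


def pllr_from_posteriors(post, eps=1e-10):
    """Convert frame-level phone posteriors (T x N) into log-likelihood ratios.

    Assumption: flat phone priors, so each ratio compares one phone's
    posterior against the average posterior of the remaining N - 1 phones.
    """
    post = np.clip(post, eps, 1.0 - eps)
    n_phones = post.shape[1]
    return np.log(post) - np.log((1.0 - post) / (n_phones - 1))


def shifted_delta(feats, d=1, p=3, k=7):
    """Stack k delta blocks per frame: c(t + i*p + d) - c(t + i*p - d).

    feats is (T x N); the result is (T x N*k). Frames beyond the edges are
    filled by repeating the first/last frame (an illustrative choice).
    """
    n_frames = feats.shape[0]
    pad_after = d + (k - 1) * p
    padded = np.pad(feats, ((d, pad_after), (0, 0)), mode="edge")
    blocks = []
    for i in range(k):
        start = i * p  # padded index of original frame t + i*p - d, for t = 0
        minus = padded[start:start + n_frames]
        plus = padded[start + 2 * d:start + 2 * d + n_frames]
        blocks.append(plus - minus)
    return np.hstack(blocks)
```

Calling `shifted_delta(pllr_from_posteriors(post))` on a (T x N) posterior matrix gives one long-span feature vector per frame, which is the kind of input an i-vector system would then be trained on.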
International
Yes
Congress
15th Annual Conference of the International Speech Communication Association. Interspeech 2014
960
Place
Singapore
Reviewers
Yes
ISBN/ISSN
978-1-63439-435-2
Start Date
14/09/2014
End Date
18/09/2014
From page
3042
To page
3046
Proceedings Interspeech 2014
Participants

Related Research Groups, Departments and Institutes
  • Creator: Research Group: Grupo de Tecnología del Habla
  • Department: Ingeniería Electrónica