Research Reports
Communications at congresses:
Incorporation of Language Discriminative Information into Recurrent Neural Networks Models to LID Tasks
Year: 2019

Research Areas
  • Information technology and data processing

Information
Abstract
Language Identification (LID) is an essential research topic in the Automatic Speech Recognition area. One of the most important characteristics of a language is context information. In this article, a novel technique is proposed to introduce such context information, following a phonotactic approach that uses phonetic units called "phone-grams". Language discriminative information is incorporated at the weight initialization stage when generating the Recurrent Neural Network Language Models (RNNLMs) in order to improve the Language Identification task. The technique has been evaluated on the KALAKA-3 database, which contains 108 h of audio in the six languages to be recognized. The metric used in this work is the Average Detection Cost, Cavg. To incorporate context information into the features used to train the RNNLMs, phone-grams of two elements ("2phone-grams") and of three elements ("3phone-grams") have been considered, obtaining relative improvements of up to 17% and 15.44%, respectively, compared to the results obtained using RNNLMs.
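As a rough illustration of the phone-gram idea mentioned in the abstract, the sketch below groups consecutive phones into units of two or three elements. The abstract does not say how the units are formed, so the sliding window with a step of one, the `_` separator, the function name `phone_grams`, and the sample phone sequence are all assumptions made for this example, not the authors' procedure.

```python
from typing import List


def phone_grams(phones: List[str], n: int, sep: str = "_") -> List[str]:
    """Join every run of n consecutive phones into a single unit.

    Assumption: overlapping windows with a step of one phone; the
    record does not state how the paper builds its phone-grams.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    return [sep.join(phones[i:i + n]) for i in range(len(phones) - n + 1)]


# Hypothetical output of a phone recognizer for a short utterance.
phones = ["o", "l", "a", "m", "u", "n"]

two_phone_grams = phone_grams(phones, 2)    # units of two elements
three_phone_grams = phone_grams(phones, 3)  # units of three elements
```

Under these assumptions, each resulting unit carries its neighbouring phones with it, which is one way the features fed to a language model could encode local context.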
International
Yes
Congress
International Conference on Smart Technologies, Systems and Applications
960
Place
Cham
Reviewers
Yes
ISBN/ISSN
1865-0929
10.1007/978-3-030-46785-2_14
Start Date
02/12/2019
End Date
03/12/2019
From page
165
To page
175
Proceedings of the International Conference on Smart Technologies, Systems and Applications
Participants

Related Research Groups, Departments and Institutes
  • Creator: Research Group: Grupo de Tecnología del Habla
  • Department: Ingeniería Electrónica
  • R&D&I Center or Institute: Centro de I+D+i en Procesado de la Información y Telecomunicaciones