Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
Patrol Team Language Identification System for DARPA RATS P1 Evaluation
Año:2012
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
Datos
Descripción
This paper describes the language identification (LID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state of the art detection capabilities on audio from highly degraded communication channels. We show that techniques originally developed for LID on telephone speech (e.g., for the NIST language recognition evaluations) remain effective on the noisy RATS data, provided that careful consideration is applied when designing the training and development sets. In addition, we show significant improvements from the use of Wiener filtering, neural network based and language dependent i-vector modeling, and fusion.
Internacional
Si
Nombre congreso
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
Tipo de participación
960
Lugar del congreso
Portland, Oregon
Revisores
Si
ISBN o ISSN
1990-9772
DOI
Fecha inicio congreso
09/09/2012
Fecha fin congreso
13/09/2012
Desde la página
1
Hasta la página
4
Título de las actas
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Pavel Mat¿ejka (Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Czech Republic)
  • Autor: Old¿rich Plchot (Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Czech Republic)
  • Autor: Mehdi Soufifar (Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Czech Republic)
  • Autor: Ond¿rej Glembek (Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Czech Republic)
  • Autor: Luis Fernando D'Haro Enriquez (UPM)
  • Autor: Karel Veselý (Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Czech Republic)
  • Autor: Franti¿sek Grézl (Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Czech Republic)
  • Autor: Jeff Ma (Raytheon BBN Technologies, Cambridge, MA, USA)
  • Autor: Spyros Matsoukas (Raytheon BBN Technologies, Cambridge, MA, USA)
  • Autor: Najim Dehak (MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2022 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)