Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
Año:2010
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
Datos
Descripción
In this paper, we describe a new multi-purpose audio-visual database on the context of speech interfaces for controlling household electronic devices. The database comprises speech and video recordings of 19 speakers interacting with a HIFI audio box by means of a spoken dialogue system. Dialogue management is based on Bayesian Networks and the system is provided with contextual information handling strategies. Each speaker was requested to fulfil different sets of specific goals following predefined scenarios, according to both different complexity levels and degrees of freedom or initiative allowed to the user. Due to a careful design and its size, the recorded database allows comprehensive studies on speech recognition, speech understanding, dialogue modeling and management, microphone array based speech processing, and both speech and video-based acoustic source localisation. The database has been labelled for quality and efficiency studies on dialogue performance. The whole database has been validated through both objective and subjective tests.
Internacional
Si
Nombre congreso
Seventh conference on International Language Resources and Evaluation (LREC'10), ELRA
Tipo de participación
960
Lugar del congreso
Valletta, Malta
Revisores
Si
ISBN o ISSN
2-9517408-6-7
DOI
Fecha inicio congreso
19/05/2010
Fecha fin congreso
21/05/2010
Desde la página
2974
Hasta la página
2980
Título de las actas
Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), ELRA
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Fernando Fernandez Martinez (UPM)
  • Participante: Javier Macías Guarasa (Universidad Alcalá de Henares, Madrid)
  • Autor: Juan Manuel Lucas Cuesta (UPM)
  • Autor: Roberto Barra Chicote (UPM)
  • Autor: Javier Ferreiros Lopez (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)