Observatorio de I+D+i UPM

Research reports
Conference papers:
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
Year: 2010
Research Areas
  • Electronic and communications technology
  • Electrical, electronic and automatic engineering (eil)
Information
Abstract
In this paper, we describe a new multi-purpose audio-visual database in the context of speech interfaces for controlling household electronic devices. The database comprises speech and video recordings of 19 speakers interacting with a HIFI audio box by means of a spoken dialogue system. Dialogue management is based on Bayesian Networks, and the system is provided with contextual information handling strategies. Each speaker was asked to fulfil different sets of specific goals following predefined scenarios, which varied in both complexity level and the degree of freedom or initiative allowed to the user. Thanks to its careful design and its size, the recorded database allows comprehensive studies on speech recognition, speech understanding, dialogue modeling and management, microphone-array-based speech processing, and both speech- and video-based acoustic source localisation. The database has been labelled for quality and efficiency studies on dialogue performance. The whole database has been validated through both objective and subjective tests.
International
Yes
Congress
Seventh International Conference on Language Resources and Evaluation (LREC'10), ELRA
960
Place
Valletta, Malta
Reviewers
Yes
ISBN/ISSN
2-9517408-6-7
Start Date
19/05/2010
End Date
21/05/2010
From page
2974
To page
2980
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), ELRA
Participants
  • Author: Fernando Fernandez Martinez (UPM)
  • Participant: Javier Macías Guarasa (Universidad Alcalá de Henares, Madrid)
  • Author: Juan Manuel Lucas Cuesta (UPM)
  • Author: Roberto Barra Chicote (UPM)
  • Author: Javier Ferreiros Lopez (UPM)
Related Research Groups, Departments and Institutes
  • Creator: Research Group: Grupo de Tecnología del Habla (Speech Technology Group)
  • Department: Electronic Engineering