Memorias de investigación
Ponencias en congresos:
Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings
Año:2007

Áreas de investigación
  • Inteligencia artificial,
  • Industria electrónica

Datos
Descripción
In the task of speaker diarization for meetings it has been shown in previous work that it is useful to use the Time Delay of Arrival (TDOA) between the different audio channels in the meeting room as an extra source of information in addition to the acoustic features. When combining feature streams, we use a weight to control the relative contributions of the streams. In the past, this weight was determined using development data and the same weight value was applied to all meetings. In this paper we present a method for automatically determining the weight. A metric derived from the Bayesian Information Criterion (BIC) computed for each feature stream estimates the weight for each meeting on the initial clustering iteration and adapts its value throughout the diarization process. By using this technique we achieve a more robust system and up to 18.2% relative improvement over the method of tuning the weight on development data.
Internacional
Si
Nombre congreso
International Conference on Acoustics Speech and Signal Processing, IEEE ICASSP 2007
Tipo de participación
960
Lugar del congreso
Honolulu, Hawai
Revisores
Si
ISBN o ISSN
1-4233-0728-1
DOI
Fecha inicio congreso
15/04/2007
Fecha fin congreso
20/04/2007
Desde la página
Hasta la página
Título de las actas

Esta actividad pertenece a memorias de investigación

Participantes
  • Autor: Xavier Anguera ICSI, Berkeley, CA USA
  • Autor: Jose Manuel Pardo Muñoz UPM
  • Autor: Chuck Wooters ICSI, Berkeley, CA USA

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de tecnología del habla
  • Departamento: Ingeniería Electrónica