Research Reports
Communications at congresses:
Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings
Year: 2007

Research Areas
  • Artificial intelligence
  • Electronics engineering

Information
Abstract
In the task of speaker diarization for meetings it has been shown in previous work that it is useful to use the Time Delay of Arrival (TDOA) between the different audio channels in the meeting room as an extra source of information in addition to the acoustic features. When combining feature streams, we use a weight to control the relative contributions of the streams. In the past, this weight was determined using development data and the same weight value was applied to all meetings. In this paper we present a method for automatically determining the weight. A metric derived from the Bayesian Information Criterion (BIC) computed for each feature stream estimates the weight for each meeting on the initial clustering iteration and adapts its value throughout the diarization process. By using this technique we achieve a more robust system and up to 18.2% relative improvement over the method of tuning the weight on development data.
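The stream weighting mentioned in the abstract can be illustrated with a minimal sketch. It assumes the two streams are combined as a weighted sum of per-stream log-likelihoods, with weights summing to one; the abstract does not spell out the formula, and the function and parameter names below are hypothetical. The BIC-derived estimate of the weight is not reproduced here.

```python
def combine_log_likelihoods(ll_acoustic, ll_tdoa, weight):
    """Combine two per-stream log-likelihoods for one cluster model.

    Assumption (not stated in the abstract): the streams are fused as
    weight * ll_acoustic + (1 - weight) * ll_tdoa, with weight in [0, 1].
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * ll_acoustic + (1.0 - weight) * ll_tdoa


# Illustrative values only: a fixed weight tuned on development data
# would be passed here; the paper instead estimates it per meeting.
score = combine_log_likelihoods(ll_acoustic=-10.0, ll_tdoa=-20.0, weight=0.9)
```

A weight of 1.0 uses only the acoustic stream and a weight of 0.0 uses only the TDOA stream, so the single parameter controls the relative contribution of each.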
International
Yes
Congress
International Conference on Acoustics Speech and Signal Processing, IEEE ICASSP 2007
960
Place
Honolulu, Hawaii
Reviewers
Yes
ISBN/ISSN
1-4233-0728-1
Start Date
15/04/2007
End Date
20/04/2007
From page
To page
Participants
  • Author: Xavier Anguera, ICSI, Berkeley, CA, USA
  • Author: Jose Manuel Pardo Muñoz, UPM
  • Author: Chuck Wooters, ICSI, Berkeley, CA, USA

Related Research Groups, Departments and Institutes
  • Creator: Research Group: Grupo de tecnología del habla
  • Department: Ingeniería Electrónica