Research reports
Journal articles:
Speaker Diarization for Multiple-Distant-Microphone Meetings Using Several Sources of Information
Year: 2007

Research areas
  • Artificial intelligence,
  • Electronics industry

Data
Description
Human-machine interaction in meetings requires the localization and identification of the speakers interacting with the system, as well as the recognition of the words spoken. A seminal step toward this goal is the field of rich transcription research, which includes speaker diarization together with the annotation of sentence boundaries and the elimination of speaker disfluencies. The subarea of speaker diarization attempts to identify the number of participants in a meeting and create a list of speech time intervals for each such participant. In this paper, we analyze the correlation between signals coming from multiple microphones and propose an improved method for carrying out speaker diarization for meetings with multiple distant microphones. The proposed algorithm makes use of acoustic information and information from the delays between signals coming from the different sources. Using this procedure, we were able to achieve state-of-the-art performance in the NIST spring 2006 rich transcription evaluation, improving the Diarization Error Rate (DER) by 15 percent to 28 percent relative to previous systems.
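The abstract refers to information from the delays between signals arriving at different microphones. As a rough illustration of what such a delay is, the sketch below estimates the lag between two channels with a plain time-domain cross-correlation. This is a simplified stand-in and not the method of the paper: the function name `estimate_delay`, the `max_lag` parameter, the synthetic noise signal, and the 16 kHz sampling rate are all assumptions made for the example.

```python
import random


def estimate_delay(sig, ref, max_lag):
    """Return the lag (in samples) at which sig best matches ref.

    A positive result means sig lags behind ref. This is a brute-force
    cross-correlation, used here only as a hypothetical illustration.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for m, r in enumerate(ref):
            k = m + lag
            if 0 <= k < len(sig):
                score += sig[k] * r
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Synthetic example: the second "microphone" hears the same noise later.
rng = random.Random(0)
ref = [rng.gauss(0.0, 1.0) for _ in range(400)]
true_lag = 7  # assumed delay in samples
sig = [0.0] * true_lag + ref[:-true_lag]

sample_rate_hz = 16000  # assumed sampling rate
lag = estimate_delay(sig, ref, max_lag=20)
print("estimated lag (samples):", lag)
print("estimated delay (seconds):", lag / sample_rate_hz)
```

In a meeting recording, one such delay per microphone pair and per analysis window could serve as a feature alongside the acoustic ones. Real systems typically use more robust, frequency-weighted correlation measures than this brute-force version.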
International
Yes
ISI JCR
Yes
Journal title
IEEE T COMPUT
ISSN
0018-9340
JCR impact factor
1.68
Impact information
Volume
56
DOI
Issue number
9
From page
1212
To page
1224
Month
SEPTEMBER
Ranking

This activity belongs to research reports

Participants
  • Author: Xavier Anguera ICSI, Berkeley, USA
  • Author: Jose Manuel Pardo Muñoz UPM
  • Author: Charles Wooters ICSI, Berkeley, USA

Related research groups, departments, centers, and R&D&I institutes
  • Creator: Research Group: Grupo de tecnología del habla
  • Department: Ingeniería Electrónica