Memorias de investigación
Research Publications in journals:
Speaker Diarization Features: The UPM Contribution to the RT09 Evaluation
Year:2011

Research Areas
  • Electronic technology and of the communications,
  • Electric engineers, electronic and automatic (eil)

Information
Abstract
Two new features have been proposed and used in the Rich Transcription Evaluation 2009 by the Universidad Politécnica de Madrid, which outperform the results of the baseline system. One of the features is the intensity channel contribution, a feature related to the location of the speaker. The second feature is the logarithm of the interpolated fundamental frequency. It is the first time that both features are applied to the clustering stage of multiple distant microphone meetings diarization. It is shown that the inclusion of both features improves the baseline results by 15.36% and 16.71% relative to the development set and the RT 09 set, respectively. If we consider speaker errors only, the relative improvement is 23% and 32.83% on the development set and the RT09 set, respectively.
International
Si
JCR
Si
Title
Ieee Transactions on Audio, Speech, And Language Processing
ISBN
1558-7916
Impact factor JCR
1,668
Impact info
Volume
20
10.1109/TASL.2011.2159971
Journal number
2
From page
426
To page
435
Month
SIN MES
Ranking
Q1
Participants

Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica