UPM R&D&I Observatory

Research reports
Journal articles:
An insight to the automatic categorization of speakers according to sex and its application to the detection of voice pathologies: A comparative study
Year: 2016
Research areas
  • Computer science and information technology
Data
Description
Automatically categorizing speakers according to their sex improves the performance of an automatic detector of voice pathologies. This is grounded in findings demonstrating perceptual, acoustical and anatomical differences between male and female voices. In particular, this paper pursues two objectives: 1) to design a system that automatically discriminates the sex of a speaker from normophonic and pathological speech, and 2) to study the influence that this sex detector has on the accuracy of a subsequent voice pathology detector. The parameterization of the automatic sex detector relies on Mel-frequency cepstral coefficients (MFCC) applied to speech, and on MFCC applied to glottal waveforms plus parameters modeling the vocal tract. The glottal waveforms are extracted from speech via iterative lattice inverse filters. For the pathology detector, an MFCC parameterization is applied to speech signals. Classification, in both the sex and pathology detectors, is carried out using state-of-the-art techniques based on universal background models. Experiments are performed on the Saarbrücken database, employing the sustained phonation of the vowel /a/. Results indicate that the sex of the speaker may be discriminated automatically using normophonic and pathological speech, with accuracy of up to 95%. Moreover, including a priori information about the sex of the speaker produces an absolute improvement in equal error rate (EER) of about 2% on pathology detection tasks.
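The abstract reports its pathology-detection gain as an absolute change in EER. As a reading aid, the following is a minimal sketch of how an equal error rate can be approximated from detector scores. It is not the authors' evaluation code: the function name `equal_error_rate`, the simple threshold sweep, and the example scores are all assumptions made for illustration.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Hypothetical helper for illustration only; higher scores are assumed to
    mean "more likely target" (e.g. more likely pathological).
    """
    if not target_scores or not nontarget_scores:
        raise ValueError("both score lists must be non-empty")

    best_gap, best_eer = None, None
    for threshold in sorted(set(target_scores) | set(nontarget_scores)):
        # Miss rate: fraction of target trials scored below the threshold.
        frr = sum(s < threshold for s in target_scores) / len(target_scores)
        # False-alarm rate: fraction of non-target trials at or above it.
        far = sum(s >= threshold for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        # Keep the operating point where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, best_eer = gap, (far + frr) / 2
    return best_eer


# Made-up scores, not data from the paper.
pathological = [0.9, 0.8, 0.7, 0.4]
normophonic = [0.6, 0.3, 0.2, 0.1]
print(equal_error_rate(pathological, normophonic))  # prints 0.25
```

Under this reading, an absolute EER improvement of about 2% would mean the value returned for the sex-aware detector is roughly 0.02 lower than for the sex-independent one, evaluated on the same trials.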
International
No
ISI JCR
No
Journal title
Revista de la Facultad de Ingeniería
ISSN
0798-4065
JCR impact factor
Impact information
Volume
DOI
10.17533/udea.redin.n79a06
Issue number
79
First page
50
Last page
62
Month
No month
Ranking
This activity belongs to research reports
Participants
  • Author: Jorge Andres Gomez Garcia (UPM)
  • Author: Laureano Moro Velazquez (UPM)
  • Author: Juan Ignacio Godino Llorente (UPM)
  • Author: César-Germán Castellanos-Domínguez (Universidad Nacional de Colombia)
Related research groups, departments, centers and R&D&I institutes
  • Creator: Research group: Informática Aplicada al Procesado de Señal e Imagen
  • Department: Teoría de la Señal y Comunicaciones (Provisional)
S2i 2021 Research Observatory @ UPM, in collaboration with the Consejo Social UPM
Co-funded by MINECO under the INNCIDE 2011 Programme (OTR-2011-0236)
Co-funded by MINECO under the INNPACTO Programme (IPT-020000-2010-22)