Observatorio de I+D+i UPM

Memorias de investigación
Artículos en revistas:
Novel Applications of Neural Networks in Speech Technology Systems: Search Space Reduction and Prosodic Modeling
Año:2009
Áreas de investigación
  • Inteligencia artificial,
  • Circuitos electrónicos,
  • Dispositivos electrónicos
Datos
Descripción
Neural networks (NNs) have been extensively used in speech technology systems. In this paper, we present two novel applications of NNs in speech recognition and text-to-speech systems. In very large vocabulary speech recognition systems using the hypothesis-verification paradigm, the verification stage is usually the most time consuming. State of the art systems combine fixed size hypothesized search spaces with advanced pruning techniques. We propose a novel strategy to dynamically calculate the hypothesized search space, using neural networks as the estimation module and designing the input feature set with a careful greedy-based selection approach. The main achievement has been a statistically significant relative decrease in error rate of 33.53%, while getting a relative decrease in average computational demands of up to 19.40%. The prosodic modeling is one of the most important tasks for developing a new text-to-speech synthesizer, especially in a female-voice high-quality restricted-domain system. Our double objective is to get accurate predictors for both the fundamental frequency (F0) curve and phoneme duration by minimizing the model estimation error in a Spanish text-to-speech system, by means of a neural network estimator, which has proved to be an excellent tool for the modeling. The resulting system predicts prosody with very good results (for duration: 15.5 ms in RMS and a correlation factor of 0.8975; for F0: 19.80 Hz in RMS and a relative RMS error of 0.43) that clearly improves our previous rule-based system.
Internacional
Si
JCR del ISI
Si
Título de la revista
INTELLIGENT AUTOMATION AND SOFT COMPUTING
ISSN
1079-8587
Factor de impacto JCR
0,224
Información de impacto
Volumen
15
DOI
Número de revista
4
Desde la página
631
Hasta la página
646
Mes
ENERO
Ranking
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Javier Macias Guarasa (UPM)
  • Autor: Juan Manuel Montero Martinez (UPM)
  • Autor: Javier Ferreiros Lopez (UPM)
  • Autor: Ricardo de Cordoba Herralde (UPM)
  • Autor: Ruben San Segundo Hernandez (UPM)
  • Autor: Juana Maria Gutierrez Arriola (UPM)
  • Autor: Luis Fernando D'Haro Enriquez (UPM)
  • Autor: Fernando Fernandez Martinez (UPM)
  • Autor: Roberto Barra Chicote (UPM)
  • Autor: Jose Manuel Pardo Muñoz (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería de Circuitos y Sistemas
  • Departamento: Ingeniería Electrónica
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)