Observatorio de I+D+i UPM

Research Reports
Research Publications in journals:
Novel Applications of Neural Networks in Speech Technology Systems: Search Space Reduction and Prosodic Modeling
Year: 2009
Research Areas
  • Artificial intelligence
  • Electronic circuits
  • Electronic devices
Information
Abstract
Neural networks (NNs) have been used extensively in speech technology systems. In this paper, we present two novel applications of NNs, one in speech recognition and one in text-to-speech systems. In very large vocabulary speech recognition systems that use the hypothesis-verification paradigm, the verification stage is usually the most time-consuming. State-of-the-art systems combine fixed-size hypothesized search spaces with advanced pruning techniques. We propose a novel strategy that calculates the hypothesized search space dynamically, using neural networks as the estimation module and designing the input feature set with a careful greedy selection approach. The main achievement is a statistically significant relative decrease in error rate of 33.53%, together with a relative decrease in average computational demands of up to 19.40%. Prosodic modeling is one of the most important tasks in developing a new text-to-speech synthesizer, especially for a high-quality, restricted-domain system with a female voice. Our twofold objective is to obtain accurate predictors for both the fundamental frequency (F0) curve and phoneme duration in a Spanish text-to-speech system by minimizing the model estimation error with a neural network estimator, which has proved to be an excellent modeling tool. The resulting system predicts prosody with very good results (for duration: an RMS error of 15.5 ms and a correlation factor of 0.8975; for F0: an RMS error of 19.80 Hz and a relative RMS error of 0.43), a clear improvement over our previous rule-based system.
International
Yes
JCR
Yes
Title
INTELLIGENT AUTOMATION AND SOFT COMPUTING
ISSN
1079-8587
Impact factor JCR
0.224
Impact info
Volume
15
Journal number
4
From page
631
To page
646
Month
January
Ranking
Participants
  • Author: Javier Macias Guarasa (UPM)
  • Author: Juan Manuel Montero Martinez (UPM)
  • Author: Javier Ferreiros Lopez (UPM)
  • Author: Ricardo de Cordoba Herralde (UPM)
  • Author: Ruben San Segundo Hernandez (UPM)
  • Author: Juana Maria Gutierrez Arriola (UPM)
  • Author: Luis Fernando D'Haro Enriquez (UPM)
  • Author: Fernando Fernandez Martinez (UPM)
  • Author: Roberto Barra Chicote (UPM)
  • Author: Jose Manuel Pardo Muñoz (UPM)
Related Research Groups, Departments and Institutes
  • Creator: Research Group: Grupo de Tecnología del Habla
  • Department: Ingeniería de Circuitos y Sistemas
  • Department: Ingeniería Electrónica