Observatorio de I+D+i UPM

Memorias de investigación
Communications at congresses:
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM
Year:2016
Research Areas
  • Electronic technology and of the communications,
  • Electric engineers, electronic and automatic (eil)
Information
Abstract
This paper introduces a continuous system capable of automatically producing the most adequate speaking style to synthesize a desired target text. This is done thanks to a joint modeling of the acoustic and lexical parameters of the speaker models by adapting the CVSM projection of the training texts using MR-HMM techniques. As such, we consider that as long as sufficient variety in the training data is available, we should be able to model a continuous lexical space into a continuous acoustic space. The proposed continuous automatic text to speech system was evaluated by means of a perceptual evaluation in order to compare them with traditional approaches to the task. The system proved to be capable of conveying the correct expressiveness (average adequacy of 3.6) with an expressive strength comparable to oracle traditional expressive speech synthesis (average of 3.6) although with a drop in speech quality mainly due to the semi-continuous nature of the data (average quality of 2.9). This means that the proposed system is capable of improving traditional neutral systems without requiring any additional user interaction.
International
Si
Congress
COLING 2016, The 26th International Conference on Computational Linguistics
960
Place
Osaka, Japan
Reviewers
Si
ISBN/ISSN
9781510833388
Start Date
11/12/2016
End Date
16/12/2016
From page
369
To page
376
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics
Participants
  • Autor: Jaime Lorenzo Trueba (UPM)
  • Autor: Roberto Barra Chicote (UPM)
  • Autor: Ascensión Gallardo Antolín (Universidad Carlos III, Madrid)
  • Autor: Junichi Yamagishi (Associate Professor, Digital Content and Media Sciences Research Division, NII Tokio)
  • Autor: Juan Manuel Montero Martinez (UPM)
Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Centro o Instituto I+D+i: Centro de I+d+i en Procesado de la Información y Telecomunicaciones
  • Departamento: Ingeniería Electrónica
S2i 2020 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)