Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
Towards Speaking Style Transplantation in Speech Synthesis
Año:2013
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
Datos
Descripción
One of the biggest challenges in speech synthesis is the production of naturally sounding synthetic voices. This means that the resulting voice must be not only of high enough quality but also that it must be able to capture the natural expressiveness imbued in human speech. This paper focus on solving the expressiveness problem by proposing a set of different techniques that could be used for extrapolating the expressiveness of proven high quality speaking style models into neutral speakers in HMM-based synthesis. As an additional advantage, the proposed techniques are based on adaptation approaches, which means that they can be used with little training data (around 15 minutes of training data are used in each style for this paper). For the final implementation, a set of 4 speaking styles were considered: news broadcasts, live sports commentary, interviews and parliamentary speech. Finally, the implementation of the 5 techniques were tested through a perceptual evaluation that proves that the deviations between neutral and speaking style average models can be learned and used to imbue expressiveness into target neutral speakers as intended.
Internacional
Si
Nombre congreso
SSW8 2013 - 8th ISCA Speech Synthesis Workshop
Tipo de participación
960
Lugar del congreso
Barcelona (España)
Revisores
Si
ISBN o ISSN
0000-0000
DOI
Fecha inicio congreso
31/08/2013
Fecha fin congreso
02/09/2013
Desde la página
159
Hasta la página
163
Título de las actas
Proceedings SSW8 2013 - 8th ISCA Speech Synthesis Workshop
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Jaime Lorenzo Trueba (UPM)
  • Autor: Roberto Barra Chicote (UPM)
  • Autor: Junichi Yamagishi (CSTR, University of Edinburgh, United Kingdom)
  • Autor: Oliver Watts (CSTR, University of Edinburgh, United Kingdom)
  • Autor: Juan Manuel Montero Martinez (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)