Observatorio de I+D+i UPM

Memorias de investigación
Communications at congresses:
Towards Glottal Source Controllability in Expressive Speech Synthesis
Year:2012
Research Areas
  • Electronic technology and of the communications,
  • Electric engineers, electronic and automatic (eil)
Information
Abstract
In order to obtain more human like sounding humanmachine interfaces we must first be able to give them expressive capabilities in the way of emotional and stylistic features so as to closely adequate them to the intended task. If we want to replicate those features it is not enough to merely replicate the prosodic information of fundamental frequency and speaking rhythm. The proposed additional layer is the modification of the glottal model, for which we make use of the GlottHMM parameters. This paper analyzes the viability of such an approach by verifying that the expressive nuances are captured by the aforementioned features, obtaining 95% recognition rates on styled speaking and 82% on emotional speech. Then we evaluate the effect of speaker bias and recording environment on the source modeling in order to quantify possible problems when analyzing multi-speaker databases. Finally we propose a speaking styles separation for Spanish based on prosodic features and check its perceptual significance.
International
Si
Congress
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
960
Place
Portland, Oregon
Reviewers
Si
ISBN/ISSN
1990-9772
Start Date
09/09/2012
End Date
13/09/2012
From page
1
To page
4
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
Participants
  • Autor: Jaime Lorenzo Trueba (UPM)
  • Autor: Roberto Barra Chicote (UPM)
  • Autor: Tuomo Raitio (Department of Signal Processing and Acoustics, Aalto University, Finland)
  • Autor: Nicolas Obin (Sound Analysis and Synthesis, IRCAM, Paris, France)
  • Autor: Paavo Alku (Department of Signal Processing and Acoustics, Aalto University, Finland)
  • Autor: Yunichi Yamagishi (CSTR, University of Edinburgh, United Kingdom)
  • Autor: Juan Manuel Montero Martinez (UPM)
Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2020 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)