Observatorio de I+D+i UPM

Memorias de investigación

Ponencias en congresos:

A�o:2014

Descripci�n
One of the biggest challenges in speech synthesis is the production of contextually-appropriate naturally sounding synthetic voices. This means that a Text-To-Speech system must be able to analyze a text beyond the sentence limits in order to select, or even modulate, the speaking style according to a broader context. Our current architecture is based on a two-step approach: text genre identification and speaking style synthesis according to the detected discourse genre. For the final implementation, a set of four genres and their corresponding speaking styles were considered: broadcast news, live sport commentaries, interviews and political speeches. In the final TTS evaluation, the four speaking styles were transplanted to the neutral voices of other speakers not included in the training database. When the transplanted styles were compared to the neutral voices, transplantation was significantly preferred and the similarity to the target speaker was as high as 78%.
Internacional	Si
Nombre congreso	2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014)
Tipo de participaci�n	960
Lugar del congreso	Penang, Malaysia
Revisores	Si
ISBN o ISSN	978-967-394-199-5
DOI
Fecha inicio congreso	11/09/2014
Fecha fin congreso	12/09/2014
Desde la p�gina	39
Hasta la p�gina	42
T�tulo de las actas	Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia

Participantes

Autor: Jaime Lorenzo Trueba UPM
Autor: Julian David Echeverry Correa UPM
Autor: Roberto Barra Chicote UPM
Autor: Javier Ferreiros Lopez UPM
Autor: Ascensi�n Gallardo Antol�n Universidad Carlos III de Madrid (UC3M)
Autor: Junichi Yamagishi Centre for Speech Technology Research (CSTR), University of Edinburgh / National Institute of Informatics (NII), Tokio
Autor: Simon King Centre for Speech Technology Research (CSTR), University of Edinburgh
Autor: Juan Manuel Montero Martinez UPM