Research Reports
Communications at congresses:
On the limitations of voice conversion techniques in emotion identification tasks
Year: 2007

Research Areas
  • Artificial intelligence
  • Electronics engineering

Information
Abstract
The growing interest in emotional speech synthesis urges effective emotion conversion techniques to be explored. This paper estimates the relevance of three speech components (spectral envelope, residual excitation and prosody) for synthesizing identifiable emotional speech, in order to be able to customize voice conversion techniques to the specific characteristics of each emotion. The analysis has been based on a listening test with a set of synthetic mixed-emotion utterances that draw their speech components from emotional and neutral recordings. Results prove the importance of transforming residual excitation for the identification of emotions that are not fully conveyed through prosodic means (such as cold anger or sadness in our Spanish corpus).
International
Yes
Congress
8th Annual Conference of the International Speech Communication Association (Interspeech 2007)
960
Place
Antwerp, Belgium
Reviewers
Yes
ISBN/ISSN
ISSN 1990-9772
Start Date
27/08/2007
End Date
31/08/2007
From page
To page
Participants

Related Research Groups, Departments and Institutes
  • Creator: Research Group: Grupo de Tecnología del Habla
  • Department: Ingeniería Electrónica