Observatorio de I+D+i UPM

Memorias de investigación
Proyecto de I+D+i:
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
The Simple4All project will create speech synthesis technology that learns from data with little or no expert supervision and continually improves itself, simply by being used. In order to be accepted by users, the voice of a spoken interaction system must be natural and appropriate for the content. Using the same voice for every application is not acceptable to users. But creating a speech synthesiser for a new language or domain is too expensive, because current technology relies on labelled data and human expertise. Systems comprise rules, statistical models, and data, requiring careful tuning by experienced engineers. So, speech synthesis is available from a small number of vendors, offering generic products, not tailored to any application domain. Systems are not portable: creating a bespoke system for a specific application is hard, because it involves substantial effort to re-engineer every component of the system. Take-up by potential end users is limited; the range of feasible applications is narrow. Synthesis is often an off-the-shelf component, providing a highly inappropriate speaking style for applications such as dialogue, speech translation, games, personal assistants, communication aids, SMS-to-speech conversion, e-learning, toys and a multitude of other applications where a specific speaking style is important. We will develop methods that enable the construction of systems from audio and text data. We will enable systems to learn after deployment. General purpose or specialised systems for any domain or language will become feasible. Our objectives are: Adaptability: create highly portable and adaptable speech synthesis technology suitable for any domain or language Learning from data and interaction: provide a complete, consistent framework in which every component of a speech synthesis system can be learned and improved Speaking style: enable the generation of natural, conversational, highly expressive synthetic speech which is appropriate to the wider context Demonstration and evaluation: automatic creation of a new speech synthesiser from scratch, and feedback-driven online learning, with perceptual evaluations.
Tipo de proyecto
Proyectos y convenios en convocatorias públicas competitivas
Entidad financiadora
Comisión Europea
Nacionalidad Entidad
Tamaño de la entidad
Gran Empresa (>250)
Fecha concesión
Esta actividad pertenece a memorias de investigación
  • Director: Juan Manuel Montero Martinez (UPM)
  • Participante: Javier Ferreiros Lopez (UPM)
  • Participante: José Carlos González Cristóbal (UPM- ETSIT)
  • Participante: Jose Manuel Pardo Muñoz (UPM)
  • Participante: Julián David Echeverry Correa (UPM - ETSIT)
  • Participante: Ricardo de Cordoba Herralde (UPM)
  • Participante: Roberto Barra Chicote (UPM)
  • Participante: Ruben San Segundo Hernandez (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Centro o Instituto I+D+i: Centro de I+d+i en Procesado de la Información y Telecomunicaciones
  • Departamento: Ingeniería Electrónica
S2i 2023 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)