Memorias de investigación
Ponencias en congresos:
Detecting Acronyms from Capital Letter Sequences in Spanish
Año:2012

Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática

Datos
Descripción
This paper presents an automatic strategy to decide how to pronounce a Capital Letter Sequence (CLS) in a Text to Speech system (TTS). If CLS is well known by the TTS, it can be expanded in several words. But when the CLS is unknown, the system has two alternatives: spelling it (abbreviation) or pronouncing it as a new word (acronym). In Spanish, there is a high relationship between letters and phonemes. Because of this, when a CLS is similar to other words in Spanish, there is a high tendency to pronounce it as a standard word. This paper proposes an automatic method for detecting acronyms. Additionaly, this paper analyses the discrimination capability of some features, and several strategies for combining them in order to obtain the best classifier. For the best classifier, the classification error is 8.45%. About the feature analysis, the best features have been the Letter Sequence Perplexity and the Average N-gram order.
Internacional
Si
Nombre congreso
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association
Tipo de participación
960
Lugar del congreso
Portland, Oregon.
Revisores
Si
ISBN o ISSN
1990-9772
DOI
Fecha inicio congreso
09/09/2013
Fecha fin congreso
13/09/2013
Desde la página
1
Hasta la página
4
Título de las actas
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association

Esta actividad pertenece a memorias de investigación

Participantes

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica