Observatorio de I+D+i UPM

Memorias de investigación
Artículos en revistas:
Automatic categorization for improving Spanish into Spanish Sign Language machine translation
Año:2012
Áreas de investigación
  • Tecnología electrónica y de las comunicaciones,
  • Ingeniería eléctrica, electrónica y automática
Datos
Descripción
This paper describes a preprocessing module for improving the performance of a Spanish into Spanish Sign Language (Lengua de Signos Española: LSE) translation system when dealing with sparse training data. This preprocessing module replaces Spanish words with associated tags. The list with Spanish words (vocabulary) and associated tags used by this module is computed automatically considering those signs that show the highest probability of being the translation of every Spanish word. This automatic tag extraction has been compared to a manual strategy achieving almost the same improvement. In this analysis, several alternatives for dealing with non-relevant words have been studied. Non-relevant words are Spanish words not assigned to any sign. The preprocessing module has been incorporated into two well-known statistical translation architectures: a phrasebased system and a Statistical Finite State Transducer (SFST). This system has been developed for a specific application domain: the renewal of Identity Documents and Driver?s License. In order to evaluate the system a parallel corpus made up of 4,080 Spanish sentences and their LSE translation has been used. The evaluation results revealed a significant performance improvement when including this preprocessing module. In the phrase-based system, the proposed module has given rise to an increase in BLEU (Bilingual Evaluation Understudy) from 73.8% to 81.0% and an increase in the human evaluation score from 0.64 to 0.83. In the case of SFST, BLEU increased from 70.6% to 78.4% and the human evaluation score from 0.65 to 0.82.
Internacional
Si
JCR del ISI
Si
Título de la revista
Computer Speech And Language
ISSN
0885-2308
Factor de impacto JCR
1,319
Información de impacto
Volumen
26
DOI
10.1016/j.csl.2011.09.003
Número de revista
3
Desde la página
149
Hasta la página
167
Mes
SIN MES
Ranking
COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE 50/111
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Veronica Lopez Ludeña (UPM)
  • Autor: Ruben San Segundo Hernandez (UPM)
  • Autor: Juan Manuel Montero Martinez (UPM)
  • Autor: Ricardo de Cordoba Herralde (UPM)
  • Autor: Javier Ferreiros Lopez (UPM)
  • Autor: Jose Manuel Pardo Muñoz (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)