Observatorio de I+D+i UPM

Memorias de investigación
Communications at congresses:
UPM system for WMT 2012
Year:2012
Research Areas
  • Electronic technology and of the communications,
  • Electric engineers, electronic and automatic (eil)
Information
Abstract
This paper describes the UPM system for the Spanish-English translation task at the NAACL 2012 workshop on statistical machine translation. This system is based on Moses. We have used all available free corpora, cleaning and deleting some repetitions. In this paper, we also propose a technique for selecting the sentences for tuning the system. This technique is based on the similarity with the sentences to translate. With our approach, we improve the BLEU score from 28.37% to 28.57%. And as a result of the WMT12 challenge we have obtained a 31.80% BLEU with the 2012 test set. Finally, we explain different experiments that we have carried out after the competition.
International
Si
Congress
7th Workshop on Statistical Machine Translation
960
Place
Montréal, Canada
Reviewers
Si
ISBN/ISSN
978-1-937284-20-6
Start Date
07/06/2012
End Date
08/06/2012
From page
338
To page
344
Proceedings of the 7th Workshop on Statistical Machine Translation
Participants
  • Autor: Veronica Lopez Ludeña (UPM)
  • Autor: Ruben San Segundo Hernandez (UPM)
  • Autor: Juan Manuel Montero Martinez (UPM)
Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Grupo de Tecnología del Habla
  • Departamento: Ingeniería Electrónica
S2i 2020 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)