Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Año:2016
Áreas de investigación
  • Ciencias de la computación y tecnología informática
Datos
Descripción
The experiments presented here exploit the properties of the Apertium RDF Graph, principally cycle density and nodes' degree, to automatically generate new translation relations between words, and therefore to enrich existing bilingual dictionaries with new entries. Currently, the Apertium RDF Graph includes data from 22 Apertium bilingual dictionaries and constitutes a large unified array of linked lexical entries and translations that are available and accessible on the Web (http://linguistic.linkeddata.es/apertium/). In particular, its graph structure allows for interesting exploitation opportunities, some of which are addressed in this paper. Two experiments are reported: in the first one, the original EN-ES translation set was removed from the Apertium RDF Graph and a new EN-ES version was generated. The results were compared against the previously removed EN-ES data and against the Concise Oxford Spanish Dictionary. In the second experiment, a new non-existent EN-FR translation set was generated. In this case the results were compared against a converted wiktionary English-French file. The results we got are really good and perform well for the extreme case of correlated polysemy. This led us to address the possibility to use cycles and nodes degree to identify potential oddities in the source data. If cycle density proves efficient when considering potential targets, we can assume that in dense graphs nodes with low degree may indicate potential errors.
Internacional
Si
Nombre congreso
10th Language Resources and Evaluation Conference (LREC'16)
Tipo de participación
960
Lugar del congreso
Portoro? (Slovenia)
Revisores
Si
ISBN o ISSN
978-2-9517408-9-1
DOI
Fecha inicio congreso
23/05/2016
Fecha fin congreso
28/05/2016
Desde la página
868
Hasta la página
876
Título de las actas
Proc. of 10th Language Resources and Evaluation Conference (LREC'16)
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Marta Villegas (Universitat Pompeu Fabra)
  • Autor: Maite Melero (Universitat Pompeu Fabra)
  • Autor: Jorge Gracia Del Rio (UPM)
  • Autor: Nuria Bel (Universitat Pompeu Fabra)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Ontology Engineering Group
  • Departamento: Inteligencia Artificial
S2i 2022 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)