Memorias de investigación
Ponencias en congresos:
Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Año:2016

Áreas de investigación
  • Ciencias de la computación y tecnología informática

Datos
Descripción
The experiments presented here exploit the properties of the Apertium RDF Graph, principally cycle density and nodes' degree, to automatically generate new translation relations between words, and therefore to enrich existing bilingual dictionaries with new entries. Currently, the Apertium RDF Graph includes data from 22 Apertium bilingual dictionaries and constitutes a large unified array of linked lexical entries and translations that are available and accessible on the Web (http://linguistic.linkeddata.es/apertium/). In particular, its graph structure allows for interesting exploitation opportunities, some of which are addressed in this paper. Two experiments are reported: in the first one, the original EN-ES translation set was removed from the Apertium RDF Graph and a new EN-ES version was generated. The results were compared against the previously removed EN-ES data and against the Concise Oxford Spanish Dictionary. In the second experiment, a new non-existent EN-FR translation set was generated. In this case the results were compared against a converted wiktionary English-French file. The results we got are really good and perform well for the extreme case of correlated polysemy. This led us to address the possibility to use cycles and nodes degree to identify potential oddities in the source data. If cycle density proves efficient when considering potential targets, we can assume that in dense graphs nodes with low degree may indicate potential errors.
Internacional
Si
Nombre congreso
10th Language Resources and Evaluation Conference (LREC'16)
Tipo de participación
960
Lugar del congreso
Portoro? (Slovenia)
Revisores
Si
ISBN o ISSN
978-2-9517408-9-1
DOI
Fecha inicio congreso
23/05/2016
Fecha fin congreso
28/05/2016
Desde la página
868
Hasta la página
876
Título de las actas
Proc. of 10th Language Resources and Evaluation Conference (LREC'16)

Esta actividad pertenece a memorias de investigación

Participantes
  • Autor: Marta Villegas Universitat Pompeu Fabra
  • Autor: Maite Melero Universitat Pompeu Fabra
  • Autor: Jorge Gracia Del Rio UPM
  • Autor: Nuria Bel Universitat Pompeu Fabra

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Ontology Engineering Group
  • Departamento: Inteligencia Artificial