Observatorio de I+D+i UPM

Research reports
Journal articles:
Multilingual dictionaries and the Web of Data
Research areas
  • Computer science and information technology
There is a growing trend toward publishing language resources (lexicons, corpora, dictionaries, etc.) as Linked Data (LD) on the Web. LD refers to a set of best practices for exposing, sharing and connecting data on the Web (Bizer et al. 2009). In short, the LD paradigm requires that (i) resources are represented on the Web via HTTP URIs (Uniform Resource Identifiers), (ii) once a resource is accessed via its URI, information about it is obtained, and (iii) such information contains links to other resources. The basic mechanism to support the representation of resources and their related information is the Resource Description Framework (RDF, http://w3.org/TR/rdf11-primer/), which follows the subject-predicate-object pattern. Resources can be anything, including documents, people, physical objects and abstract concepts. Following LD principles, a "Web of Data" emerges in which links are at the level of data, as a counterpart to the "traditional" Web in which links are established at the level of documents (e.g. hyperlinks between webpages). Publishing language resources as LD offers clear advantages to both data owners and data users, such as greater independence from domain-specific data formats or vendor-specific APIs, as well as easier access to and re-use of linguistic data by semantic-aware software agents. Further, we argue that reaching a critical mass of linguistic data as LD on the Web will set the basis for a new generation of LD-aware Natural Language Processing (NLP) services, with improved scalability and better interoperability. The latter is, in fact, one of the motivations of LIDER, a European project that is driving a remarkable community effort in that direction. In this context, the Ontology Engineering Group (OEG) at Universidad Politécnica de Madrid has started converting a series of bilingual dictionaries and multilingual terminologies and publishing them as LD on the Web.
In the following paragraphs we briefly present the RDF conversion process that we have followed, and report on our experience with two of these datasets: Apertium and Terminesp.
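The three Linked Data requirements and the subject-predicate-object pattern described in the abstract can be illustrated with a minimal sketch. It uses plain Python tuples rather than an RDF library, and every URI under example.org, the `TRANSLATION` property, and the helper names `describe` and `linked_resources` are hypothetical; they are not taken from the Apertium or Terminesp datasets.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    """One RDF-style statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


# Hypothetical namespace and property, used for illustration only.
EX = "http://example.org/lexicon/"
TRANSLATION = "http://example.org/vocab#translation"
# rdfs:label is a real W3C property for human-readable names.
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

# (i) Every resource is named by an HTTP URI.
GRAPH = [
    Triple(EX + "bench-en", LABEL, "bench"),
    Triple(EX + "bench-en", TRANSLATION, EX + "banco-es"),
    Triple(EX + "banco-es", LABEL, "banco"),
]


def describe(uri: str, graph: List[Triple]) -> List[Triple]:
    """(ii) Accessing a URI yields the statements made about it."""
    return [t for t in graph if t.subject == uri]


def linked_resources(uri: str, graph: List[Triple]) -> List[str]:
    """(iii) Objects that are themselves URIs act as links to other resources."""
    return [t.obj for t in describe(uri, graph) if t.obj.startswith("http://")]


if __name__ == "__main__":
    for target in linked_resources(EX + "bench-en", GRAPH):
        print(target, describe(target, GRAPH))
```

Following the link from the English entry to the Spanish one and then describing that second resource is the data-level counterpart of following a hyperlink between webpages.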
Journal title
Kernerman Dictionary News
JCR impact factor
Impact information
Journal issue
From page
To page
This activity belongs to research reports
  • Author: Jorge Gracia Del Rio (UPM)
Related research groups, departments, centres and R&D&I institutes
  • Creator: Research group: Ontology Engineering Group
  • Department: Artificial Intelligence