Observatorio de I+D+i UPM

Memorias de investigación
Book chapters:
Integrating WordNet and Wiktionary with lemon
Year:2012
Research Areas
  • Information technology and adata processing
Information
Abstract
have been created. These resources are confined however to what Tim Berners- Lee has named ?data silos?, as either they are publicly available, albeit in propri- etary formats, or access to them is restricted. This leads to a situation in which the integration of various linguistic data becomes cumbersome. The Linking Open Data project (Berners-Lee (2009)) has aimed to solve these issues by fostering the publication of data on the Web using the RDF data model and, most importantly, linking data across sites. In this paper, we discuss how the principles of Linked Data can be applied to the publication of linguistic data. We discuss in detail the conversion of WordNet and Wiktionary to Linked Data resources using the lemon model as a use case. While WordNet has been already converted to the RDF data model, there are significant challenges in converting a semi-structured resource such as Wiktionary into the RDF data model. We discuss these challenges and how we addressed them. Our use cases demonstrate that lemon can be used as a uni- form, principled and simple model for the publication of lexical resources as linked data as well as their linking. All resources described in this paper are available at http://monnetproject.deri.ie/lemonsource.
International
Si
10.1007/978-3-642-28249-2_3
Book Edition
Book Publishing
Springer
ISBN
978-3-642-28248-5
Series
Book title
Linked Data in Linguistics
From page
25
To page
34
Participants
  • Autor: Elena Montiel Ponsoda (UPM)
Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Ontology Engineering Group
S2i 2019 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)