Memorias de investigación
Ponencias en congresos:
Tagging Spanish Texts: the Problem of se
Año:2008

Áreas de investigación
  • Inteligencia artificial

Datos
Descripción
Automatic tagging in Spanish has historically faced many problems because of some specific grammatical constructions. One of these traditional pitfalls is the ¿se¿ particle. This particle is a multifunctional and polysemous word used in many different contexts. Many taggers do not distinguish the possible uses of ¿se¿ and thus provide poor results at this point. In tune with the philosophy of free software, we have taken a free annotation tool as a basis, we have improved and enhanced its behaviour by adding new rules at different levels and by modifying certain parts in the code to allow for its possible implementation in other EAGLES-compliant tools. In this paper, we present the analysis carried out with different annotators for selecting the tool, the results obtained in all cases as well as the improvements added and the advantages of the modified tagger.
Internacional
Si
Nombre congreso
Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Tipo de participación
960
Lugar del congreso
Marrakech, Morocco
Revisores
Si
ISBN o ISSN
2-9517408-4-0
DOI
Fecha inicio congreso
29/05/2008
Fecha fin congreso
29/05/2008
Desde la página
11
Hasta la página
11
Título de las actas
LREC 2008 Conference Abstracts

Esta actividad pertenece a memorias de investigación

Participantes

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Ontology Engineering Group (LIA). Laboratorio Inteligencia Artificial. Grupo de Ingeniería Ontológica
  • Departamento: Inteligencia Artificial
  • Departamento: Lingüistica Aplicada a la ciencia y a la Tecnología