Observatorio de I+D+i UPM

Memorias de investigación
Ponencias en congresos:
Challenges of Terminology Extraction from Legal Spanish Corpora
Año:2018
Áreas de investigación
  • Procesamiento del lenguaje,
  • Acción legal
Datos
Descripción
Untangling the complexities of legal documentation is an imperative need for non practitioners of the legal profession. The terminology used in the domain is complex and it usually requires expert knowledge to be fully understood, since the legal framework is constantly being updated and the meaning of terms vary accordingly. Non-proprietary Automatic Terminology Extraction (ATE) tools are required in this particular domain in which documents contain private and sensitive data. This paper describes methods for obtaining accurate legal terms from labour law corpora, overcoming the difficulties present in the area, and also analyses the peculiarities of the legal jargon, specifically, in Spanish language. The performed experiments, executed with JATE, a wellknown open source library in the ATE literature, are still preliminary, but promising.
Internacional
Si
Nombre congreso
JURIX
Tipo de participación
960
Lugar del congreso
Revisores
Si
ISBN o ISSN
1613-0073
DOI
Fecha inicio congreso
12/12/2018
Fecha fin congreso
14/12/2018
Desde la página
73
Hasta la página
83
Título de las actas
In Proceedings of the 2nd Workshop on Technologies for Regulatory Compliance. CEUR
Esta actividad pertenece a memorias de investigación
Participantes
  • Autor: Pablo Calleja Ibañez (UPM)
Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Departamento: Inteligencia Artificial
S2i 2021 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)