Observatorio de I+D+i UPM

| Otras actividades
HOME

Proyectos Internacionales Art�culos Patentes UPM Software UPM Empresas UPM Otras actividades Memorias de investigaci�n

Memorias de investigación

Cap�tulo de libro:

Text Analysis and Information Extraction from Spanish Written Documents

A�o:2014

�reas de investigaci�n

Interfases mediante lenguaje natural

Datos

Descripci�n
Despite of the spread of Electronic Health Records (EHRs) in Spanish hospitals and Spanish occupying the second place in the ranking of number of speakers, to the best of our knowledge there are no natural language processing tools for medical texts written in Spanish. This paper presents an approach based on OpenNLP to process natural language texts written in Spanish for information extraction. The main goal is to integrate our development with cTAKES. As cTAKES has been specifically trained for the clinical domain, in this paper we will train the main modules from a general purpose annotated Spanish corpus and an in-house corpus developed with medical documents, testing both on a set of medical documents. Best performance of individual components when tested with medical documents: Sentence boundary detector accuracy = 0.872; Part-of-speech tagger accuracy = 0.946; chunker = 0.909.
Internacional	Si
DOI	10.1007/978-3-319-09891-3_18
Edici�n del Libro	1
Editorial del Libro	Springer Link
ISBN	978-3-319-09890-6
Serie	Lecture Notes in Computer Science
T�tulo del Libro	Brain Informatics and Health
Desde p�gina	188
Hasta p�gina	197

Esta actividad pertenece a memorias de investigaci�n

Participantes

Autor: Roberto Costumero Moreno UPM
Autor: Angel Mario Garcia Pedrero UPM
Autor: Consuelo Gonzalo Martin UPM
Autor: Ernestina Menasalvas Ruiz UPM
Autor: Socorro Mill�n

Grupos de investigaci�n, Departamentos, Centros e Institutos de I+D+i relacionados

Creador: Grupo de Investigaci�n: Miner�a de Datos y Simulaci�n (MIDAS)
Departamento: Lenguajes y Sistemas Inform�ticos e Ingenier�a de Software
Departamento: Arquitectura y Tecnolog�a de Sistemas Inform�ticos