Memorias de investigación
Ponencias en congresos:
Similar Terms Grouping Yields Faster Terminological Saturation
Año:2018

Áreas de investigación
  • Ciencias de la computación y tecnología informática

Datos
Descripción
This paper reports on the refinement of the algorithm for measuring terminological difference between text datasets (THD). This baseline THD algorithm, developed in the OntoElect project, used exact string matches for term comparison. In this work, it has been refined by the use of appropriately selected string similarity measures (SSM) for grouping the terms, which look similar as text strings and presumably have similar meanings. To determine rational term similarity thresholds for several chosen SSMs, the measures have been implemented as software functions and evaluated on the developed test set of term pairs in English. Further, the refined algorithm implementation has been evaluated against the baseline THD algorithm. For this evaluation, the bags of terms have been used that had been extracted from the three different document collections of scientific papers, belonging to different subject domains. The experiment revealed that the use of the refined THD algorithm, compared to the baseline, resulted in quicker terminological saturation on more compact sets of source documents, though at an expense of a noticeably higher computation time.
Internacional
Si
Nombre congreso
ICTERI 2018: Information and Communication Technologies in Education, Research, and Industrial Applications
Tipo de participación
OTHERS
Lugar del congreso
Kiev, Ucrania
Revisores
Si
ISBN o ISSN
978-3-030-13929-2
DOI
10.1007/978-3-030-13929-2_3
Fecha inicio congreso
14/05/2018
Fecha fin congreso
17/05/2018
Desde la página
43
Hasta la página
70
Título de las actas
14th International Conference, ICTERI 2018, Kyiv, Ukraine, May 14-17, 2018, Revised Selected Papers

Esta actividad pertenece a memorias de investigación

Participantes
  • Autor: Victoria Kosa Department of Computer Science, Zaporizhzhia National University
  • Autor: David Chaves Fraga UPM
  • Autor: Nataliya Keberle Department of Computer Science, Zaporizhzhia National University
  • Autor: Aliaksandr Birukou Springer-Verlag

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Departamento: Inteligencia Artificial