Memorias de investigación
Capítulo de libro:
SynTagRus: a deeply annotated corpus of Russian
Año:2014

Áreas de investigación
  • Lingüística computacional,
  • Sintaxis

Datos
Descripción
The Russian dependency treebank, SynTagRus, is a subcorpus of the National Corpus of the Russian Language and at the time of writing contains over 52,000 sentences (roughly 770,000 words). It is supplied with several types of annotation. First, it contains comprehensive morphological and syntactic annotation. The latter is presented in the form of a full dependency tree that uses about 75 distinct dependency labels. Second, SynTagRus partly contains lexical semantic annotation, which means that, for all cases of word sense ambiguity in the corpus, the concrete lexical meaning should be identified and explicitly marked. So far, the number of SYNTAGRUS sentences fully tagged for word senses is over 10,000, and it is constantly growing. Third, a part of SynTagRus is annotated for collocate Lexical Functions. SynTagRus is freely available for research and educational purposes.
Internacional
Si
DOI
10.3726/978-3-653-03879-8
Edición del Libro
Editorial del Libro
Edition Peter Lang
ISBN
978-3-631-64608-3
Serie
Título del Libro
Les émotions dans le discours - Emotions in Discourse
Desde página
367
Hasta página
379

Esta actividad pertenece a memorias de investigación

Participantes

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Grupo de Validación y Aplicaciones Industriales
  • Departamento: Inteligencia Artificial