Research reports
Conferences:
SynTagRus - a deeply annotated corpus of Russian
Year: 2013

Research areas
  • Philology
  • Computational linguistics

Details
Description
The Russian dependency treebank, SynTagRus, is a subcorpus of the National Corpus of the Russian Language and, at the time of writing (Spring 2013), contains over 52,000 sentences (roughly 770,000 words). It is supplied with several types of annotation. First, it contains comprehensive morphological and syntactic annotation. The latter is presented in the form of a full dependency tree that uses about 75 distinct dependency labels. Second, part of SynTagRus carries lexical semantic annotation, which means that, for every case of word sense ambiguity in that part of the corpus, the concrete lexical meaning is to be identified and explicitly marked. So far, the number of SynTagRus sentences fully tagged for word senses is over 10,000, and it is constantly growing. Third, a part of SynTagRus is annotated for collocate Lexical Functions. SynTagRus is freely available for research and educational purposes.
International
Yes
ISSN or ISBN
Related entity
International Conference New Directions in Lexical Semantics and Discourse Organization. Organized by Université Stendhal (Grenoble III); Institute of Romance Studies, Universität zu Köln, Germany; Institute of English and American Studies.
Entity nationality
No nationality
Conference venue
Osnabrück (Germany)

This activity belongs to research reports

Participants

Related research groups, departments, centers, and R&D&I institutes
  • Creator: Research Group: Grupo de Validación y Aplicaciones Industriales
  • Department: Inteligencia Artificial