RuThes Cloud: Towards a Multilevel Linguistic Linked Open Data Resource for Russian

Alexander Kirillovich,Olga Nevzorova,Emil Gimadiev,Natalia Loukachevitch
DOI: https://doi.org/10.1007/978-3-319-69548-8_4
2017-01-01
Abstract:In this paper we present a new multi-level Linguistic Linked Open Data resource for Russian. It covers four linguistic levels: semantic, lexical, morphological and syntactic. The resource has been constructed on base of the well-known RuThes thesaurus and the original hitherto unpublished Extended Zaliznyak grammatical dictionary. The resource is represented in terms of SKOS, Lemon, and LexInfo ontologies and a new custom ontology. Building the resource, we automatically completed the following tasks: merging source resources upon common lexical entries, decomposing complex lexical entries, and publishing constructed resource as LLOD-compatible dataset. We demonstrate the use case in which the developed resource is exploited in IR task. We hope that our work can serve as a crystallization point of the LLOD cloud in Russian.
What problem does this paper attempt to address?