?
Automatic Morphemic Analysis of Russian Words
The paper considers the task of the morphemic analysis of Russian words and compares the efficiency of several proposed models. These models can be divided into three groups: derivational and inflectional rule-based, proba- bilistic, and hybrid models. The latter achieved state-of-the-art results of 0.848 F-score on a test set of 500 Russian words. The models use dictionaries of morphs and words and information about the part of speech and other morphological fea- tures of the word. Importantly, our solution takes into account synchronic word- formative relations between words. This allows for analyzing words in any gram- matical form, as well as previously unseen words. Our system, which we make freely available to the community, also features morphemic annotation of entire texts and search for specified morphs.