StimulStat: a lexical database for Russian
In this article, we present StimulStat – a lexical database for the Russian language in the form of a web appli- cation. The database contains more than 52,000 of the most frequent Russian lemmas and more than 1.7 million word forms derived from them. These lemmas and forms are char- acterized according to more than 70 properties that were dem- onstrated to be relevant for psycholinguistic research, includ- ing frequency, length, phonological and grammatical proper- ties, orthographic and phonological neighborhood frequency and size, grammatical ambiguity, homonymy and polysemy. Some properties were retrieved from various dictionaries and are presented collectively in a searchable form for the first time, the others were computed specifically for the database. The database can be accessed freely at http://stimul. cognitivestudies.ru.