A database for studying the variation of hard/soft consonants before e in loanwords
The paper presents the preparatory stage of a study of the variation between hard and soft consonants before e in loanwords (ka[f]e). The main goal is to compile a database of relevant words for use in sociolinguistic research. The database is based on a list of word forms containing the relevant contexts, drawn from users' queries to Yandex. All entries in the database are annotated for parameters that may be important in a variationist study of the phenomenon. The article describes how the list was compiled and the principles of its annotation. The annotation covers the consonant itself; the position of the consonant relative to the stressed syllable; the type of syllable in which it occurs (open/closed); the year of the word's first occurrence in the Russian National Corpus; the language from which the word was borrowed; and the word's frequency. The database may be used to select stimuli for experimental studies of variation in modern speech and of its social correlates (age, gender, education, etc.).
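As an illustration only, the annotation parameters listed above could be represented as a record like the one sketched below, with a simple filter for choosing stimuli. This is a minimal sketch under stated assumptions, not the authors' actual schema: the class name `LoanwordEntry`, the function `select_stimuli`, all field names, and the encoding of stress position as a syllable offset are hypothetical, and the sample values (including the frequencies) are placeholders rather than data from the database.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class LoanwordEntry:
    """One annotated word form. All field names are hypothetical."""
    word_form: str                   # transliterated word form
    consonant: str                   # consonant preceding e
    stress_offset: int               # assumed encoding: 0 = stressed syllable,
                                     # negative = pretonic, positive = post-tonic
    open_syllable: bool              # True for an open syllable, False for closed
    first_rnc_year: Optional[int]    # first occurrence in the Russian National Corpus
    source_language: Optional[str]   # language the word was borrowed from
    frequency: float                 # frequency measure (unit not specified here)


def select_stimuli(
    entries: List[LoanwordEntry],
    consonant: str,
    stressed_only: bool = False,
    min_frequency: float = 0.0,
) -> List[LoanwordEntry]:
    """Return entries with the given consonant, most frequent first."""
    selected = [
        e for e in entries
        if e.consonant == consonant
        and e.frequency >= min_frequency
        and (not stressed_only or e.stress_offset == 0)
    ]
    return sorted(selected, key=lambda e: e.frequency, reverse=True)


# Placeholder records: the numeric values are invented for the sketch.
SAMPLE = [
    LoanwordEntry("kafe", "f", 0, True, None, "French", 3.0),
    LoanwordEntry("placeholder_a", "f", -1, False, None, None, 5.0),
    LoanwordEntry("placeholder_b", "t", 0, True, None, None, 1.0),
]

if __name__ == "__main__":
    for entry in select_stimuli(SAMPLE, "f", stressed_only=True):
        print(entry.word_form)
```

Keeping each parameter as a separate typed field would make it straightforward to balance stimuli across consonants, stress positions, and syllable types when designing an experiment.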