ESC Corpus of Spoken Russian: Everyday Student Conversations Captured through Continuous Speech Recording in Natural Communicative Environments
This article describes the methodology for creating a new resource of everyday Russian speech, based on audio recordings made by student volunteers over the course of an entire day in natural communication settings (at home, at university, in a café, at a fitness club, etc.). The precursor to this corpus is the well-known ORD (“One Day of Speech”) corpus, for which recordings were made from 2007 to 2016. Since the ORD recordings were made, spoken Russian has changed, most noticeably at the lexical level in the speech of young people. The new speech resource aims to capture a snapshot of this current usage in order to identify new colloquial vocabulary, as well as new meanings and connotations of known language units. The new recordings of everyday spoken language will supplement the empirical material of the ORD corpus and provide a foundation for further scientific, theoretical, and practical work. The article details the methodology for creating the Everyday Student Conversations (ESC) corpus, highlights its differences from the ORD corpus, and provides current ESC corpus statistics.