Spoken Corpora of Slavic Languages

?

Spoken Corpora of Slavic Languages

Russian linguistics. 2022.

Dobrushina N., Sokur E.

Spoken corpora are collections of transcribed and annotated audio and /or video recordings of languages or language varieties. The aim of this paper is to present an overview of 51 spoken corpora currently available for Slavic languages and dialects, in particular Belarusian, Bulgarian, Croatian, Czech, Polish, Russian, Slovak, Slovenian, Trasianka, Ukrainian/Rusyn. We identify three groups of corpora according to the type of lect: corpora of standard languages (spoken mainly in an urban environment and existing in both written and oral form), dialects (spoken mainly in a rural environment and unwritten), and bilingual varieties (we call bilingual varieties spoken as L2 by people with different L1 languages, as well as all varieties that evolved in a multilingual environment). We survey the corpora in terms of text registers, transcription, and principles of linguistic and extralinguistic annotation. In conclusion, we suggest a list of features that linguists should take into consideration when developing a spoken corpus. Many spoken corpora are currently being created for various Slavic lects, and their developers may use this overview as a source of information on different designs and solutions.

Research target: Philology and Linguistics

Publication based on the results of:

Linguistic convergence mechanisms: factors of different order in interaction (2022)

ХЛЕБНИКОВСКИЕ «ПУШКИНОТЫ МЛЕЮЩЕГО ПОЛДНЯ» И СТИХОТВОРЕНИЕ О. Э. МАНДЕЛЬШТАМА «РАВНОДЕНСТВИЕ»

Abelyuk E., Русская литература 2026 № 4 Статья DOI: 10.31860/0131-6095-2026-4-

The article demonstrates that Mandelstam's poem, "There are orioles in the woods, and the length of vowels..." reveals an internal connection with Khlebnikov's poem “О, dostoevskiymo running cloud!": one of the latter's key images "shines through" the text of the former. Drawing on Mandelstam's definitions that describe the reading process, the author reveals how this connection might have arisen, whether ...

Added: August 1, 2026

Proceedings of the International Science Conference “Scientific research of the SCO countries: synergy and integration” - Reports in English (June 3, 2026. Beijing, PRC)

Scientific publishing house Infinity, 2026.

These Conference Proceedings combine materials of the conference – research papers and thesis reports of scientifi c workers. They examine technical, juridical and sociological aspects of research issues. Some articles deal with theoretical and methodological approaches and principles of research questions of personality professionalization. ...

Added: July 24, 2026

К синтаксису клауз с аспектуальными глаголами в якутском языке

Баркова Л. А., Родной язык: лингвистический журнал 2026 № 1 С. 9–58

This article examines the syntax of constructions with aspectual verbs in Sakha (a.k.a. Yakut). Such constructions contain two predicates: a lexical verb which is a converb, and a finite aspectual verb that conveys some grammatical meaning. The syntax of such constructions has already been studied for some other Turkic languages. The pre‑ sent research examines ...

Added: July 23, 2026

Систематизация равноправных произносительных вариантов в современном русском языке (на материале орфоэпических словарей)

Zubov V., Вопросы лексикографии 2026 № 40 С. 64–86

The article addresses the problem of selecting and systematizing data for the study of pronunciation variation in contemporary Russian and proposes a solution in the form of a specialized database of codified equivalent pronunciation variants (e.g., simmétriya / simmetríya “symmetry”). The article presents a methodology for identifying, selecting, and organizing such variants into a database. ...

Added: July 23, 2026

Библиометрия фольклора: русские пословицы в научных журналах

Pislyakov V., Вестник Томского государственного университета. Филология 2026 № 101 С. 175–192

This article examines the use of proverbs in academic texts—specifically, articles published in Russian research journals. For the experiment, ten proverbs were selected as the intersection of two fundamentally different paremiological surveys aimed at compiling lists of popular or common Russian proverbs. One of these surveys was conducted by the classic of paremiology, G.L. Permyakov, ...

Added: July 22, 2026

Russian Pronouns with Focus Antecedents: Coreference and Binding in Corpora

Tiskin D., Компьютерная лингвистика и интеллектуальные технологии 2026 No. 24 P. 656–665

Despite a lot of interest for the factors influencing the choice of pronoun (reflexive or personal) with an antecedent in Russian, the role of the anaphotic relation—coreference or semantic binding—has been understudied, including disagreements as to the acceptability of particular data points. To clarify things, I employ large corpora (Araneum and GICR) to study the ...

Added: July 19, 2026

Не только ἐπιχώρια διδάγματα: пайдейя Эпаминонда

Mozhaysky A., Schole. Философское антиковедение и классическая традиция 2026 Т. 20 № 2 С. 1105–1116

This article examines the education of Epaminondas, the most famous Theban military and political figure. However, in antiquity, Epaminondas was also renowned for his education and philosophical authority. The study demonstrates that Epaminondas' education encompassed a complex set of local teachings, which Pausanias describes as ἐπιχώρια διδάγματα. However, Epaminondas' education differed from that of most members ...

Added: July 17, 2026

Английский язык для студентов педагогических вузов. = English for Pre-Service Teachers (B2-C1)

Stognieva O., Новикова В. П., М.: Флинта, 2026.

Инновационный курс английского языка для специальных целей для студентов педагогических вузов предлагает погружение в актуальный образовательный дискурс: от вопросов воспитания и когнитивного развития детей и подростков до переосмысления роли школы в цифровую эпоху. Содержательной основой курса выступают аутентичные мультимодальные материалы, позволяющие анализировать глобальные тренды современных образовательных систем и подходов. Издание идеально подходит вузам, стремящимся подготовить ...

Added: July 16, 2026

Вклад Нгуен Тонг Куая в развитие вьетнамской поэзии (Новый взгляд на творчество поэта XVIII века)

Britov I., Вьетнамские исследования 2026 Т. 10 № 2 С. 87–98

The article analyzes the work of the poet of the XVIII century. Nguyen Tong Quai. Attention is drawn to the fact that in Vietnam, only after the proclamation of the policy of renewal, they began to actively study and appreciate his literary legacy, although even during the poet's lifetime, his contemporaries gave extremely positive reviews ...

Added: July 16, 2026

Комитативно-аддитивная полисемия в пуровском диалекте лесного ненецкого языка

Kozlov A., Лапшина К. М., Вопросы языкознания 2026 № 4 С. 132–146

This article examines two functions of the suffix -samae in the Pur dialect of Forest Nenets based on fieldwork data: comitative (expression of jointness: ‘with X’) and scalar additive (focus particle with the meaning ‘even X’). The comitative use of the suffix -samae primarily marks an inanimate companion. However, its use is also possible with other types ...

Added: July 13, 2026

Prompt Design for GPT-4 Assessments of EFL Student Reports

Stognieva O., Murashova N., Journal of Asia TEFL 2026 Vol. 23 No. 2 P. 490–505

This study investigates the impact of different prompt design strategies on the performance of GPT-4 in assessing undergraduate reports within an English as a Foreign Language (EFL) context. As Large Language Models (LLMs) increasingly integrate into educational assessment, understanding how prompt engineering affects grading accuracy and alignment with human judgment is crucial. Three prompt design methods—TELeR Taxonomy, Six strategies ...

Added: July 12, 2026

International Academic Conference. Proceedings of the Scientific Forum “Modern Science: Theory and Practice” (April 22, 2026). Belgrade, Serbia. Part 3.

Scientific publishing house Infinity, 2026.

Scientific Forum Proceedings combine materials of the conference – research papers and thesis reports of scientific workers. They examine technical, juridical and sociological aspects of research issues. Some articles deal with theoretical and methodological approaches and principles of research questions of personality professionalization. ...

Added: July 10, 2026

Этот смутный объект внимания: "реальные предметы" и гаптический опыт в рассказах В. Вулф

Shulyatieva D., Новое литературное обозрение 2026 № 199 С. 128–140

В статье рассмотрена гаптическая образность в поэтике В. Вулф на примере трех ее рассказов («Пятно на стене», «Женщина в зеркале», «Реальные предметы»), в центре которых оказываются предметы, устанавливающие обновленные отношения с героями. С опорой на теорию гаптической визуальности и на теорию вещи описаны трансформации, которые происходят с предметами, и переживание, которое открывается герою и нарратору при соприкосновении с ними, ...

Added: July 10, 2026

Two ga-morphemes in Rutul: Accidental similarity or a case of polygrammaticalization?

Maisak T., Word Structure 2026 Vol. 19 No. 2-3 P. 338–367

In a situation when two or more grammaticalization targets in one language are phonologically identical but functionally distinct, neither polygrammaticalization nor accidental syncretism can be ruled out, especially if we are dealing with a language without historical attestations. In the present paper, I present a detailed account of the coexistence of two homophonous grammatical markers ...

Added: July 9, 2026

Towards a typology of imperative interjections: ‘Take it!’ in the Caucasus

Maisak T., Transactions of the Philological Society 2026 Vol. 124 No. 2 P. 386–427

This paper presents a first typological study of a particular type of imperative interjections, namely interjections with the meaning ‘here, take it!’ used by a speaker when they ask the addressee to take something from the speaker's hands (often combined with a gesture of giving). The sample of languages is both geographically and genealogically restricted ...

Added: July 9, 2026

Light Verb Constructions from a Cross-Linguistic Perspective

Berlin, Boston: De Gruyter, 2025.

Light verb constructions are complex predicates consisting of a semantically reduced verb and an additional often phrasal element contributing the main predicational content. Although light verb constructions have been identified for various (genetically unrelated) languages, a comparative concept which allows identifying light verb constructions across languages is still missing. The present volume approaches this issue ...

Added: July 9, 2026

The Semiotic Intensity Approach: A Scoping Review of Amplification and Attenuation Mechanisms in Multimodal Media Discourse

Yin Z., Terra Linguistica 2026 Vol. 17 No. 2 P. 152–168

Abstract. In the context of global communication, the construction of national images in the media has evolved from passive reporting to active meaning modulation. Using China as a case study, this research introduces the Semiotic Intensity Approach (SIA) to quantify how news media integrate verbal, visual, and layout resources to either amplify or attenuate specific ...

Added: July 8, 2026

Комитет цензуры иностранной как институт культурного трансфера, или судьба итальянских книг и переводов с итальянского в цензурных документах 1830–1850-х годов

Bodrova A. S., Guskov S., Studi Slavistici 2026 Т. 23 № 1 С. 197–212

The article investigates foreign censorship as an institution of cultural transfer in the Russian Empire and its impact on the reception of Italian literature between the 1830s and 1850s. Drawing on archival materials, the authors demonstrate that censorship decisions were determined not only by the norms of the Censorship Statute (1828) but also by a ...

Added: July 5, 2026

Деепричастия в русском языке XVIIв.: переходный период в истории формирования их грамматического значения

Ermolova M., Russian Linguistics 2026 Т. 50 Статья 14

The article analyzes the functioning of gerunds in the Russian language of the 17th century. Basedon the analysis of contexts that are absent in modernRussian, itisconcludedthatinthe 17th century the gerund lost the absolute temporal meaning it once had, acquiring a relative meaning depending on the tense of the main predicate, while remaining, at the same ...

Added: July 4, 2026

Семантика необратимости в медиадискурсе ФРГ: эсхатологические коды и реакция аудитории в условиях кризиса

Moskvina Z. О., Вестник Российского университета дружбы народов. Серия: Литературоведение, журналистика 2026 Т. 31 № 2 С. 398–408

Abstract. This article explores the semantic and cognitive mechanisms governing the functioning of the lexeme “irreversibility” (Unumkehrbarkeit) within contemporary German media discourse covering the crisis in German-Russian relations. The study tests the hypothesis that the use of irreversibility semantics in the mass media serves as a rhetorical strategy intended to reinforce the perception of ongoing ...

Added: July 3, 2026

Глаголы перемещения веществ в славянских языках

Fedorov D., Jezikoslovni Zapiski 2026 Т. 32 № 1 С. 23–52

This article describes verbs denoting motion of liquid and dry substances in Slavic languages. The research explores how Slavic languages lexicalize different situations within the semantic field of substance motion and identifies the parameters that drive this lexicalization (e.g., type of substance, intensity and quantization of flow, and causation). Adjacent grammatical phenomena such as argument ...

Added: May 13, 2026

Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025)

Association for Computational Linguistics, 2025.

Added: March 10, 2026

Apposition (Appositional Constructions)

Natalia N. Logvinova, , in: Encyclopedia of Slavic Languages and Linguistics Online.: Brill, 2025. Ch. 11.

Two types of appositional phrases are distinguished in Slavic languages: close and loose. With close constructions, the issues of syntactic headedness and optional case concord between the parts are discussed. Loose appositions are functionally different from close appositions, having a role comparable to secondary predication. ...

Added: December 22, 2025

Nominative Object

Ronko R., Wiemer B., , in: Encyclopedia of Slavic Languages and Linguistics Online.: Brill, 2020.

The nominative object describes a clause type in which the object of a transitive verb takes nominative morphology, and this coding is not conditioned by voice operations. It is a salient property in regions in which Slavic varieties have been in contact with Finnic- and/or Baltic-speaking population, i.e., in the eastern part of the Circum-Baltic ...

Added: December 19, 2025