• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Spoken Corpora of Slavic Languages
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 2, 2026
Discovering Science through Russian Language: HSE Prep Year Students Present at International Conference in Kazan
On May 23, 2026, the V International Scientific and Practical Conference ‘Discovering the World of Science’ took place in Kazan at the Preparatory Faculty for International Students of Kazan Federal University. Four students of the HSE International Preparatory Year took part in the event: two delivered their presentations in person, while two participated online. Their work was supervised by Acting Director of the International Prep Year Irina Isaeva and lecturer Ekaterina Kozhemyakova.
May 25, 2026
HSE Scientists Train Neural Network to 'Hear' Faults in Electric Motors
Researchers at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a new method—the Signature-Guided Data Augmentation (SGDA) framework—that achieves 99% accuracy in motor fault detection and 86% accuracy in fault classification. The application of this approach can reduce industrial equipment repair costs, minimise downtime, and improve production safety. The study results have been published in Engineering Applications of Artificial Intelligence.
May 25, 2026
'The Humanities Serve as a Conscience'
Maria Mizernaia studies Soviet literature and the history of book publishing. In this interview for the HSE Young Scientists project, she discusses plans to publish a novel about besieged Leningrad, AI-provoked reflections on what it means to be human, and how novels can help satisfy our dopamine hunger.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Spoken Corpora of Slavic Languages

Russian linguistics. 2022.
Dobrushina N., Sokur E.

Spoken corpora are collections of transcribed and annotated audio and /or video recordings of languages or language varieties. The aim of this paper is to present an overview of 51 spoken corpora currently available for Slavic languages and dialects, in particular Belarusian, Bulgarian, Croatian, Czech, Polish, Russian, Slovak, Slovenian, Trasianka, Ukrainian/Rusyn. We identify three groups of corpora according to the type of lect: corpora of standard languages (spoken mainly in an urban environment and existing in both written and oral form), dialects (spoken mainly in a rural environment and unwritten), and bilingual varieties (we call bilingual varieties spoken as L2 by people with different L1 languages, as well as all varieties that evolved in a multilingual environment). We survey the corpora in terms of text registers, transcription, and principles of linguistic and extralinguistic annotation. In conclusion, we suggest a list of features that linguists should take into consideration when developing a spoken corpus. Many spoken corpora are currently being created for various Slavic lects, and their developers may use this overview as a source of information on different designs and solutions.

Research target: Philology and Linguistics
Language: English
Full text
DOI
Text on another site
Keywords: славянские языкиSlavic languageslanguage corporaязыковые корпусаspoken corporaустные корпуса
Publication based on the results of:
Linguistic convergence mechanisms: factors of different order in interaction (2022)
Similar publications
Стратегия оперативного информирования адресата в англоязычном жанре футбольного комментария
Тырыгина В. А., Кабанова И. Н., Вестник Нижегородского государственного лингвистического университета им. Н.А. Добролюбова 2023 № 4 С. 180–191
The focus of this article is the genre of European football (soccer) commentary, considered from the point of view of implementing the strategy of informing the addressee. The authors identify and describe the strategy of informing the addressee and its corresponding tactics in the texts of this genre, using video broadcasts of English team football ...
Added: May 29, 2026
Сборник студенческих работ «Восточная перспектива»
М.: ООО «Адвансед солюшнз», 2026.
Данный выпуск сборника студенческих статей .Восточная перспектива. включает в себя статьи победителей и призеров XI Международной научной студенческой конференции "Восточная перспектива", состоявшейся 18 мая 2024 года. В 2024 году на конференцию было подано 115 заявок, офлайн и онлайн в конференции приняли участие докладчики и слушатели из различных вузов России и ближнего и дальнего Зарубежья. ...
Added: May 29, 2026
Сборник студенческих работ «Восточная перспектива»
М.: ООО «Адвансед солюшнз», 2026.
Данный выпуск сборника студенческих статей «Восточная перспектива» включает в себя статьи победителей и призеров X Международной научной студенческой конференции «Восточная перспектива», состоявшейся 15 апреля 2023 года. Юбилейная конференция стала знаковым событием для студентов различных подразделений НИУ ВШЭ и других вузов России, занимающихся подготовкой востоковедческих кадров. ...
Added: May 29, 2026
Litera
NOTA BENE, 2025.
he article is devoted to the description of a voice message in order to introduce a definition of this communicative phenomenon. Despite the contradictory nature of this phenomenon, its popularity has been growing since 2013 to the present day. However, a voice message still does not have a clear definition, as it has specific characteristics ...
Added: May 27, 2026
Лингвистический анализ рекламы парфюма в англоязычном и русскоязычном дискурсах
Gabrielova E., Шевякова Ю. С., Вестник Удмуртского университета 2026 Vol. 36 No. 2 P. 344–354
In today's globalized world, the effectiveness of sales and the success of products largely rely on well-crafted advertising texts. Influenced by this factor and the growing competition, advertising continuously evolves, incorporating various linguistic, psychological, and cross-cultural techniques. This study focuses on the linguistic and stylistic analysis of perfume advertising texts within English and Russian discourses, ...
Added: May 25, 2026
On the Curse Formula in Wˁzb’s Inscription (RIÉ 192 B, ll. 5-9)
Bulakh M., Aethiopica 2025 Vol. 28 P. 39–52
The article deals with the curse formula belonging to the sixth-century inscription by an Aksumite king Wʿzb (RIÉ 192 B, ll. 5–9). After summarizing the extant interpretations, the author proposes a new reading and interpretation, arguing that the text under scrutiny follows the same pattern and employs the same rhetoric devices as the curse formulas ...
Added: May 23, 2026
Practicamos el Subjuntivo
Bocharov Y., M.: -, 2025.
This textbook is designed for students improving their Spanish proficiency at levels B1-B2. It consists of five topics and a selection of texts to reinforce them. The first topic covers the morphology of the four tenses (present, perfect, imperfect, subjunctive perfect) and exercises on the formation of forms. The remaining topics are devoted to exploring ...
Added: May 23, 2026
Эстетика аудиовизуальной журналистики. Учебное пособие. 2-е издание
Бережная М. А., Novikova A., Кирия И. В., КноРус, 2026.
The aesthetics of journalism is substantiated as a necessary component in the professional training of specialists in audiovisual media. The factors and trends of historical and current changes in the aesthetics of journalism are presented, and the aesthetic practices of audiovisual journalism are characterized in terms of their social functioning. Criteria for aesthetic evaluation are ...
Added: May 22, 2026
Juxtapositional vs. possessive-like encoding in Russian specificational constructions
Logvinova N., Russian linguistics 2026 Vol. 50 Article 11
This paper presents the first in-depth corpus-based study of a previously overlooked syntactic variation in Russian: the competition between juxtapositional (Nominative) and possessive-like (Genitive) encoding of the second noun (the term) in specificational constructions (e.g., ponjatie čest’ (notion.NOM honor.NOM) vs. ponjatie česti (notion.NOMhonor.GEN) ‘the notion of honor’). While typological research has established cross-linguistic preferences for one encoding strategy over another, intralinguistic variation ...
Added: May 18, 2026
FOCUS ON VOCABULARY Экономика материальных и нематериальных активов: корпусный словарь и ИИ-упражнения по английскому языку
Gorina O. G., Kucherenko S., Larisa K. et al., St. Petersburg: Asterion, 2026.
This textbook is an integrated teaching and learning resource for English for Specific Purposes (ESP) in the field of economics of tangible and intangible assets. Its design employs (i) modern corpus linguistics methods, including frequency analysis and keyword extraction based on authentic texts reflecting current trends in professional discourse, and (ii) artificial intelligence technologies for ...
Added: May 16, 2026
КОГНИТИВНО-АССОЦИАТИВНОЕ ПОЛЕ ОНИМОВ САНКТ-ПЕТЕРБУРГА И ВЕНЫ
Зелинская Ю. Ю., Когнитивные исследования языка 2025 № 4(65) С. 180–186
The article focuses on the study of the onym as a cognitive stimulus that facilitates the decoding of the language of urban space across two ethnic groups. The research is grounded in the analysis of results from an onomastic associative experiment, aimed at identifying the dominant types of associative responses to anthroponyms, oikodonyms, hodonyms, and ...
Added: May 16, 2026
Лично-числовая асимметрия: согласование пассивных миративов в казымском диалекте хантыйского языка
Starchenko A., Toldova S., Типология морфосинтаксических параметров 2023 Т. 6 № 1 С. 130–148
The study focuses on a previously unrecorded model of split agreement in the mirative paradigm in Kazym Khanty. Split agreement is found when comparing active and passive mirative constructions, as well as in a limited set of uses of non-finite forms. In the passive voice, unlike the active voice, the 3rd person is unmarked and the ...
Added: May 14, 2026
Глаголы перемещения веществ в славянских языках
Fedorov D., Jezikoslovni Zapiski 2026 Т. 32 № 1 С. 23–52
This article describes verbs denoting motion of liquid and dry substances in Slavic langu­ages. The research explores how Slavic languages lexicalize different situations within the semantic field of substance motion and identifies the parameters that drive this lexicalization (e.g., type of substance, intensity and quantization of flow, and causation). Adjacent gram­matical phenomena such as argument ...
Added: May 13, 2026
Образ женщины сквозь года: диахронический анализ репрезентации женщин в российской агитационной рекламе
Gabrielova E., Максименко О. И., Социальные и гуманитарные науки на Дальнем Востоке 2026 Т. 23 № 1 С. 241–249
The article presents a diachronic analysis of the representation of women in Russian advertising, based on agitation posters from 1917-1990 and social and motivational advertising materials from 2000-2020. The aim of the study is to identify the evolution of verbal and visual strategies for constructing the image of women in the changing socio-political and cultural ...
Added: May 13, 2026
Proceedings of the 9th Student Research Workshop associated with the International Conference Recent Advances in Natural Language Processing
Velichkov B., Nikolova-Koleva I., Slavcheva M., Shumen: INCOMA Ltd, 2025.
The RANLP 2025 Student Research Workshop (RANLPStud’2025) is a special track of the established international conference Recent Advances in Natural Language Processing (RANLP’2025). The RANLPStud is being organised for the 9th time and this year is running in parallel with the other tracks of the main RANLP 2025 conference. The target of RANLPStud’25 is to be a ...
Added: May 12, 2026
Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025)
Association for Computational Linguistics, 2025.
Added: March 10, 2026
Apposition (Appositional Constructions)
Natalia N. Logvinova, , in: Encyclopedia of Slavic Languages and Linguistics Online.: Brill, 2025. Ch. 11.
Two types of appositional phrases are distinguished in Slavic languages: close and loose. With close constructions, the issues of syntactic headedness and optional case concord between the parts are discussed. Loose appositions are functionally different from close appositions, having a role comparable to secondary predication. ...
Added: December 22, 2025
Nominative Object
Ronko R., Wiemer B., , in: Encyclopedia of Slavic Languages and Linguistics Online.: Brill, 2020.
The nominative object describes a clause type in which the object of a transitive verb takes nominative morphology, and this coding is not conditioned by voice operations. It is a salient property in regions in which Slavic varieties have been in contact with Finnic- and/or Baltic-speaking population, i.e., in the eastern part of the Circum-Baltic ...
Added: December 19, 2025
Диалектометрический подход к диалектной классификации восточнославянских языков на материале сборника «Восточнославянские изоглоссы»
Manusov A. V., Кузьмина А. С., Вопросы языкового родства 2024 № 22/3-4 С. 342–366
The article proposes a new dialectometric approach to the division of East Slavic languages. Our dialectometry is based on the material from the collection of articles “Vostochnoslavyanskie izoglossy” (“East Slavic isoglosses”, 1995–2006), which is a generalization of data from atlases of East Slavic languages (Dialectological atlas of the Russian language, Dialectological atlas of the Belarusian ...
Added: November 13, 2025
Палеославистика 6. Славянское и балканское языкознание. Выпуск 25
Савич В., Паскаль А. Д., Вершинин К. В. et al., Полимедиа, 2025.
The volume of the “Slavic and Balkan Linguistics” series presents the monograph “Palaeoslavistica – 6” written by the international team of researchers. The sections of the co-authored monograph are devoted to the latest results of the ongoing research of the Slavic manuscripts written in the 10th–14th centuries, their language, textology, and palaeography. ...
Added: November 12, 2025
BERT-like Models for Slavic Morpheme Segmentation
Morozov D., Astapenka L., Glazkova A. et al., , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)Vol. 1: Long papers.: Association for Computational Linguistics, 2025. P. 6795–6815.
Automatic morpheme segmentation algorithms are applicable in various tasks, such as building tokenizers and language education. For Slavic languages, the development of such algorithms is complicated by the rich derivational capabilities of these languages. Previous research has shown that, on average, these algorithms have already reached expert-level quality. However, a key unresolved issue is the ...
Added: July 17, 2025
Обзор семейства конструкций с функцией «понижения агенса» в славянских языках
Plungian V., Подгорная А. Д., Славистика 2023 Т. 27 № 2 С. 54–70
В данной работе представлен обзор конструкций, выполняющих функцию «понижения агенса», в славянских языках, что включает причастный пассив, субъектный имперсонал с кратким пассивным причастием (на -no/to), форма с континуантом праславянского *sę, в разных языках демонстрирующая свойства пассива или имперсонала, конструкции с глаголом в форме 3-го лица мн.ч. и ед.ч. (ср.р.), универсальные употребления 2-го лица ед.ч., 1-го ...
Added: June 6, 2024
Национальный корпус русского языка 2.0: новые возможности и перспективы развития
Савчук С. О., Архангельский Т. А., Bonch-Osmolovskaya A. A. et al., Вопросы языкознания 2024 № 2 С. 7–34
The paper provides an overview of the results of the fundamental reconstruction and modernization project of the National Corpus of the Russian Language platform, carried out from 2020 to 2023. The focus of the paper is on the new opportunities that are opening up for linguists and a wider audience. This includes improving the representativeness ...
Added: March 21, 2024
Корпусное исследование конкуренции конструкций с функцией «понижения агенса» в славянских языках
Plungian V., Подгорная А. Д., Studia Slavica 2022 Т. 67 № 1-2 С. 115–131
В статье рассматриваются конструкции с функцией «понижения агенса» и их переводные эквиваленты на материале параллельного корпуса романа М. А. Булгакова «Мастер и Маргарита» в переводах на польский, чешский, болгарский, сербский и немецкий языки. Под данным ярлыком объединяются средства, лишающие агенс привилегированного коммуникативного статуса, что проявляется в его реализации в нехарактерной синтаксической позиции, полном опущении или ...
Added: November 8, 2023
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit