Intra-speaker stress variation in Russian: A corpus-driven study of Russian poetry

A. Piperski; A. Kukhto

?

Intra-speaker stress variation in Russian: A corpus-driven study of Russian poetry

Компьютерная лингвистика и интеллектуальные технологии. 2016. P. 540–550.

Piperski A., Kukhto A.

Russian lexical stress exhibits both inter-speaker variation, defined by the speaker’s regional affiliation, social status, age, etc., as well as intra-speaker variation. The latter is difficult to capture due to the need for large corpora of spoken text produced by one speaker. These are lacking, but can be replaced with poetic corpora. We use automatic analysis of poetic texts by 10 poets, drawn from the Russian National Corpus, in order to find word forms that can have stress variation. The number of such forms for an individual speaker ranges between 30 and 200 words, distributed among different parts of speech. We propose a quantitative measure of overall stress variability independent of the corpus size and show that there is a tendency for this variability to diminish over time, at least in poetic texts.

Priority areas: humanitarian

Language: English

Full text

Keywords: вариативность корпусная лингвистика ударение corpus linguistics language variation Russian accent

Publication based on the results of:

The Russian Language and Modern Trends in Communication (2016)

Российская социология в условиях цифровизации общества: результаты анализа корпуса научных текстов

Смирнов А. В., Социологические исследования 2023 № 4 С. 39–50

Using the analysis of a corpus of texts from eight leading Russian sociological journals, the article examines the impact of the digitalization of society on sociology in 2000–2021. Frequency analysis of 13.8 thousand scientific texts tracked the introduction of concepts related to digitalization into academic circulation. The article reveals the differences between the journals, due ...

Added: March 18, 2026

Promotional adjectives in grant proposal abstracts: a corpus study

Tulyakov D., Permyakova T. M., Balezina E., Вестник Волгоградского государственного университета. Серия 2: Языкознание 2025 Vol. 24 No. 6 P. 58–67

By effectively integrating promotional discourse into grant proposal abstracts, researchers can more compellingly present their ideas and increase their chances of securing funding. Implications of promotional adjectives in grant writing might differ across various research fields. This study aims to explore the use of promotional adjectives in abstracts of research grant proposals in six research ...

Added: March 2, 2026

Kirina M., Лукьянчикова А. С., В кн.: Язык в эпоху цифровых трансформаций и развития искусственного интеллекта : Сборник научных статей по итогам II Международной научной конференции Минск, 23–24 октября 2025 г.: Мн.: БГУИЯ, 2025. С. 74–85.

В статье рассматриваются характерные особенности гороскопических текстов как части астрологического дискурса. Материалом исследования выступает представительная выборка ежедневных предсказаний на русском языке, опубликованных в открытых группах социальной сети «ВКонтакте», суммарным объемом 1185425 словоупотреблений. С использованием методов корпусной и компьютерной лингвистики анализируются содержательные лексические единицы – как общие, так и отличительные для каждого знака зодиака (в сопоставлении ...

Added: February 28, 2026

Динамика восприятия площадей в пространстве города носителями русского языка (сравнительный анализ по данным НКРЯ)

Belova P., В кн.: Актуальные вопросы лингвистики и литературоведения: сборник научных статей по материалам международной научной конференции памяти доктора филологических наук, профессора Л.А. Араевой (6–8 февраля 2025).: Кемеровский государственный университет, 2025. С. 155–160.

This article contains research results on the dynamics of squares’ perception in the city space in the Russian language picture of the world over time, starting from the second half of the XXth century to the present. Turning to the subcorpus of literary texts of the second half of the XXth century and the XXIst ...

Added: February 4, 2026

Языковая концептуализация пространства в художественном тексте (по данным НКРЯ)

Belova P., В кн.: Когнитивные исследования языка. Вып. №1 (62): материалы Международной научной конференции по когнитивной лингвистике. 5-7 июня 2025. Ч. 2Ч. 2. Кн. 62. Вып. 1.: ТюмГУ-Press, 2025. С. 56–60.

Данная статья представляет результаты изучения содержания концепта ПРОСТРАНСТВО в русском языковом сознании на материале художественных прозаических текстов разных жанров, созданных во второй половине XX века и в XXI веке и представленных в НКРЯ. Анализ проведен с учетом таких культурно-языковых фильтров, как пропозициональные установки, предметно-понятийные корреляции и метафорические преобразования. ...

Added: February 4, 2026

Два подхода к дифференциации терминов миграционных исследований (по данным корпусного анализа)

Permyakova T. M., Smirnova E. A., Новые исследования Тувы 2025 № 4 С. 122–136

The article presents a quantitative and qualitative analysis of English-language terms related to the study of migration.The sources used were research articles in the social sciences published between 2018 and 2020 in international first-quartile journals indexed in the Scopus database. The corpus-linguistic study addresses two objectives: to identify functioning systems of terms in scientific articles ...

Added: December 1, 2025

Preposition drop in Russian spoken by Mari and Beserman bilinguals

Yakovleva A., Kosheliuk N., Moroz G., International Journal of Bilingualism 2025 P. 1–19

Aims and Research Questions: In this paper, we present a corpus-based study of preposition drop (p-drop) in the speech of Mari-Russian and Beserman-Russian bilinguals compared to the speech of Russian monolinguals. Based on data from spoken corpora, we demonstrate that the prepositions v ‘in’, k ‘to’, s ‘with’ are omitted in the speech of bilinguals ...

Added: November 26, 2025

Вариативность годов vs. лет в русских говорах: корпусное исследование

Zemicheva S., Moroz G., Naccarato C., Вопросы языкознания 2025 № 6 С. 7–34

Наличие супплетивной формы лет в парадигме существительного год отличает русский язык от других восточнославянских. При этом в русских говорах вместо лет может использоваться вариант годов. Данные панхронического подкорпуса НКРЯ показывают, что форма годов, зафиксированная впервые в XV в., на всем протяжении истории русского языка была периферийной, в XVII–XVIII вв. использовалась преимущественно в нехудожественных текстах, а в ...

Added: November 12, 2025

Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus

Maslenikova A., Tatiana I. Popova, , in: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence 16187Vol. 16187: Lecture Notes in Artificial Intelligence.: Springer, 2025. P. 278–292.

This article presents a system for the automatic processing of user comments aimed at annotating speech and discourse formulas that actively function in everyday interaction, including digital communication. A Python-based program using the Telegram API was developed to automate the collection, filtering, and annotation of empirical data. In addition to building a user corpus, the ...

Added: October 19, 2025

27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II. Speech and Computer. Lecture Notes in Artificial Intelligence 16188

Springer, 2025.

This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or ...

Added: October 19, 2025

Политическая аккомодация культурных различий в индустриально развитых обществах (Political Accommodation of Cultural Differences in Industrialized Societies)

Малахов В. С., Симон М. Е., Летняков Д. Э. et al., / SSRN. Серия Social Science Research Network "Social Science Research Network". 2020.

The notion of “political accommodation” applied to the theory and practice of managing cultural diversity could enrich the Russian academic dictionary. Liberal democratic states invented specific mechanisms for political accommodation of cultural differences. Thanks to these mechanisms, the part of the population of a democratic state that is not ready to dissolve into the ethnocultural ...

Added: September 26, 2025

Национальная мощь современных государств: сравнительный анализ. Аналитический доклад

Melville A. Y., Каберник В. В., Mironyuk M. et al., / МГИМО МИД России. 2024.

Данный аналитический доклад является одним из результатов исследований в рамках консорциума НИУ ВШЭ и МГИМО. В нем прежде всего раскрыты вопросы концептуализации национальной мощи и сопутствующих категорий и дается обзор прецедентов. Далее рассматриваются вопросы операционализации предлагаемых нами компонентов национальной мощи. В следующих разделах доклада предлагается анализ вопросов методологии, используемой в докладе. На этой основе предложен ...

Added: September 19, 2025

Variation in a Narrative Corpus of Mano and Kpelle: Contact-Induced or Not?.

Khachaturyan M., Konoshenko M., Moroz G. et al., , in: N’yng-dyuumgu, n’yng-ngafq: Festschrift for Ekaterina GruzdevaVol. 126.: Helsinki: Studia Orientalia, 2025. P. 35–59.

This paper explores a corpus of spontaneous narratives and narrative retellings told by children and adults in Mano and Kpelle, two contacting Mande languages. It focuses on quotative constructions as a key point of grammatical dissimilarity between Mano and Kpelle. In the Mano speech of some bilingual children, however, these constructions are found to manifest ...

Added: September 5, 2025

Анализ тематики повседневных разговоров: экспертный подход и автоматические методы

Sherstinova T., Вепринцева Д. А., Человек: образ и сущность. Гуманитарные аспекты 2025 № 2(62) С. 89–108

В статье рассматриваются три разных подхода к изучению тематики повседневных разговоров: экспертная тематическая разметка и два автоматических метода (тематическое моделирование и кластеризация). Материалом для исследования послужили расшифровки русской устной повседневной речи из корпуса ОРД, подготовленные на основе звукозаписей спонтанных разговоров, выполненных в естественных коммуникативных ситуациях (дома, на работе, в учебном заведении, в магазине, в поликлинике ...

Added: September 3, 2025

ФУНКЦИОНИРОВАНИЕ ДИАЛЕКТА В КОММЕРЧЕСКОЙ РЕКЛАМЕ ЮЖНОЙ ГЕРМАНИИ

Пономарёва А. А., В кн.: LII Международная научная филологическая конференция имени Людмилы Алексеевны Вербицкой, 19–26 марта 2024 года, Санкт-Петербург: сборник тезисов.: СПб.: Издательство СПбГУ, 2024. С. 922–923.

Рассматривается роль диалекта в коммерческой баннерной и плакатной рекламе Южной Германии. Были проанализированы рекламные кампании, проведенные на территории Ба варии и Баден-Вюртемберга с 2010 по 2022 гг., с вкраплениями южнонемецких диалектов. Ре зультаты лингвопрагматического анализа свидетельствуют о том, что использование диалекта в региональных рекламных сообщениях детерминировано как коммуникативными запросами общества, так и функциями рекламы как ...

Added: September 2, 2025

Explicit continuum scale format reduces the ceiling effect in self-report questionnaires comparing to Likert response format

Antipkina I., Ivanov A., Guzhelya D., / Series WP BRP "Basic research program". 2024.

This study presents a methodology for developing a new questionnaire format called explicit continuum scenario scales, in the example of a client focus questionnaire. Elements of the Rasch Guttman scenario scale methodology were used in its development. In three consequent studies, different aspects of the scale functioning were investigated. In Study 1, on the sample ...

Added: February 21, 2025

Language In The Construction Of Ethnicity Among Koreans In Kazakhstan: A Case Study Of The Korean Community In Karaganda

Aitzhanov S., / NRU HSE. Series WP BRP "Linguistics". 2024. No. 116.

This study focuses on examining the role of language as an attribute in the construction of ethnicity within the Korean community in Kazakhstan. The research examines how language functions as an attribute in the categorization and identification processes, and how it interacts with other ethnic attributes such as descent and appearance. Drawing on qualitative methods, ...

Added: December 10, 2024

CONNECTING ANCIENT AND MODERN: THE MEDIEVAL PLOT ABOUT THE FOX AND THE DISCUSSION BETWEEN GOETHE AND SCHILLER ABOUT GERMAN EPIC POETRY

Микаелян А. Л., / NRU Higher School of Economics. Series WP BRP "Literary Studies". 2024. No. 28.

The article presents an attempt to examine Goethe's poem "Reineke Fox" in connection with the discussion between Goethe and Schiller about the nature of epic poetry and the principles of its renewal within the poetics of "Weimar Classicism". Goethe introduced into his interpretation of the medieval story of the Fox a number of innovations at various ...

Added: November 19, 2024

A Language Model for Grammatical Error Correction in L2 Russian

Remnev N., Obiedkov S., Rakhilina E. V. et al., / Series Computer Science "arxiv.org". 2023.

Grammatical error correction is one of the fundamental tasks in Natural Language Processing. For the Russian language, most of the spellcheckers available correct typos and other simple errors with high accuracy, but often fail when faced with non-native (L2) writing, since the latter contains errors that are not typical for native speakers. In this paper, ...

Added: October 30, 2024

You shall know a piece by the company it keeps. Chess plays as a data for word2vec models

Orekhov B., / Series Computer Science "arxiv.org". 2024.

In this paper, I apply linguistic methods of analysis to non-linguistic data, chess plays, metaphorically equating one with the other and seeking analogies. Chess game notations are also a kind of text, and one can consider the records of moves or positions of pieces as words and statements in a certain language. In this article ...

Added: August 8, 2024

How does Burrows' Delta work on medieval Chinese poetic texts?

Orekhov B., / Series Computer Science "arxiv.org". 2024.

Burrows' Delta was introduced in 2002 and has proven to be an effective tool for author attribution. Despite the fact that it was applied to different languages, they mostly belong to the same grammatical type and use the same graphic principle to convey speech in writing: a phonemic alphabet with word separation using spaces. The question ...

Added: August 8, 2024

Does Delta really confirm that Rowling and Galbraith are the same author?

Orekhov B., / Series Computer Science "arxiv.org". 2024.

Added: August 8, 2024

Difficulty overdose? Inconclusive effect of the disfluent font on reading in second language

Tsigeman-Gorenko E., Likhanov M., Kalinnikova L. et al., / Series 00 "00". 2024.

Multiple studies show that reading in hard-to-read (dysfluent) fonts can enhance memory and comprehension of learnt material, but it is unclear if this effect extends to second language (L2) learning. This study investigated the impact of dysfluent fonts on L2 text memorisation and comprehension, accounting for learners’ individual differences (gender, L2 anxiety, L2 proficiency and L1 vocabulary size) ...

Added: June 10, 2024

Digital resources on the Uralic languages of Siberia: an overview, evaluation and application.

Кошелюк Н. А., Fedotova I., / OSF Preprints. Серия 0 "Arts and Humanities". 2023.

The paper reviews digital resources on the Uralic languages of Siberia, defining their scope and testing applicability for different tasks in linguistic studies. We focus on the resources featuring at least one Uralic language of Siberia. A short case-study for Nenets outlines applicability of the resources and the types of data that can be obtained ...

Added: May 12, 2024