Russian Sentence Corpus: benchmark measures of eye movements in reading in Cyrillic

Laurinavichyute A., Sekerina I. A., Alekseeva S., Bagdasaryan K.

P. 439–443.

We describe the Russian Sentence Corpus (RSC), which establishes benchmarks of eye movements in reading in Cyrillic. The RSC design follows the cross-linguistic protocol of the Potsdam Sentence Corpus for German (Kliegl et al. 2004). The RSC consists of 144 sentences that include target words of three parts of speech (nouns, verbs, and adjectives), together with eye-tracking data from 96 young native speakers of Russian who read these sentences. We describe the basic characteristics of eye movements in reading Russian and compare them to those of German. In general, these characteristics are similar across the two languages, although Russian shows systematic differences in the way word length affects reading times, which we tentatively attribute to the morphological structure of Russian words.

Language: English

Keywords: reading; Russian language; eye movements; eye tracking

In book

Cognitive Science in Moscow: New Research. Conference proceedings, June 15, 2017.

Buki Vedi, 2017.

Juxtapositional vs. possessive-like encoding in Russian specificational constructions

Logvinova N., Russian Linguistics 2026 Vol. 50 Article 11

This paper presents the first in-depth corpus-based study of a previously overlooked syntactic variation in Russian: the competition between juxtapositional (Nominative) and possessive-like (Genitive) encoding of the second noun (the term) in specificational constructions (e.g., ponjatie čest’ (notion.NOM honor.NOM) vs. ponjatie česti (notion.NOM honor.GEN) ‘the notion of honor’). While typological research has established cross-linguistic preferences for one encoding strategy over another, intralinguistic variation ...

Added: May 18, 2026

Discriminative lemmatization of abbreviations in the LLM era

Глазкова А. В., Смаль И. В., Lyashevskaya O. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Vol. 527 P. 146–155

This paper presents a study on the effectiveness of discriminative methods for abbreviation lemmatization in Russian texts. Unlike generative approaches, discriminative models select the optimal lemma from a fixed set of candidates, eliminating the risk of generating grammatically incorrect word forms. For the first time in Russian language processing, we conduct a comprehensive analysis of ...

Added: March 10, 2026

Rubic2: Ensemble Model for Russian Lemmatization

Afanasev I., Glazkova A., Lyashevskaya O. et al., in: Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025). Association for Computational Linguistics, 2025. P. 157–170.

Pre-trained language models have significantly advanced natural language processing (NLP), particularly in analyzing languages with complex morphological structures. This study addresses lemmatization for Russian, where errors can critically affect the performance of information retrieval, question answering, and other tasks. We present the results of experiments on generative lemmatization using pre-trained language ...

Added: March 10, 2026

Transformer-based approaches for lemmatizing abbreviations in Russian texts

Glazkova A., Lyashevskaya O., Morozov D. et al., Journal of Mathematical Sciences 2025 Vol. 546 P. 32–47

This paper addresses the task of lemmatizing abbreviations in the Russian language. Abbreviation lemmatization is particularly challenging, as it involves not only transforming a word into its normal form but also correctly expanding the abbreviation. We explore two approaches to this task, both leveraging large pretrained language models. The first approach is generative, where the ...

Added: March 10, 2026

The legal status of compatriots living in post-Soviet countries amid an unstable international situation

Затулин К. Ф., Егоров В. Г., Докучаева А. В. et al., Moscow: Institute of Diaspora and Integration (Institute of CIS Countries), 2025.

The book "The legal status of compatriots living in post-Soviet countries amid an unstable international situation" presents the results of a study conducted in Abkhazia, Azerbaijan, Armenia, Belarus, Georgia, Kazakhstan, Kyrgyzstan, Latvia, Lithuania, Moldova, the Pridnestrovian Moldavian Republic, Tajikistan, Uzbekistan, Estonia, and South Ossetia. The study was carried out by the Institute of Diaspora and Integration (Institute of CIS Countries) in 2024. It included an analysis of regulatory and legal ...

Added: February 3, 2026

Moderation analysis of subjective well-being, self-efficacy, and academic performance of 4th grade children in Russia

Akhmedjanova D., Kanonire T., Zakharov A., Plos One 2026 Vol. 21 No. 2 Article e0341318

Research evidence exists on associations of subjective well-being, self-efficacy and various academic and non-academic outcomes with older students; however, there is a research gap on how these variables relate to each other in elementary school students. The goal of this cross-sectional study within a larger longitudinal project was to examine the role of subjective well-being ...

Added: February 3, 2026

Methods of teaching primary school children to read in Russian and English: similarities and differences

[s.n.], 2022.

The article highlights the importance of teaching children to read, along with its specific features and components; considers the main methods used to teach reading in both Russian and English; and compares the two languages. In addition, the article compares the methods of teaching reading ...

Added: January 31, 2026

Semi-fake indexicals in Russian

Тискин Д. Б., Типология морфосинтаксических параметров 2025 Vol. 8 No. 1 P. 112–129

There are several rival theories of fake indexicals, i.e. bound indexicals (prominently pronouns) whose φ-features do not semantically contribute to focus alternatives (e.g. Only Mary did her homework, John didn’t do his). According to Minimal Pronoun theories (such as Kratzer’s or Wurmbrand’s), bound pronouns are Merged without φ-features and acquire them under binding via agreement-like ...

Added: January 26, 2026

Some modifications to I. Bassi's theory of bound uses of indexical expressions

Тискин Д. Б., Типология морфосинтаксических параметров 2024 Vol. 7 No. 1 P. 107–123

Fake indexicals (FIs), or bound-variable uses of e.g. 1st- and 2nd-person pronouns, have been analysed by Bassi (2021) as arising from a post-syntactic process of inspecting the features of the referent. This leads to a peculiar analysis of the syntax and semantics of relative clauses containing FIs. I argue for a more ...

Added: January 26, 2026

Automatic detection of dyslexia based on eye movements during reading in Russian

Laurinavichyute A., Lopukhina A., Reich D., in: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics. Vol. 2: Short papers. Wien: Association for Computational Linguistics, 2025. P. 59–66.

Dyslexia, a common learning disability, requires an early diagnosis. However, current screening tests are very time- and resource-consuming. We present an LSTM that aims to automatically classify dyslexia based on eye movements recorded during natural reading combined with basic demographic information and linguistic features. The proposed model reaches an AUC of 0.93 and outperforms the ...

Added: January 19, 2026

Interplay between switching, inhibition, and mental attention: An exploratory eye-tracking study

Chuikova Z., Izmalkova A., Myachykov A. et al., Psychological Research 2026 Vol. 90 Article 19

Cognitive flexibility (CF) allows individuals to adapt their behavior to changing environmental demands. As task complexity increases, CF may substantially impact performance by facilitating a shift towards more efficient information processing strategies. However, its role in tasks with high cognitive demands remains largely unexplored. Furthermore, while CF is associated with inhibitory control and working memory ...

Added: January 13, 2026

The problem of forming national self-awareness in children in the process of studying their native language in the works of K. D. Ushinsky

Бизяева Н. Д., Проблемы современного образования 2025 No. 4 P. 134–141

This study is the result of understanding the views of K. D. Ushinsky on the problem of forming national self-awareness in children in the process of studying their native language. It was determined that the idea of nationality, expressed in the theoretical and axiological principles of K. D. Ushinsky, was quite clearly expressed in “The ...

Added: December 16, 2025

Detecting Ethnic Conflict in Social Media with Transformers and Augmented Data

Koltsova O., Surkov A., Procedia Computer Science 2025 Vol. 258 P. 2382–2390

Detection of mentions or discussion of ethnic conflict, or of verbal participation in it, in user-generated content is a socially important task, as such content has been shown to be related to ethnic clashes on the ground. Yet this task has not been ...

Added: November 28, 2025

Speech acts with polite diminutives: genre and discourse features

Fufaeva I., Вестник Волгоградского государственного университета. Серия 2: Языкознание 2025 Vol. 24 No. 4 P. 78–90

This study delves into speech acts utilizing diminutives for politeness, focusing on their discursive and genre-related aspects. It draws on authorial recordings of spoken discourse, data from the National Corpus of the Russian Language, and recordings of urban speech from the 1970s and late twentieth century. The research highlights the potential usage of polite diminutives in ...

Added: November 25, 2025

Interpretation of complex sentences with different types of matrix predicates in the context of negation and modal operators

Letuchiy A., Russian Linguistics 2025 Vol. 49 No. 2 Article 2

The article discusses types of interpretation that Russian complex sentences with factive, implicative and interpretation verbs get under negation and modal operators. By default, the external negative and modal context affects only the main situation. However, one finds exceptions to this rule. We call ‘transparent readings’ those readings in which the external context affects semantically both the matrix ...

Added: November 5, 2025

Analysis of eye and head tracking movements during shooting from the prone position in biathletes compared to novices

Kruchinina A. P., Polikanova I. S., Psychology. Journal of the Higher School of Economics 2025 Vol. 22 No. 3 P. 473–488

Biathlon shooting is one of the most crucial aspects determining an athlete's success. Several factors can affect shooting performance. This research was aimed at studying a range of eye and head movement parameters in biathletes of different skill levels as well as in novices in order to identify the most relevant and ...

Added: October 3, 2025

Gender stereotypes in agreement processing with role nouns: a study on Russian

Slioussar N., Antropova D., Frontiers in Psychology 2025 Vol. 16 Article 1619505

The majority of Russian nouns denoting professions and social roles are grammatically masculine. Some of them have feminine pairs, the others do not, but in modern Russian, most nouns in this group can be used to refer to women — either with masculine or with feminine agreement. This option has some interesting limitations that have ...

Added: September 22, 2025

New designations for men in youth slang

Krongauz M., Труды института русского языка им. В.В. Виноградова 2025 No. 3(45) P. 159–167

The article is devoted to modern youth slang, namely to the nominations of men that have appeared most recently: ank, masik, normis, sigma, skuf, tubik, chechik, shtrikh. It is noted that the words masik, tubik, chechik, shtrikh are often discussed together on the Internet and have common semantic and pragmatic characteristics. They denote types of ...

Added: September 17, 2025