Erratological Markup of the Corpus of Russian Student Texts: Tactical Decisions


Trudy Instituta russkogo yazyka im. V. V. Vinogradova (Proceedings of the V. V. Vinogradov Russian Language Institute). 2019. Vol. XXII. Pp. 11–22.

The Corpus of Russian Student Texts (CoRST), created by the School of Linguistics
of the Faculty of Humanities at the National Research University Higher School of
Economics, includes answers to various questions, argumentative statements, essays,
course papers, and other genres. The texts were written by students in Bachelor's
degree programs, either spontaneously (in the classroom) or with preparation
(at home).
In the process of studying academic writing, students pass through different stages of
understanding how to structure academic texts.
At each stage, the interference of different styles and genres, the heterogeneous nature
of the speech patterns students have absorbed, and low levels of self-correction inevitably
lead to systematic errors in grammar and grammatical stylistics, semantics, and text pragmatics.
The deviations from standard speech reflect both the staged development of academic writing
skills and the processes characteristic of speech system dynamics in general; the formation
of new customary (usual) norms on the remains of obsolete (conservative) norms
demonstrates the limits of variability in the usage of words and word forms.
These deviations are marked by a system of tags developed and optimized by the Corpus
team (N. A. Zevakhina, S. S. Dzhakupova, Yu. M. Kuvshinskaya, S. Yu. Puzhaeva,
with active assistance from colleagues and students).
The error markup contains lexical, morphological, and discursive information.
The grammatical section shows that frequent deviations from morphological and
syntactic patterns are connected with the loosening of a number of constructions.
Problem areas include the expanding range of ‘light’ verbs (units that are devoid
of semantic value and serve the syntactic needs of a statement, with the lexical
meaning delegated to a governed word); the choice of case for governed nouns;
comparative and intensifying constructions; and anaphoric usage.
The article considers specific examples marked with the tag “agreement error” (agr)
and discusses the reasoning behind choosing a particular tag for a given text fragment.

Research target: Philology and Linguistics

Priority areas: humanities

Language: Russian

Publication based on the results of:

The dynamics of the standard and the marginal in Russian language (2019)
