Automating the adaptation of texts for electronic textbooks: problems and prospects (the case of Russian)

В. Г. Сибирцева; Н. В. Карпов

Nová rusistika / Новая русистика. 2014. No. 1. P. 19–35.

The paper describes the experience of using authentic linguistic corpus materials within the project "Creating an electronic textbook of Russian as a foreign language". Special attention is paid to the fundamental principle of the new project: automatic adaptation of linguistic material from the Russian National Corpus (RNC). The software product is intended to adapt the complexity of authentic texts in terms of their syntactic and morphological structures and their vocabulary. The article also explains the stages required to reach this objective. It describes not only the algorithm for solving these tasks and the final result of the research, but also the difficulties the developers faced and their solutions.

Research target: Philology and Linguistics

Priority areas: humanities; IT and mathematics

Language: Russian

Keywords: Russian National Corpus; Russian as a foreign language; automatic adaptation of texts; morphology and syntax

Publication based on the results of:

Adaptation of RNC linguistic material for the electronic textbook "Russian as a Foreign Language" (2013)

No ‘iota’ type-shifter in Kazym Khanty

Tiutiunnikova V., Mikhailov Stiopa, Golosov F., Proceedings of Sinn und Bedeutung (Germany) 2025 No. 29 P. 1593–1608

In this paper, we present new challenging data from Kazym Khanty (a Uralic language spoken in Western Siberia, Russia): in this articleless language, bare singular and bare dual NPs in argument positions can receive indefinite readings on par with definite ones, contradicting the predictions of the classic neo-Carlsonian approach (Chierchia, 1998; Dayal, 2004). We argue ...

Added: January 30, 2026

The use of ordinal numerals in different semantic contexts (based on parallel translations of the New Testament)

Nasledskova P., Известия РАН. Серия литературы и языка 2025 Vol. 84 No. 6 P. 88–102

The paper compares the use of ordinal constructions in different semantic contexts in five languages: Russian, English, Spanish, Indonesian, and Rutul. The comparison is based on parallel translations of the New Testament. From six books of the New Testament (the canonical Gospels, the Acts of the Apostles, and the Revelation of John the Theologian), the verses were selected in which ordinal numerals are used in at least one of the sample languages. ...

Added: January 29, 2026

Using AI technologies in teaching students within the course "Academic Writing in English"

Gabrielova E., Магия ИННО 2025 Vol. 7 No. 1 P. 165–172

Artificial intelligence (AI) technologies are rapidly developing and are being widely applied in various fields, including education. The use of AI carries certain risks; however, one cannot completely reject it in student education. The article presents the experience of using AI in teaching English to 34 fourth-year students and 26 post-graduate students within the discipline ...

Added: January 29, 2026

Explorations in Applied Ethnolinguistics: Words, Cultures, and Global Perspectives

Palgrave Macmillan, 2025.

This volume contributes to the growing body of cutting-edge research into the Natural Semantic Metalanguage (NSM) approach in linguistics. It explores the broad range of possible applications enabled by the NSM approach, from linguistic studies of semantics and culture to cross-cultural studies, psychology and childhood education. The volume builds on previous studies, bringing a diversity ...

Added: January 28, 2026

The Epic of Gilgamesh. Translated by Nikolay Gumilev. Foreword by E. Markina. Introduction by V. Shileiko.

Markina E., Манн, Иванов и Фербер, 2025.

Publisher's annotation: The Epic of Gilgamesh is the oldest monument of world literature, which has come down to us from the depths of the Sumerian and Akkadian civilizations. The poem tells of the adventures of the mighty king of the city of Uruk and his friend Enkidu. It is a story of strength and friendship, pride and humility, the fear of death and the thirst for immortality. The poem is published in the translation by the Acmeist poet Nikolay Gumilev, with an explanatory article by the Assyriologist and the poet's contemporary Vladimir Shileiko, ...

Added: January 28, 2026

Semi-fake indexicals in Russian

Тискин Д. Б., Типология морфосинтаксических параметров 2025 Vol. 8 No. 1 P. 112–129

There are several rival theories of fake indexicals, i.e. bound indexicals (prominently pronouns) whose φ-features do not semantically contribute to focus alternatives (e.g. Only Mary did her homework, John didn’t do his). According to Minimal Pronoun theories (such as Kratzer’s or Wurmbrand’s), bound pronouns are Merged without φ-features and acquire them under binding via agreement-like ...

Added: January 26, 2026

Some modifications to I. Bassi's theory of bound uses of indexical expressions

Тискин Д. Б., Типология морфосинтаксических параметров 2024 Vol. 7 No. 1 P. 107–123

Fake indexicals (FIs), or bound-variable uses of e.g. 1st- and 2nd-person pronouns, have been analysed by Bassi (2021) as arising from a post-syntactic process of inspecting the features of the referent. This leads to a peculiar analysis of the syntax and semantics of relative clauses containing FIs. I argue for a more ...

Added: January 26, 2026

The Art of (Un)complicated Legal Writing: A Textbook

Knutov A., Chaplinskiy A., Мищенко П. А. et al., Moscow: Проспект, 2026.

The textbook offers recommendations on the style of legal writing that help make it clearer to readers. The first chapter systematizes the accumulated knowledge about the general stylistic features of the language of law and its place in the system of Russian speech. The subsequent chapters are devoted to particular types of legal documents: the language of statutes, of procedural documents, of contracts, and of legal analytical documents. ...

Added: January 26, 2026

From the correspondence of E. A. Millior with Ya. M. Borovsky (1946–1960)

Ermakova L., Вестник Удмуртского университета. Серия История и филология 2025 Vol. 35 No. 6 P. 1403–1422

The article publishes and analyzes the correspondence between the historian of antiquity Elena A. Millior (1900–1978) and the classical philologist Yakov M. Borovsky (1896–1994), covering the years 1946–1960 and preserved in the archives of the Institute of Russian Literature (Pushkin House) of the Russian Academy of Sciences and the Bibliotheca Classica Petropolitana in St. Petersburg. ...

Added: January 26, 2026

The Works of D. N. Mamin-Sibiryak and the Modern World

Moscow, Yekaterinburg: Кабинетный ученый, 2024.

The monograph examines the work of D. N. Mamin-Sibiryak, a classic of Ural and Russian literature of the 19th century. It studies and describes various aspects of his artistic world: axiological and ethical issues of both universal and national character, questions of geo- and ethnopoetics, the narrative organization of the texts and the writer's artistic language, Mamin's genealogy, and applied aspects of his work, including the presentation of the writer's legacy to a contemporary audience. The edition includes an index of Mamin-Sibiryak's works. The book is intended for ...

Added: January 26, 2026

Hegel's "Philosophy of Right" and the Kotzebue affair: the cultural and political context

Lagutina I., Философические письма. Русско-европейский диалог 2025 Vol. 8 No. 4 P. 165–201

This article examines the assassination of the playwright August von Kotzebue by the theology student K. L. Sand as an event reflecting the ideological and philosophical tensions of early nineteenth-century Germany. It analyzes G. W. F. Hegel’s response to this historical episode in the context of his “Philosophy of Right”, which criticizes ethical and religious ...

Added: January 25, 2026

Chinese: second foreign language: grade 5: textbook (8th edition, stereotyped, corresponding to the 6th, revised edition)

Sizova A., Чэнь Ф., Чжу Ч. et al., Moscow: Просвещение, 2025.

The textbook "Chinese. Second foreign language. Grade 5" from the series "Time to Learn Chinese!" was created jointly with the publisher People's Education Press (People's Republic of China) and is intended for students of general education institutions who begin studying Chinese as a second foreign language in grade 5. This textbook has been prepared in accordance with the requirements of the Federal State Educational Standard for Basic General Education (FGOS OOO), approved by Order of the Ministry of Education of the Russian Federation No. ...

Added: January 23, 2026

A Note on Pliny the Elder (HN 9. 126)

Shumilin M., Classical Quarterly 2025 Vol. 75 No. 1 P. 516–519

The article argues for an emendation in Plin. HN 9.126. Modern editors are accustomed to print the text cum testa uiuas, adopting J. Hardouin’s conjecture for cum terra uitis, the reading transmitted in most manuscripts. Nevertheless, the overlooked manuscript reading contritis conchis allows us to deduce a palaeographically neater solution contritis if conchis is considered a gloss which entered the text. ...

Added: January 22, 2026

Olybrius or an Unknown Neronian Poet? The Date and Authorship of the Einsiedeln Eclogues Reconsidered

Shumilin M., Materiali e Discussioni per l'Analisi dei Testi Classici 2025 Vol. 95 P. 141–181

The paper examines the arguments of the proponents of a post-Neronian dating for the Einsiedeln Eclogues and in particular Justin Stover’s theory that these poems were composed by Anicius Hermogenianus Olybrius, the consul of AD 395. It is suggested that Stover’s evidence, just like the arguments adduced by other scholars who suggest that the Einsiedeln ...

Added: January 22, 2026

Reported speech constructions in Chuvash: A corpus- and elicitation-based study

Knyazev M., Voprosy Jazykoznanija 2026 No. 1 P. 74–104

The paper is a descriptive survey of reported speech constructions in standard and Maloe Karachkino (Poshkart) Chuvash based on a typological questionnaire. On the basis of corpus data, it is shown that reported speech constructions vary depending on whether reported speech is introduced by a complementizer-like element in combination with an ordinary speech verb; directly by the ...

Added: January 22, 2026

The morphosyntactic status and semantics of the Shughni marker -ard: on the development of new case markers in Iranian languages

Падалка П. В., Ryzhova D., Чистякова Д. Г., Voprosy Jazykoznanija 2026 No. 1 P. 40–58

The article is dedicated to the morphosyntactic properties and grammatical functions of the Shughni marker -ard. Shughni, like many other Iranian languages, has a reduced case system that seems to be gradually evolving and expanding. We demonstrate that the marker -ard is one of the main candidates for the status of a new case marker ...

Added: January 21, 2026

PINDAR PYTHIAN 2.13–20: WHAT DOES HIERON HAVE IN COMMON WITH CINYRAS?

Akhunova O., Classical Philology 2026 Vol. 121 No. 1 P. 84–93

In this article I attempt to answer the question: why and on what basis does Pindar compare Hieron with king Cinyras? Pindar marks three points of similarity: in the “portrait” of Cinyras these points are “favorite of Apollo” and “priest of Aphrodite”; the third point is indicated only in the “portrait” of Hieron – this is ...

Added: January 21, 2026

Constructing China's image in the British media during international crises: a case study of The Times newspaper since February 2022

Yin Z., Вестник Пермского университета. Серия: Политология 2025 Vol. 19 No. 2 P. 130–142

International society has noted a series of interlinked events that have implications for the construction of national representations within media discourse. This study probes how The Times represented China's image during global crises, with special focus on the period after Russia initiated its special military operation on February 24, 2022. Although this provides key context, ...

Added: January 20, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al., / arxiv.org. Series cs.SI "Social and Information Networks". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A., / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: the original k packets are encoded into n coded packets, and the message is reconstructed after the first k successful deliveries, effectively shifting latency from the maximum packet delay to the k-th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

Меньшиков И. А., Бернадотт А. К., Elvimov N. S., / Series arXiv "Statistical mechanics". 2025.

Accurate segmentation of blood vessels in brain magnetic resonance angiography (MRA) is essential for successful surgical procedures, such as aneurysm repair or bypass surgery. Currently, annotation is primarily performed through manual segmentation or classical methods, such as the Frangi filter, which often lack sufficient accuracy. Neural networks have emerged as powerful tools for medical image ...

Added: December 1, 2025

Determining the boundary of dynamical chaos in the generalized Chirikov map via machine learning

Чернышов Д. П., Satanin A., Shchur L., / Series arXiv "math". 2025.

We investigate the boundary separating regular and chaotic dynamics in the generalized Chirikov map, an extension of the standard map with phase-shifted secondary kicks. Lyapunov maps were computed across the parameter space (K,K(α, τ)) and used to train a convolutional neural network (ResNet18) for binary classification of dynamical regimes. The model reproduces the known critical ...

Added: November 21, 2025

An efficient algorithm for stock market trading: a retrospective analysis based on S&P-500 data.

Rubchinskiy A., Chubarova D., / Series WP7 "Mathematical methods for decision making in economics, business and politics". 2025. No. WP7/2025/01.

The article examines one of the most famous examples of socio-economic systems, characterized by significant uncertainty – the S&P-500 stock market, where shares of 500 largest US companies are traded. No assumptions are made about the probabilistic characteristics of the stock market. A flexible algorithm for daily trading has been developed, based on both known fixed data ...

Added: November 9, 2025

Diffusion on language model embeddings for protein sequence generation

Meshchaninov V., Strashnov P., Shevtsov A. et al., / Cornell University. Series CoRR, arXiv:2403.03726 "Computing Research Repository". 2025.

Protein design requires a deep understanding of the inherent complexities of the protein universe. While many efforts lean towards conditional generation or focus on specific families of proteins, the foundational task of unconditional generation remains underexplored and undervalued. Here, we explore this pivotal domain, introducing DiMA, a model that leverages continuous diffusion on embeddings derived ...

Added: October 5, 2025