A Searching Tool for Russian Error-Annotated Learner English Corpus

A. Fenogenova; E. Kuzmenko

NRU HSE, 2016.

Learner corpora are an effective resource for specialists in second language acquisition, foreign language teaching, and corpus linguistics. Such specialists often rely heavily on statistical tools of various kinds. To be of real use, however, a corpus must provide convenient and powerful tools for searching and manipulating its data. In this paper we focus on the search tools available in the Russian Error-Annotated Learner English Corpus (REALEC) and report our attempts to improve their format. We also provide evidence that database search is much more efficient than ordinary text search, and we demonstrate that search functionality is of great importance both for research efficiency and for the range of facilities a corpus can offer.
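The contrast between text search and database search drawn in the abstract can be illustrated with a minimal sketch. It does not reproduce the REALEC tool: the record layout, the error tags, the sample spans, and the function names below are all hypothetical, and SQLite is used only as a convenient stand-in for a database backend. The sketch assumes that each error annotation can be reduced to a (document, tag, span) triple.

```python
import sqlite3

# Hypothetical toy data: (document id, error tag, annotated span).
# These tags and spans are illustrative only, not taken from REALEC.
ANNOTATIONS = [
    ("essay_1", "Articles", "a information"),
    ("essay_1", "Spelling", "recieve"),
    ("essay_2", "Articles", "the nature"),
    ("essay_3", "Prepositions", "depend from"),
]


def text_search(records, tag):
    """Plain scan: every record is inspected on every query."""
    return [(doc, span) for doc, record_tag, span in records if record_tag == tag]


def build_database(records):
    """Load the records into an in-memory SQLite table indexed by error tag."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE annotations (doc_id TEXT, error_tag TEXT, span TEXT)"
    )
    conn.executemany("INSERT INTO annotations VALUES (?, ?, ?)", records)
    # The index lets the engine locate matching rows without a full table scan.
    conn.execute("CREATE INDEX idx_error_tag ON annotations (error_tag)")
    conn.commit()
    return conn


def database_search(conn, tag):
    """Indexed lookup: the query is answered through the error_tag index."""
    rows = conn.execute(
        "SELECT doc_id, span FROM annotations WHERE error_tag = ? ORDER BY rowid",
        (tag,),
    )
    return rows.fetchall()


conn = build_database(ANNOTATIONS)
print(text_search(ANNOTATIONS, "Articles"))
print(database_search(conn, "Articles"))
```

Both functions return the same matches. The difference is in how they get there: the scan touches every record on each query, while the indexed table can go directly to the rows carrying the requested tag, which is where the efficiency gain on a large corpus would be expected to come from.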

Research target: Philology and Linguistics

Priority areas: humanities; IT and mathematics

Language: English

Keywords: information retrieval; learner corpora; English as a foreign language; English as a second language; automatic search in corpora; corpus search

Publication based on the results of:

Lexicological Research Based on the REALEC Learner Corpus (Learner corpus REALEC: Lexicological observations) (2016)
