Параллельные белорусско-русский и русско-белорусский корпусы: совместный проект Национального корпуса русского языка

Д. В. Сичинава; Т. А. Архангельский

Publications

?

Параллельные белорусско-русский и русско-белорусский корпусы: совместный проект Национального корпуса русского языка

С. 54–60.

Sichinava D., Arkhangelskiy T.

Language: Russian

Keywords: корпусная лингвистика corpus linguistics parallel corpora параллельные корпуса

In book

Корпусы национальных языков: модели и технологии. Труды Казанской школы по компьютерной и когнитивной лингвитике TEL-2012

Каз.: Издательство «Фэн» Академии наук Республики Татарстан, 2012.

Зачем нужен поэтический корпус и как его использовать

Korchagin K., Русская речь 2019 Т. 6 С. 113–127

Поэтический корпус в составе Национального корпуса русского языка — инструмент для исследователей русской поэзии и поэтическо го языка. Корпус содержит обширную коллекцию русской поэзии XVIII ХХ веков, отражает все заметные поэтические направления и продол жает пополняться. В нем присутствуют два типа разметки — граммати ческая и стиховедческая. Если первая совпадает с разметкой в основ ном ...

Added: June 19, 2026

Syntactic functions of non-manuals in Russian Sign Language

Burkova S., Khristoforova E., Kimmelman V., , in: Advances in Sign Language Corpus Linguistics.: John Benjamins Publishing Company, 2023. P. 90–129.

This chapter presents the Russian Sign Language (RSL) Corpus and demonstrates its capabilities as a research tool by summarizing three corpus-based studies primarily focused on syntactic functions of nonmanual markers. The first study considers question marking in regular wh-questions and in question-answer pairs. It shows that the two constructions have very different nonmanual markers. The second study analyzes marking of ...

Added: June 3, 2026

Focus on vocabulary. Экономика материальных и нематериальных активов: корпусный словарь и ИИ-упражнения по английскому языку

Gorina O. G., Kucherenko S., Larisa K. et al., СПб.: Астерион, 2026.

This textbook is an integrated teaching and learning resource for English for Specific Purposes (ESP) in the field of economics of tangible and intangible assets. Its design employs (i) modern corpus linguistics methods, including frequency analysis and keyword extraction based on authentic texts reflecting current trends in professional discourse, and (ii) artificial intelligence technologies for ...

Added: May 16, 2026

Российская социология в условиях цифровизации общества: результаты анализа корпуса научных текстов

Smirnov A., Социологические исследования 2023 № 4 С. 39–50

Using the analysis of a corpus of texts from eight leading Russian sociological journals, the article examines the impact of the digitalization of society on sociology in 2000–2021. Frequency analysis of 13.8 thousand scientific texts tracked the introduction of concepts related to digitalization into academic circulation. The article reveals the differences between the journals, due ...

Added: March 18, 2026

Promotional adjectives in grant proposal abstracts: a corpus study

Dmitriy S. Tulyakov, Tatiana M. Permyakova, Ekaterina A. Balezina, Вестник Волгоградского государственного университета. Серия 2: Языкознание 2025 Vol. 24 No. 6 P. 58–67

By effectively integrating promotional discourse into grant proposal abstracts, researchers can more compellingly present their ideas and increase their chances of securing funding. Implications of promotional adjectives in grant writing might differ across various research fields. This study aims to explore the use of promotional adjectives in abstracts of research grant proposals in six research ...

Added: March 2, 2026

Kirina M., Лукьянчикова А. С., В кн.: Язык в эпоху цифровых трансформаций и развития искусственного интеллекта : Сборник научных статей по итогам II Международной научной конференции Минск, 23–24 октября 2025 г.: Мн.: БГУИЯ, 2025. С. 74–85.

В статье рассматриваются характерные особенности гороскопических текстов как части астрологического дискурса. Материалом исследования выступает представительная выборка ежедневных предсказаний на русском языке, опубликованных в открытых группах социальной сети «ВКонтакте», суммарным объемом 1185425 словоупотреблений. С использованием методов корпусной и компьютерной лингвистики анализируются содержательные лексические единицы – как общие, так и отличительные для каждого знака зодиака (в сопоставлении ...

Added: February 28, 2026

Динамика восприятия площадей в пространстве города носителями русского языка (сравнительный анализ по данным НКРЯ)

Belova P., В кн.: Актуальные вопросы лингвистики и литературоведения: сборник научных статей по материалам международной научной конференции памяти доктора филологических наук, профессора Л.А. Араевой (6–8 февраля 2025).: Кемеровский государственный университет, 2025. С. 155–160.

This article contains research results on the dynamics of squares’ perception in the city space in the Russian language picture of the world over time, starting from the second half of the XXth century to the present. Turning to the subcorpus of literary texts of the second half of the XXth century and the XXIst ...

Added: February 4, 2026

Языковая концептуализация пространства в художественном тексте (по данным НКРЯ)

Belova P., В кн.: Когнитивные исследования языка. Вып. №1 (62): материалы Международной научной конференции по когнитивной лингвистике. 5-7 июня 2025. Ч. 2Ч. 2. Кн. 62. Вып. 1.: ТюмГУ-Press, 2025. С. 56–60.

Данная статья представляет результаты изучения содержания концепта ПРОСТРАНСТВО в русском языковом сознании на материале художественных прозаических текстов разных жанров, созданных во второй половине XX века и в XXI веке и представленных в НКРЯ. Анализ проведен с учетом таких культурно-языковых фильтров, как пропозициональные установки, предметно-понятийные корреляции и метафорические преобразования. ...

Added: February 4, 2026

Два подхода к дифференциации терминов миграционных исследований (по данным корпусного анализа)

Permyakova T. M., Smirnova E. A., Новые исследования Тувы 2025 № 4 С. 122–136

The article presents a quantitative and qualitative analysis of English-language terms related to the study of migration.The sources used were research articles in the social sciences published between 2018 and 2020 in international first-quartile journals indexed in the Scopus database. The corpus-linguistic study addresses two objectives: to identify functioning systems of terms in scientific articles ...

Added: December 1, 2025

Preposition drop in Russian spoken by Mari and Beserman bilinguals

Yakovleva A., Kosheliuk N., Moroz G., International Journal of Bilingualism 2025 P. 1–19

Aims and Research Questions: In this paper, we present a corpus-based study of preposition drop (p-drop) in the speech of Mari-Russian and Beserman-Russian bilinguals compared to the speech of Russian monolinguals. Based on data from spoken corpora, we demonstrate that the prepositions v ‘in’, k ‘to’, s ‘with’ are omitted in the speech of bilinguals ...

Added: November 26, 2025

Вариативность годов vs. лет в русских говорах: корпусное исследование

Zemicheva S., Moroz G., Naccarato C., Вопросы языкознания 2025 № 6 С. 7–34

Наличие супплетивной формы лет в парадигме существительного год отличает русский язык от других восточнославянских. При этом в русских говорах вместо лет может использоваться вариант годов. Данные панхронического подкорпуса НКРЯ показывают, что форма годов, зафиксированная впервые в XV в., на всем протяжении истории русского языка была периферийной, в XVII–XVIII вв. использовалась преимущественно в нехудожественных текстах, а в ...

Added: November 12, 2025

Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus

Maslenikova A., Tatiana I. Popova, , in: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence 16187Vol. 16187: Lecture Notes in Artificial Intelligence.: Springer, 2025. P. 278–292.

This article presents a system for the automatic processing of user comments aimed at annotating speech and discourse formulas that actively function in everyday interaction, including digital communication. A Python-based program using the Telegram API was developed to automate the collection, filtering, and annotation of empirical data. In addition to building a user corpus, the ...

Added: October 19, 2025

27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II. Speech and Computer. Lecture Notes in Artificial Intelligence 16188

Springer, 2025.

This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or ...

Added: October 19, 2025

Variation in a Narrative Corpus of Mano and Kpelle: Contact-Induced or Not?.

Khachaturyan M., Konoshenko M., Moroz G. et al., , in: N’yng-dyuumgu, n’yng-ngafq: Festschrift for Ekaterina GruzdevaVol. 126.: Helsinki: Studia Orientalia, 2025. P. 35–59.

This paper explores a corpus of spontaneous narratives and narrative retellings told by children and adults in Mano and Kpelle, two contacting Mande languages. It focuses on quotative constructions as a key point of grammatical dissimilarity between Mano and Kpelle. In the Mano speech of some bilingual children, however, these constructions are found to manifest ...

Added: September 5, 2025

Анализ тематики повседневных разговоров: экспертный подход и автоматические методы

Sherstinova T., Вепринцева Д. А., Человек: образ и сущность. Гуманитарные аспекты 2025 № 2(62) С. 89–108

В статье рассматриваются три разных подхода к изучению тематики повседневных разговоров: экспертная тематическая разметка и два автоматических метода (тематическое моделирование и кластеризация). Материалом для исследования послужили расшифровки русской устной повседневной речи из корпуса ОРД, подготовленные на основе звукозаписей спонтанных разговоров, выполненных в естественных коммуникативных ситуациях (дома, на работе, в учебном заведении, в магазине, в поликлинике ...

Added: September 3, 2025