Application of Machine Learning Methods to the Automatic Classification of Articles by UDC

А. Ю. Романов; К. Е. Ломотин; Е. С. Козлова

Информационные технологии. 2017. Vol. 23. No. 6. P. 418–423.

Romanov A., Ломотин К. Е., Козлова Е. С.

The paper examines how well modern machine learning methods apply to the automatic assignment of UDC codes to scientific articles. Artificial neural networks, logistic regression, and boosting are considered as classifiers. Graph algorithms and a prototype software module for generating UDC codes are developed.
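To make the classification step concrete, here is a minimal sketch of one of the classifier families named above, logistic regression, applied to a bag-of-words representation. It is not the authors' implementation: the toy corpus, the function names (`tokenize`, `train`, `classify`), the two-class setup, and the hyperparameters are all assumptions chosen for illustration.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus: (text, top-level UDC class). Not data from the paper.
# 004 = computer science and technology, 81 = linguistics and languages.
TRAIN = [
    ("neural network training algorithm software", "004"),
    ("database software algorithm compiler", "004"),
    ("network protocol software compiler", "004"),
    ("syntax morphology language grammar", "81"),
    ("phonetics language dialect grammar", "81"),
    ("morphology dialect syntax semantics", "81"),
]


def tokenize(text):
    """Lowercase whitespace tokenizer; a real system would also stem and filter."""
    return text.lower().split()


def sigmoid(z):
    # Clamp the logit so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def score(weights, bias, text):
    """Linear score of a text: bias plus weighted token counts."""
    counts = Counter(tokenize(text))
    return bias + sum(weights.get(tok, 0.0) * n for tok, n in counts.items())


def train(samples, positive, epochs=200, lr=0.5):
    """Fit binary logistic regression by stochastic gradient ascent."""
    weights, bias = defaultdict(float), 0.0
    for _ in range(epochs):
        for text, label in samples:
            target = 1.0 if label == positive else 0.0
            err = target - sigmoid(score(weights, bias, text))
            bias += lr * err
            for tok, n in Counter(tokenize(text)).items():
                weights[tok] += lr * err * n
    return weights, bias


def classify(model, text, positive="004", negative="81"):
    """Return the UDC class whose predicted probability is at least 0.5."""
    weights, bias = model
    return positive if sigmoid(score(weights, bias, text)) >= 0.5 else negative


if __name__ == "__main__":
    model = train(TRAIN, positive="004")
    print(classify(model, "compiler algorithm software"))
    print(classify(model, "grammar syntax dialect"))
```

A real rubricator would have to distinguish many UDC classes, for example with one such binary model per class, and would train on a much larger labeled corpus.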

Research target: Computer Science; Philology and Linguistics

Priority areas: IT and mathematics; engineering science

Language: Russian

Keywords: machine learning, logistic regression, artificial neural networks, UDC, text classification, boosting

No ‘iota’ type-shifter in Kazym Khanty

Tiutiunnikova V., Mikhailov Stiopa, Golosov F., Proceedings of Sinn und Bedeutung (Germany) 2025 No. 29 P. 1593–1608

In this paper, we present new challenging data from Kazym Khanty (a Uralic language spoken in Western Siberia, Russia): in this articleless language, bare singular and bare dual NPs in argument positions can receive indefinite readings on par with definite ones, contradicting the predictions of the classic neo-Carlsonian approach (Chierchia, 1998; Dayal, 2004). We argue ...

Added: January 30, 2026

The Use of Ordinal Numerals in Different Semantic Contexts (Based on Parallel Translations of the New Testament)

Nasledskova P., Известия РАН. Серия литературы и языка 2025 Vol. 84 No. 6 P. 88–102

The paper compares the use of ordinal constructions in different semantic contexts in five languages: Russian, English, Spanish, Indonesian, and Rutul. The comparison draws on parallel translations of the New Testament. From six books of the New Testament (the canonical Gospels, the Acts of the Apostles, and the Revelation of John the Theologian), verses were selected in which ordinal numerals are used in at least one of the sample languages. ...

Added: January 29, 2026

A Speech Signal Transformation Method for Improving Speech Intelligibility

Savchenko L., Савченко В. В., Радиотехника и электроника 2025 Vol. 70 No. 8 P. 753–760

The problem of improving speech intelligibility in voice communication systems is considered. The acute issue of speaker recognition when applying known methods for solving this problem is highlighted. To overcome the specified problem, a new method for transforming the speech signal based on an autoregressive model of the vocal tract and the principle of frequency-selective ...

Added: January 29, 2026

Specification Tests for Jump-Diffusion Models Based on the Characteristic Function

Belomestny D., Grobler G. L., Meintanis S. G. et al., International Statistical Review 2026 P. 1–31

Goodness-of-fit tests are suggested for several popular jump-diffusion processes. The suggested test statistics utilise the marginal characteristic function of the model and its L2-type discrepancy from an empirical counterpart. Model parameters are estimated either by minimising the aforementioned L2-type discrepancy or by maximum likelihood. A hybrid estimation method that uses moment estimation is also proposed ...

Added: January 29, 2026

Using AI Technologies to Teach Students in the Course "Academic Writing in English"

Gabrielova E., Магия ИННО 2025 Vol. 7 No. 1 P. 165–172

Artificial intelligence (AI) technologies are rapidly developing and are being widely applied in various fields, including education. The use of AI carries certain risks; however, one cannot completely reject it in student education. The article presents the experience of using AI in teaching English to 34 fourth-year students and 26 post-graduate students within the discipline ...

Added: January 29, 2026

Explorations in Applied Ethnolinguistics: Words, Cultures, and Global Perspectives

Palgrave Macmillan, 2025.

This volume contributes to the growing body of cutting-edge research into the Natural Semantic Metalanguage (NSM) approach in linguistics. It explores the broad range of possible applications enabled by the NSM approach, from linguistic studies of semantics and culture to cross-cultural studies, psychology and childhood education. The volume builds on previous studies, bringing a diversity ...

Added: January 28, 2026

The Epic of Gilgamesh. Translated by Nikolay Gumilev. Foreword by E. Markina. Introduction by V. Shileiko.

Markina E., Манн, Иванов и Фербер, 2025.

Publisher's annotation: The Epic of Gilgamesh is the oldest monument of world literature, reaching us from the depths of the Sumerian and Akkadian civilizations. The poem tells of the adventures of the mighty king of the city of Uruk and his friend Enkidu. It is a story of strength and friendship, pride and humility, the fear of death and the thirst for immortality. The poem is published in the translation by the Acmeist poet Nikolay Gumilev, with an explanatory article by the Assyriologist Vladimir Shileiko, a contemporary of the poet, ...

Added: January 28, 2026

An Analysis of Sequential Patterns in Datasets for Evaluation of Sequential Recommendations

Klenitskiy A., Володкевич А. А., Pembek A. et al., ACM Transactions on Recommender Systems 2026

Sequential recommender systems are an important and in-demand area of research. These systems aim to use the order of interactions in a user’s history to predict future interactions. The premise is that the order of interactions and sequential patterns play an essential role. Therefore, it is crucial to use datasets that exhibit a sequential structure ...

Added: January 28, 2026

Semi-fake indexicals in Russian

Тискин Д. Б., Типология морфосинтаксических параметров 2025 Vol. 8 No. 1 P. 112–129

There are several rival theories of fake indexicals, i.e. bound indexicals (prominently pronouns) whose φ-features do not semantically contribute to focus alternatives (e.g. Only Mary did her homework, John didn’t do his). According to Minimal Pronoun theories (such as Kratzer’s or Wurmbrand’s), bound pronouns are Merged without φ-features and acquire them under binding via agreement-like ...

Added: January 26, 2026

Some Modifications to I. Bassi's Theory of Bound Uses of Indexical Expressions

Тискин Д. Б., Типология морфосинтаксических параметров 2024 Vol. 7 No. 1 P. 107–123

Fake indexicals (FIs), or bound-variable uses of e.g. 1st- and 2nd-person pronouns, have been analysed by Bassi (2021) as arising from a post-syntactic process of inspecting the features of the referent. This leads to a peculiar analysis of the syntax and semantics of relative clauses containing FIs. I argue for a more ...

Added: January 26, 2026

Autoregressive generation strategies for Top-K sequential recommendations

Klenitskiy A., Гусак Д. И., Володкевич А. А. et al., User Modelling and User-Adapted Interaction 2025 No. 35 Article 13

The goal of modern sequential recommender systems is often formulated in terms of next-item prediction. In this paper, we explore the applicability of transformer-based generative models for the Top-K sequential recommendation task, where the goal is to predict items that a user is likely to interact with in the “near future.” This goal aligns with ...

Added: January 26, 2026

The Art of (Not So) Simple Legal Writing. A Textbook

Knutov A., Chaplinskiy A., Мищенко П. А. et al., Moscow: Проспект, 2026.

The textbook offers recommendations on the style of legal writing that help make it clearer to readers. The first chapter systematizes the accumulated knowledge about the general stylistic features of the language of law and its place in the speech system of the Russian language. The subsequent chapters are devoted to particular types of legal documents: the language of statutes, of procedural documents, of contracts, and of legal analytical documents. ...

Added: January 26, 2026

Marchenko–Pastur Law for Spectra of Random Weighted Bipartite Graphs

Nadutkina A., Tikhomirov A., Timushev D., Siberian Advances in Mathematics 2025 Vol. 34 P. 146–153

We study the spectra of random weighted bipartite graphs. We establish that, under specific assumptions on the edge probabilities, the symmetrized empirical spectral distribution function of the graph’s adjacency matrix converges to the symmetrized Marchenko-Pastur distribution function. ...

Added: January 26, 2026

From the Correspondence of E. A. Millior with Ya. M. Borovsky (1946–1960)

Ermakova L., Вестник Удмуртского университета. Серия История и филология 2025 Vol. 35 No. 6 P. 1403–1422

The article publishes and analyzes the correspondence between the historian of antiquity Elena A. Millior (1900–1978) and the classical philologist Yakov M. Borovsky (1896–1994), covering the years 1946–1960 and preserved in the archives of the Institute of Russian Literature (Pushkin House) of the Russian Academy of Sciences and the Bibliotheca Classica Petropolitana in St. Petersburg. ...

Added: January 26, 2026

The Works of D. N. Mamin-Sibiryak and the Modern World

Moscow, Yekaterinburg: Кабинетный ученый, 2024.

The monograph examines the work of D. N. Mamin-Sibiryak, a nineteenth-century classic of Ural and Russian literature. It investigates and describes various aspects of his artistic world: axiological and ethical issues of both universal and national character, questions of geo- and ethnopoetics, the narrative organization of the texts and the writer's literary language, Mamin's genealogy, and applied aspects of his work, including the presentation of the writer's legacy to a contemporary audience. The edition includes an index of Mamin-Sibiryak's works. The book is intended for ...

Added: January 26, 2026

Hegel's "Philosophy of Right" and the Kotzebue Affair: The Cultural and Political Context

Lagutina I., Философические письма. Русско-европейский диалог 2025 Vol. 8 No. 4 P. 165–201

This article examines the assassination of the playwright August von Kotzebue by the theology student K. L. Sand as an event reflecting the ideological and philosophical tensions of early nineteenth-century Germany. It analyzes G. W. F. Hegel’s response to this historical episode in the context of his “Philosophy of Right”, which criticizes ethical and religious ...

Added: January 25, 2026

Tests as Assessment Instruments in Universities: Challenges and Solutions

Antipkina I., Иванущенко А. В., Калабина И. А. et al., Мир психологии. Научно-методический журнал 2025 No. 4(123) P. 295–316

Low-quality test items pose significant risks of biased and inaccurate assessment in higher education. In this study, multi-disciplinary test banks were examined, first, using classical test theory and then using a Large Language Model (Grok). Our findings reveal a number of problems in university test items due to methodological shortcomings rather than content inaccuracies. Based ...

Added: January 22, 2026

Automatic detection of dyslexia based on eye movements during reading in Russian

Laurinavichyute A., Lopukhina A., Reich D., in: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics. Vol. 2: Short papers. Vienna: Association for Computational Linguistics, 2025. P. 59–66.

Dyslexia, a common learning disability, requires an early diagnosis. However, current screening tests are very time- and resource-consuming. We present an LSTM that aims to automatically classify dyslexia based on eye movements recorded during natural reading combined with basic demographic information and linguistic features. The proposed model reaches an AUC of 0.93 and outperforms the ...

Added: January 19, 2026

Applying Machine Learning to Forecast Volatility and Improve Trading Strategies on the Russian Stock Market

Lysenok N., Фундаментальная и прикладная математика 2025 Vol. 25 No. 4 P. 90–107

The aim of the study is to assess to what extent modern machine learning methods can improve the accuracy of forecasting the volatility of Russian stocks and whether such improvements lead to real advantages when applied in investment strategies. The work combines a review of theoretical approaches to volatility analysis with empirical research based on ...

Added: January 16, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al. / arxiv.org. Series cs.SI "Social and Information Networks". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Aggression in Digital Interactions: The Effect of Toxicity in Online Gaming Communication

Iuliia Naidenova, Parshakov P., Matkin N., Journal of Content, Community and Communication 2025 Vol. 23 Article 5

This study analyzes the computer-mediated text communication of non-professional video game players. The purpose of this study is to identify the impact of player communication toxicity on team performance. The dataset comprises 42,720 matches played between November 5 and November 18, 2015, including game statistics and chat messages. We use a BERT model to classify ...

Added: December 29, 2025

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A. / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: original packets are encoded into coded packets, and the message is reconstructed after the first successful deliveries, effectively shifting latency from the maximum packet delay to the -th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

High-accuracy eosinophil detection in eosinophilic esophagitis histological images using machine learning model YOLO11

Astaf’ev A. V., Maslenkina K. S., Mikhaleva L. M. et al., Доказательная гастроэнтерология 2025

Objective: To evaluate the effectiveness of the YOLO11 machine learning model for automated segmentation and detection of eosinophils in histological images in order to improve the diagnostic accuracy of eosinophilic esophagitis (EoE). Materials and methods: A multicenter retrospective analysis was conducted using histological images obtained through whole slide imaging (WSI) from 60 patients diagnosed with EoE. Out of 653 tissue section images, 54 were manually annotated and reviewed. The annotated dataset was then used to train the YOLO11 model. Results: By the 150th training epoch, the model demonstrated consistent improvement in precision ...

Added: December 23, 2025

Compression-Induced Lattice Tilting Quenches Ion Migration at Metal Halide Perovskite Grain Boundaries: A Machine Learning Molecular Dynamics Study

Mikhail R. Samatov, Liu D., Emir S. Amirov et al., The Journal of Physical Chemistry Letters 2025 Vol. 16 No. 51 P. 13068–13074

Ion migration at grain boundaries (GBs) is a key issue leading to the performance degradation of metal halide perovskites (MHPs). Given the weak lattice interactions, the properties of MHPs are highly sensitive to external strain, which is inevitable in practical applications. Nevertheless, a fundamental understanding of the GB behavior under strain is still lacking. Using ...

Added: December 20, 2025