Распределённые представления редких слов русского языка, учитывающие векторы однокоренных слов

?

Распределённые представления редких слов русского языка, учитывающие векторы однокоренных слов

Научно-техническая информация. Серия 2: Информационные процессы и системы. 2021. № 1.

Malafeev A., Мальтина Л. П.

In press

The paper proposes algorithms that perform automatic morphemic analysis of words and methods of distributed representations of words that indirectly use information about the morphemic composition through the averaging of vectors of same-root words. Morphemic analysis models for the Russian language are evaluated on samples of common and rare words. Several methods are proposed for obtaining distributed representations of rare words based on word2vec representations of same-root words. Our experiments have shown that on the problem of determining the semantic proximity of a pair of words, the proposed methods yield results that are comparable to the results of the fastText model or surpass them.

Research target: Computer Science

Priority areas: IT and mathematics

Language: Russian

Full text

Publication based on the results of:

Эффективные методы распознавания мультимедийных данных для задач анализа предпочтений пользователей мобильных устройств (2019)

Influence of the Normal Magnetic Component to Magnetotail Current Sheet Forma

Domrin V. I., Malova H. V., V. Yu. Popov et al., Cosmic Research 2026 Vol. 64 No. 2 P. 238–252

During magnetospheric perturbations a relatively thin current sheet with thickness about several proton gyroradii forms in the Earth’s magnetotail. In a framework of the kinetic model describing current sheet thinning in the magnetotail, the processes of its formation are investigated depending on the normal magnetic field magnitude which affects both the current sheet structure and particle dynamics within ...

Added: April 27, 2026

Asymmetric Equilibrium Structures of Superthin Current Sheets: The Asymmetry of Plasma Sources

Tsareva O. O., Malova H. V., V. Yu. Popov et al., Plasma Physics Reports 2026 Vol. 52 No. 2 P. 179–185

The influence of asymmetry of plasma sources on the structure and spatial localization of a superthin current sheet (STCS) supported by demagnetized electrons is studied using a self-consistent model. The simulation takes into account the presence of a single plasma source in the northern hemisphere, which makes the plasma flow asymmetric. It is demonstrated that the asymmetry of ...

Added: April 27, 2026

WWW '26: The ACM Web Conference 2026

Association for Computing Machinery (ACM), 2026.

It is our great pleasure to welcome you to the 35th edition of the Web Conference to be held on June 29 – July 3, 2026, in Dubai, United Arab Emirates. Following discussions with our partners and key stakeholders, we have taken the decision to postpone the ACM Web Conference 2026, initially planned for April 2026. ...

Added: April 23, 2026

Разработка микросервиса ADP для идентификации источников выбросов на основе машинного обучения с подкреплением

Kychkin A., Chernitsin I., Прикладная информатика 2026 Т. 21 № 1 С. 40–58

The results of the development of a software microservice embedded in atmospheric air quality monitoring systems to support the identification of industrial pollution sources are presented. The emission and subsequent spread of harmful substances in the lower layers of the atmosphere is dynamic and characterized by high uncertainty due to the specific features of technological ...

Added: April 23, 2026

2026 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)

IEEE, 2026.

Added: April 21, 2026

What Drives Multi-Chain Crypto Forecasting: Model Choice, Feature Selection, and Transferability

Wang M., Xiao Y., Braslavski P. et al., Mathematics 2026 Vol. 14 No. 8 Article 1286

Increasingly shaped by heterogeneous on-chain activity rather than a single shared market process, this study investigates 7-day-ahead forecasting using 147 market and on-chain indicators across eight major blockchain ecosystems from October 2023 to April 2025. We benchmark statistical, deep-learning, and foundation-model baselines under multiple feature-selection pipelines using both error metrics and Diebold–Mariano tests. TiRex achieves ...

Added: April 20, 2026

Cross-influence of two societies in deterministic evolutionary game

Shchur L., Antonov D., Burovski E., International Journal of Bifurcation and Chaos in Applied Sciences and Engineering 2026 P. 2650132-1–2650132-9

We present a simple model that simulates the possible influence of one society on another. Specifically, two societies evolve deterministically according to the well-known Nowak-May spatial game with the addition of mutual influence through connections that reflect the current states of the societies. This may be related to the influence of a global information resource ...

Added: April 20, 2026

Ising models on the hydrogen peroxide and other lattices

Qin X., Deng Y., Shchur L. et al., / Series arXiv "math". 2026. No. 2603.02962.

We perform a Monte Carlo analysis of the Ising model on many three-dimensional lattices. By means of finite-size scaling we obtain the critical points and determine the scaling dimensions. As expected, the critical exponents agree with the three-dimensional Ising universality class for all models. The irrelevant field, as revealed by the correction-to-scaling amplitudes, appears to ...

Added: April 20, 2026

Algorithmic overlaps as thermodynamic variables: from local to cluster Monte Carlo dynamics in critical phenomena

Pilé I., Deng Y., Shchur L., / Series arXiv "math". 2026. No. 2604.10254.

We investigate the spatial overlap of successive spin configurations in Markov chain Monte Carlo simulations using the local Metropolis algorithm and the Svendsen-Wang and Wolff cluster algorithms. We examine the dynamics of these algorithms for two models in different universality classes: the Ising model and the Potts model with three components. The overlap of two ...

Added: April 20, 2026

Проектирование сети Интернета вещей на основе многокритериальной оптимизации и информационного моделирования здания

Ebraheem A., Информационные процессы 2025 Т. 25 № 4 С. 787–798

The article proposes a method for planning the placement of access points and gateways inside buildings for constructing Internet of Things networks. The basis of the method is the use of information from a building information model, which makes it possible to easily take into account both the geometry and the physical and technical characteristics ...

Added: April 19, 2026

Modeling cosolvent effects on solubility in supercritical CO2 using data-driven approaches

Makarov D. M., Kalikin N., Gurikov P. et al., Journal of Supercritical Fluids 2026 Vol. 235 Article 106979

Supercritical CO2 (scCO2 ) is an environmentally friendly solvent, but its low polarity limits the solubility of polar compounds. Cosolvents are commonly used to enhance solvation capability, yet comprehensive datadriven studies are scarce. We compiled the largest dataset to date — 4401 experimental solubility records with 22 cosolvents for 93 nonionic solutes, plus 4855 records ...

Added: April 19, 2026

2026 28th International Conference on Digital Signal Processing and its Applications (DSPA)

IEEE, 2026.

A.S. Popov Russian Science and Technical Society with support from V. A. Trapeznikov Institute of Control Sciences, V.A. Kotelnikov Institute of Radio Engineering and Electronics, Autex Ltd. is leading the ХХVIII International Conference «Digital Signal Processing and its Applications — DSPA-2026» ...

Added: April 18, 2026

WWW '26: Proceedings of the ACM Web Conference 2026

NY: Association for Computing Machinery (ACM), 2026.

Added: April 17, 2026

Сопоставление номенклатур товаров ресторанов и поставщиков с помощью LLM — Case Study для ресторанного холдинга

Jin S., Panfilov P., Сулейкин А. С., Труды Института системного программирования РАН 2025 Т. 37 № 6 С. 163–176

In the modern restaurant business, accurate mapping of product nomenclatures between restaurants and suppliers is a critical task. Effective inventory management and procurement optimization directly impact business profitability. With the increase in suppliers and product variety, traditional mapping methods become less efficient. This study proposes using large language models (LLM) to automate and improve the ...

Added: April 17, 2026

Имитационное моделирование. Теория и практика (ИММОД 2025)

СПб.: АО "ЦТСС", 2025.

В научном издании представлены труды Двенадцатой всероссийской научно-практической конференции по имитационному моделированию и его применению в науке и промышленности «Имитационное моделирование. Теория и практика» (ИММОД-2025) по следующим направлениям: - теоретические основы и методология имитационного и комплексного моделирования; - методы исследования и оценки качества моделей, валидация и верификации моделей; - методы и системы распределенного моделирования; - ...

Added: April 17, 2026

Proceedings of the 2025 INTERNATIONAL CONFERENCE "QUALITY MANAGEMENT, DIGITAL SECURITY, INFORMATION TECHNOLOGIES" (2025 QM&DS&IT)

IEEE, 2025.

The themes for this year's conference were chosen as a means of bringing together academics and industrialists, engineering and management research, and providing a basis for discussion of issues arising across the engineering and business community in relation to Quality Management, Information Technologies, Digital Security aimed at developing engineers and managers for the future. The ...

Added: April 17, 2026

Proceedings of IEMTRONICS 2025 International IoT, Electronics and Mechatronics Conference, Volume 1

Singapore: Springer, 2026.

This book gathers selected research papers presented at IEMTRONICS 2025 (International IoT, Electronics and Mechatronics Conference), held during 3–5 April 2025 in London, United Kingdom, in hybrid mode. This book presents a collection of state-of-the-art research work involving cutting-edge technologies in the field of IoT, electronics mechatronics, and related areas. The work is presented in ...

Added: April 11, 2026

Restricted inverse optimal value problem on linear programming under weighted l1 norm

Jia J., Guan X., Pardalos P. M., Journal of Computational and Applied Mathematics 2026 Vol. 486 Article 117687

We study the restricted inverse optimal value problem on linear programming under weighted l1 norm (RIOVLP1). Given a linear programming problem (LP with a feasible solution x0 and a value K, we aim to adjust the cost vector c to such that x0 becomes an optimal solution of the problem (LP) whose objective value equals K. The objective function is to minimize the distance under weighted l1 norm. First, we reformulate ...

Added: April 10, 2026

Using predefined vector systems to speed up neural network multimillion class classification

Gabdullin N., Androsov I., / Series Computer Science "arxiv.org". 2026.

Label prediction in neural networks (NNs) has O(n) complexity proportional to the number of classes. This holds true for classification using fully connected layers and cosine similarity with some set of class prototypes. In this paper we show that if NN latent space (LS) geometry is known and possesses specific properties, label prediction complexity can ...

Added: April 2, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al., / arxiv.org. Серия cs.SI "Social and Information Networks ". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A., / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: original packets are encoded into coded packets, and the message is reconstructed after the first successful deliveries, effectively shifting latency from the maximum packet delay to the -th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

Меньшиков И. А., Бернадотт А. К., Elvimov N. S., / Series arXie "Statistical mechanics". 2025.

Accurate segmentation of blood vessels in brain magnetic resonance angiography (MRA) is essential for successful surgical procedures, such as aneurysm repair or bypass surgery. Currently, annotation is primarily performed through manual segmentation or classical methods, such as the Frangi filter, which often lack sufficient accuracy. Neural networks have emerged as powerful tools for medical image ...

Added: December 1, 2025

Determining the boundary of dynamical chaos in the generalized Chirikov map via machine learning

Чернышов Д. П., Satanin A., Shchur L., / Series arXiv "math". 2025.

We investigate the boundary separating regular and chaotic dynamics in the generalized Chirikov map, an extension of the standard map with phase-shifted secondary kicks. Lyapunov maps were computed across the parameter space (K,K(α, τ)) and used to train a convolutional neural network (ResNet18) for binary classification of dynamical regimes. The model reproduces the known critical ...

Added: November 21, 2025

Эффективный алгоритм торговли на фондовом рынке: ретроспективный анализ, основанный на данных по S&P-500.

Rubchinskiy A., Chubarova D., / Series WP7 "Математические методы анализа решений в экономике, бизнесе и политике". 2025. No. WP7/2025/01.

The article examines one of the most famous examples of socio-economic systems, characterized by significant uncertainty – the S&P-500 stock market, where shares of 500 largest US companies are traded. No assumptions are made about the probabilistic characteristics of the stock market. A flexible algorithm for daily trading has been developed, based on both known fixed data ...

Added: November 9, 2025