Using BERT to Classify Short Scientific Texts in Russian
P. 103–109.
This work studies approaches to training BERT-based classifiers of scientific articles, with the aim of building an application that deploys the best models in the infrastructure of VINITI RAS. To this end, a BERT language model was trained on a specialized corpus of scientific texts for subsequent use as the embedding component of the classifier. The paper reports the results of experiments on training models to classify scientific articles according to the first and second levels of the Russian State Rubricator of Scientific and Technical Information (SRSTI).
Абрегова З. Х., Dvoynikova A., in: Альманах научных работ молодых ученых Университета ИТМО.: Университет ИТМО, 2025. P. 487–493.
The article reviews methods of aspect-based sentiment analysis of text data, including both traditional rule-based methods and modern machine learning and deep learning algorithms. It presents a comparative analysis of datasets and methods for aspect-based sentiment analysis of texts, an important task in natural language processing and data analysis. ...
Added: April 25, 2026
Krasnov L., Malikov D., Kiseleva M. et al., Journal of Medicinal Chemistry 2026 Vol. 69 No. 8 P. 8838–8851
In this work, we developed a straightforward data-driven approach to predict the cytotoxicity of metal complexes based entirely on their (metal + ligands) composition. To this end, we have manually curated MetalCytoToxDB, a comprehensive experimental database comprising 26,500 IC50 values for 7050 metal complexes against 754 cell lines from 1921 articles. Based on these, machine learning ...
Added: April 23, 2026
Plesovskikh A., Journal of Applied Economic Research 2023 Vol. 22 No. 2 P. 323–354
Modern studies widely discuss the role of special economic zones in stimulating the economic growth and development of Russia, generating the necessary investment flows and increasing the country's innovative potential by expanding production in high-tech sectors of the economy with high added value. The purpose of the study is to model the process of generating ...
Added: April 13, 2026
Fedorov A., Вакку Г. В., Лебедева С. Э., Галактика медиа: журнал медиа исследований 2026 Vol. 8 No. 2 P. 163–182
With the increasing volume of data, university faculty may spend years processing and organizing information. Personalized assistance, content recommendations, data collection for literature reviews, and bibliographic citation formatting reinforce the role of artificial intelligence and neural network tools for scholarly communication. This paper discusses practical examples of using tools such as Elicit, SciSpace, Consensus, Undermind, ...
Added: April 7, 2026
Pakshin P., Legal Issues in the Digital Age 2026 Vol. 7 No. 1 P. 32–48
Artificial intelligence plays a significant role in automation, minimizing human intervention in fields such as medicine, art, and law. Despite the historically close relationship between art and technology, generative AI has expanded the potential for creative activity. A significant catalyst for this process has been the proliferation of pre-trained AI systems, which have accelerated the ...
Added: March 31, 2026
Gabdrahmanov R., Tsoy T., Martinez-Garcia E. et al., in: Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - (Volume 1) ICINCO 2024.: SciTePress, 2024. P. 511–518.
Computer simulations are growing in popularity in robotics research due to their near-zero cost of error and lower labor intensity. One of the necessary components of a simulation, in addition to a robot model, is a model of the world in which the robot operates. While it is always possible to construct a world model manually, ...
Added: March 17, 2026
Semenikhin T., Kornilov M., Pruzhinskaya M. et al., in: 26th International Conference, DAMDID/RCDL 2024, Nizhny Novgorod, Russia, October 23–25, 2024, Revised Selected Papers. Data Analytics and Management in Data Intensive Domains. (CCIS, volume 2641).: Springer, 2026. P. 211–219.
We considered two fundamentally different approaches to real-bogus classification within the Zwicky Transient Facility survey data. The first approach is based on neural networks that take sequences of object images as input. The second approach uses features extracted from light curves and classical machine learning methods. Several models for both approaches were tested. Quality metrics ...
Added: March 11, 2026
Глазкова А. В., Смаль И. В., Lyashevskaya O. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (formerly Доклады Академии Наук. Математика) 2025 Vol. 527 P. 146–155
This paper presents a study on the effectiveness of discriminative methods for abbreviation lemmatization in Russian texts. Unlike generative approaches, discriminative models select the optimal lemma from a fixed set of candidates, eliminating the risk of generating grammatically incorrect word forms. For the first time in Russian language processing, we conduct a comprehensive analysis of ...
Added: March 10, 2026
Glazkova A., Lyashevskaya O., Morozov D. et al., Journal of Mathematical Sciences 2025 Vol. 546 P. 32–47
This paper addresses the task of lemmatizing abbreviations in the Russian language. Abbreviation lemmatization is particularly challenging, as it involves not only transforming a word into its normal form but also correctly expanding the abbreviation. We explore two approaches to this task, both leveraging large pretrained language models. The first approach is generative, where the ...
Added: March 10, 2026
Maltseva S. V., Бериков В. Б., Кладов Д. Е. et al., in: Информатика и прикладная математика: Материалы X Международной научно-практической конференции (08.10 - 11.10.2025 г.) Т. 1: Сборник материалов часть 1.: Алматы: Институт информационных и вычислительных технологий КН МНВО РК, 2025. P. 227–232.
This paper examines the problem of clustering consumption patterns for a private household. An ensemble algorithm based on the Wasserstein metric was developed and applied to cluster daily load profiles. The proposed approach allows for identifying typical energy consumption scenarios and interpreting consumer behavior. Results from computational experiments using real data are presented. ...
Added: March 3, 2026
V. V. Zunin, A. I. Afonin, V. I. Anoshin et al., Automatic Documentation and Mathematical Linguistics 2025 Vol. 59 No. 5 P. 287–293
The development of an artificial intelligence-based language model for classifying English-language scientific articles by SRSTI codes is described. This improves the processes of reviewing and indexing scientific publications. A pre-processed dataset of scientific articles was used for training and testing the models. An architecture for cascade classification was developed, and the performance of models with ...
Added: February 11, 2026
Arinin O. V., Bakhmach D. M., Katsnelson A. et al., in: 2025 Systems of Signals Generating and Processing in the Field of on Board Communications.: IEEE, 2025. P. 1–5.
This research discusses a method for automating dataset collection for microwave filter synthesis by integrating machine learning techniques, thus reducing development time. Using a 3D electromagnetic analysis software package, the study involves simulating and collecting geometric parameters and amplitude-frequency characteristics from three variants of highly selective passband microstrip two-resonator combined filters with stepped impedance resonators. ...
Added: December 6, 2025
Psycholinguistic Analysis of Non-Professional Discourse on Corruption on the Social Network VKontakte
Кузнецова Ю. М., Maksimenko A., Станкевич М. А. et al., Мониторинг общественного мнения: Экономические и социальные перемены 2026 No. 1 P. 231–250
The article analyzes the non-professional online discourse on corruption found on the social network VKontakte. Unlike the institutional or expert approach, the focus on everyday, user-generated reflection makes it possible to capture the deep values, forms of expression and linguistic patterns of everyday morality, which is especially important in the context ...
Added: November 6, 2025
Kruchinskaia E., Вестник Российского университета дружбы народов. Серия: Политология 2025 Vol. 27 No. 3 P. 459–478
In the context of the current rise in global political tensions and widespread protests, the examination of affective polarization dynamics has become increasingly significant. This phenomenon, which previously dominated the American political scene and was studied through the lens of party affiliation, is now gaining attention in other countries as well. Affective polarization has been ...
Added: October 19, 2025
Индаков Г. С., Казначеев П. А., Майбук З. Я. et al., Геофизические исследования 2025 Vol. 26 No. 2 P. 99–124
The paper studies the clusterability of acoustic emission pulses during high-temperature heating of sandstone samples previously subjected to mechanical loading. Mechanical loading was applied in uniaxial mode up to a load close to destructive, with signs of large cracks appearing on the surface. After that, samples were subjected to thermal treatment up to 650 °C ...
Added: September 19, 2025
Chepikov I., Karpov I., in: 26th International Conference, AIED 2025, Palermo, Italy, July 22–26, 2025, Proceedings, Part I. Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium, Blue Sky, and WideAIED.: Springer, 2025. P. 352–358.
Modern language models such as BERT, ChatGPT, and DeepSeek have shown great potential in solving various tasks, including text classification, text generation, and document analysis and summarization. In this paper, we show that these models are close to classical ML approaches based on decision trees not only in text processing, but also in processing classical tabular data ...
Added: September 4, 2025
Wien: Association for Computational Linguistics, 2025.
Added: August 26, 2025
Delev A., Semakov S., in: 2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD).: IEEE, 2025. P. 318–322.
Profit is one of the most important economic indicators of a company’s performance, and for every company it is necessary to allocate resources in such a way as to obtain the maximum possible profit. The profit maximization problem is usually a dynamic optimization problem. This article discusses an approach to solving the production expansion problem ...
Added: August 25, 2025