Analysis of a Company Model in Conditions of Unstable Demand Using Reinforcement Learning Methods

Delev A.; Semakov S.

doi:10.1109/ICAIBD64986.2025.11081959

Publications

?

Analysis of a Company Model in Conditions of Unstable Demand Using Reinforcement Learning Methods

P. 318–322.

Delev A., Semakov S.

Profit is one of the most important economic indicators of a company’s performance, and for every company it is necessary to allocate resources in such a way as to obtain the maximum possible profit. The profit maximization problem is usually a dynamic optimization problem. This article discusses an approach to solving the production expansion problem using reinforcement learning (RL) methods. The task is to determine the company’s long-term strategy: how to use the company’s resources to expand production to maximize profits in the long term. The paper examines the possibility of applying reinforcement learning algorithms to similar problems and, to confirm this possibility, compares the results of the analytical solution of the problem using classical methods of optimal control theory with the results of RL models. Also in the work, both methods solve the problem of expanding production in conditions of unstable demand and show that RL methods converge to an analytical solution. The results of reinforcement learning algorithms and classical solution methods are compared based on the final profit that the company will receive in the long term over a 10-year horizon.

Language: English

DOI

Text on another site

Keywords: machine learning reinforcement learning optimal control

In book

2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD)

IEEE, 2025.

Разработка микросервиса ADP для идентификации источников выбросов на основе машинного обучения с подкреплением

Kychkin A., Chernitsin I., Прикладная информатика 2026 № 1(121) С. 40–58

The results of the development of a software microservice embedded in atmospheric air quality monitoring systems to support the identification of industrial pollution sources are presented. The emission and subsequent spread of harmful substances in the lower layers of the atmosphere is dynamic and characterized by high uncertainty due to the specific features of technological ...

Added: April 23, 2026

Machine Learning Approach to Anticancer Activity Prediction of Transition-Metal Complexes Based on a Large-Scale Experimental Database

Krasnov L., Malikov D., Kiseleva M. et al., Journal of Medicinal Chemistry 2026 Vol. 69 No. 8 P. 8838–8851

In this work, we developed a straightforward data-driven approach to predict the cytotoxicity of metal complexes based entirely on their (metal + ligands) composition. To this end, we have manually curated MetalCytoToxDB─a comprehensive experimental database comprising 26,500 IC50 values for 7050 metal complexes against 754 cell lines from 1921 articles. Based on these, machine learning ...

Added: April 23, 2026

Особые экономические зоны Российской Федерации: моделирование решений потенциальных резидентов и процесса их генерации

Plesovskikh A., Journal of Applied Economic Research 2023 Т. 22 № 2 С. 323–354

Modern studies widely discuss the role of special economic zones in stimulating the economic growth and development of Russia, generating the necessary investment flows and increasing the country's innovative potential by expanding production in high-tech sectors of the economy with high added value. The purpose of the study is to model the process of generating ...

Added: April 13, 2026

Replacing Criterion of Creativity with Criterion of Investment for Results Created by Artificial Intelligence

Pakshin P., Legal Issues in the Digital Age 2026 Vol. 7 No. 1 P. 32–48

Artificial intelligence plays a significant role in automation, minimizing human intervention in fields such as medicine, art, and law. Despite the historically close relationship between art and technology, generative AI has expanded the potential for creative activity. A significant catalyst for this process has been the proliferation of pre-trained AI systems, which have accelerated the ...

Added: March 31, 2026

A Tool for Mass Generation of Random Step Environment Models with User-Defined Landscape Features

Gabdrahmanov R., Tsoy T., Martinez-Garcia E. et al., , in: Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - (Volume 1) ICINCO 2024.: SciTePress, 2024. P. 511–518.

Computer simulations are growing in popularity in robotics research due to their near-zero cost of error and lower labor intensity. One of necessary components of a simulation, in addition to a robot model, is a model of a world in which the robot operates. While it is always possible to construct a world model manually, ...

Added: March 17, 2026

Real-Bogus Classification for ZTF Data Releases: Two Approaches

Semenikhin T., Kornilov M., Pruzhinskaya M. et al., , in: 26th International Conference, DAMDID/RCDL 2024, Nizhny Novgorod, Russia, October 23–25, 2024, Revised Selected Papers. Data Analytics and Management in Data Intensive Domains. (CCIS, volume 2641).: Springer, 2026. P. 211–219.

We considered two fundamentally different approaches to real-bogus classification within the Zwicky Transient Facility survey data. The first approach is based on neural networks that take sequences of object images as input. The second approach uses features extracted from light curves and classical machine learning methods. Several models for both approaches were tested. Quality metrics ...

Added: March 11, 2026

Кластеризация паттернов потребления электроэнергии умного дома на основе ансамблевых методов машинного обучения

Maltseva S. V., Бериков В. Б., Кладов Д. Е. et al., В кн.: Информатика и прикладная математика: Материалы X Международной научно-практической конференции (08.10 - 11.10.2025 г.)Т. 1: Сборник материалов часть 1.: Алматы: Институт информационных и вычислительных технологий КН МНВО РК, 2025. С. 227–232.

This paper examines the problem of clustering consumption patterns for a private household. An ensemble algorithm based on the Wasserstein metric was developed and applied to cluster daily load profiles. The proposed approach allows for identifying typical energy consumption scenarios and interpreting consumer behavior. Results from computational experiments using real data are presented. ...

Added: March 3, 2026

Method of Automated Dataset Collection for Microwave Filters Synthesis

Arinin O. V., Bakhmach D. M., Katsnelson A. et al., , in: 2025 Systems of Signals Generating and Processing in the Field of on Board Communications.: IEEE, 2025. P. 1–5.

This research discusses the method of dataset collection automatization for microwave filter synthesis by integrating machine learning techniques, thus reducing development time. Utilizing the 3D electromagnetic analysis software package, the study involves simulation and collecting geometric parameters and amplitude-frequency characteristics from three variants of passband highly selective microstrip tworesonator combined filters with stepped impedance resonators. ...

Added: December 6, 2025

Optimal Control for Stochastic Multi-agent Systems With the Use of Parallel Hybrid Genetic Algorithm

Akopov A. S., Beklaryan A., , in: Numerical Computations: Theory and Algorithms. 4th International Conference, NUMTA 2023, Pizzo Calabro, Italy, June 14–20, 2023, Revised Selected Papers, Part IVol. 14476.: Springer Publishing Company, 2025. P. 273–280.

In modern times, stochastic large-scale multi-agent systems (MAS) aimed at supporting socio-economic planning are being developed. There is a well known problem of a high computational complexity task of an optimal control for multiple agents’ behaviour in models of random interactions. In particular, agents (such as sellers and buyers) should make individualised decisions on establishing ...

Added: November 23, 2025

SPIRAL-LIKE SOLUTIONS IN OPTIMAL CONTROL PROBLEMS WITH CONTROL IN A DISK

Ronzhina M., Manita L., , in: Systems Analysis: Modeling and Control: Materials of the International Conference in memory of Academician A.V. Kryazhimskiy, Moscow, January 23–24, 2024. Abstracts.: -, 2024. P. 25–26.

For some class of small-dimensional optimal control problems we found a family of extremals in the form of logarithmic spirals. These extremals reach the singular surface in a finite time, while the control performs an infinite number of rotations around the circle. ...

Added: October 8, 2025

Artificial Neural Networks and Machine Learning. ICANN 2025 International Workshops and Special Sessions: 34th International Conference on Artificial Neural Networks, Kaunas, Lithuania, September 9–12, 2025, Proceedings, Part V

Cham: Springer, 2025.

This book constitutes the refereed proceedings of 34th International Workshops which were held in conjunction with the 34th International Conference on Artificial Neural Networks and Machine Learning, ICANN 2025, held in Kaunas, Lithuania, September 9–12, 2025. The 20 full papers and 8 abstracts included in this workshop volume were carefully reviewed and selected from 42 submissions. ...

Added: September 29, 2025

ОТСЛЕЖИВАНИЕ РАЗВИТИЯ РАЗРУШЕНИЯ С ПОМОЩЬЮ КЛАСТЕРИЗАЦИИ ИМПУЛЬСОВ ТЕРМИЧЕСКИ СТИМУЛИРОВАННОЙ АКУСТИЧЕСКОЙ ЭМИССИИ ПРИ ОТСУТСТВИИ ЛОКАЦИИ

Индаков Г. С., Казначеев П. А., Майбук З. Я. et al., Геофизические исследования 2025 Т. 26 № 2 С. 99–124

The paper studies the clusterability of acoustic emission pulses during high-temperature heating of sandstone sample preliminarily subjected to mechanical loading. Mechanical loading was applied in uniaxial mode up to load close to destructive with appearance of signs of large cracks on the surface. After that, samples were subjected to thermal treatment up to 650 °C ...

Added: September 19, 2025

Rewriting the Rules: LLMs Vs. Traditional ML in University Admissions

Chepikov I., Karpov I., , in: 26th International Conference, AIED 2025, Palermo, Italy, July 22–26, 2025, Proceedings, Part I. Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium, Blue Sky, and WideAIED.: Springer, 2025. P. 352 – 358.

Modern LLM models such as BERT, ChatGPT, DeepSeek have shown great potential in solving various tasks, including text classification, text generation, analysis and summary of documents. In this paper, we show that these models close to classical ML approaches based on decision trees not only in text processing, but also in processing classical tabular data ...

Added: September 4, 2025

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics

Wien: Association for Computational Linguistics, 2025.

Added: August 26, 2025

Pseudo-collusion in a centralized algorithmic financial market

Pastushkov A., Boulatov A., Finance Research Letters 2025 Vol. 83 Article 107671

Recent studies have increasingly explored whether reinforcement learning algorithms can give rise to cooperative behavior that results in non-competitive pricing across various market settings. In financial markets, Cartea et al. (2022) show that market makers using multi-armed bandit (MAB) algorithms generally converge to competitive pricing in quote-driven over-the-counter (OTC) markets, barring some unlikely exceptions where ...

Added: June 19, 2025

Экономические и социальные аспекты атомной энергетики в условиях развития технологий искусственного интеллекта

Podchufarov A., Galkina A. N., Ванина С. С. et al., Экономика и управление: проблемы, решения 2025 Т. 5 № 4 С. 61–74

Under modern conditions, the introduction of artificial intelligence technologies is becoming a significant factor in the development of high-tech industries. The article presents the results of a study of the prospects for the use of intelligent analytical systems in nuclear energy. The experience of foreign countries is analyzed and the features of successful projects using ...

Added: June 5, 2025

Periods of high uncertainty: How fertility intentions in Russia changed during 2022–2023

Vakulenko E., Gorskiy D., Kondrateva V. et al., Demographic Research 2025 Vol. 52 P. 939–970

BACKGROUND We study fertility intentions change in Russia, during the period of socio-economic shocks in 2022-2023, in response to the Russia-Ukraine armed conflict. OBJECTIVE Our objective is to identify factors that influence decision-making in a low fertility context during the crisis, including both objective characteristics and subjective assessment of the current situation. METHODS This paper is based on unique survey ...

Added: May 6, 2025

Prospects for Big Text Data Application in Technology Maturity Assessment (Publications Review)

Loginova I., Grozovskiy F., Aksenova A., Automatic Documentation and Mathematical Linguistics 2025 Vol. 59 No. 3 P. 145–153

The paper analyzes the limitations of conventional methods for assessing the maturity of technology, such as the S-curve, technology readiness level (TRL), Gartner’s hype cycle and their dependence on experts’ opinions. Current approaches to this task based on big text data analysis and machine learning algorithms are reviewed, and their advantages are demonstrated. As a ...

Added: April 28, 2025

Application of Physics-Informed Neural Networks for Solving the Inverse Advection-Diffusion Problem to Localize Pollution Sources

Derkach D., Efremenko D., Чупров И. А. et al., / Series Computer Science "arxiv.org". 2025. No. 2503.18849.

Added: March 25, 2025

The beer game bullwhip effect mitigation: a deep reinforcement learning approach

Rozhkov M., Alyamovskaya N., Zakhodiakin G., International Journal of Production Research 2025 Vol. 63 No. 18 P. 6630–6647

This article investigates the application of reinforcement learning (RL) methods to optimise a four-echelon linear supply chain model with stochastic demand. The proposed supply chain configuration is largely based on the production-distribution supply chain of the MIT Supply Chain Beer Game. We show that RL can significantly improve ordering efficiency and overall supply chain performance. ...

Added: March 24, 2025