Scale-free memory model for multiagent reinforcement learning. Mean field approximation and rock-paper-scissors dynamics

?

Scale-free memory model for multiagent reinforcement learning. Mean field approximation and rock-paper-scissors dynamics

The European Physical Journal B. 2010. Vol. 76. No. 1. P. 69–85.

Lubashevsky I., Kanemoto S.

A continuous time model for multiagent systems governed by reinforcement learning with scale-free memory is developed. The agents are assumed to act independently of one another in optimizing their choice of possible actions via trial-and-error search. To gain awareness about the action value the agents accumulate in their memory the rewards obtained from taking a specific action at each moment of time. The contribution of the rewards in the past to the agent current perception of action value is described by an integral operator with a power-law kernel. Finally a fractional differential equation governing the system dynamics is obtained. The agents are considered to interact with one another implicitly via the reward of one agent depending on the choice of the other agents. The pairwise interaction model is adopted to describe this effect. As a specific example of systems with non-transitive interactions, a two agent and three agent systems of the rock-paper-scissors type are analyzed in detail, including the stability analysis and numerical simulation. Scale-free memory is demonstrated to cause complex dynamics of the systems at hand. In particular, it is shown that there can be simultaneously two modes of the system instability undergoing subcritical and supercritical bifurcation, with the latter one exhibiting anomalous oscillations with the amplitude and period growing with time. Besides, the instability onset via this supercritical mode may be regarded as “altruism self-organization”. For the three agent system the instability dynamics is found to be rather irregular and can be composed of alternate fragments of oscillations different in their properties.

Research target: Physics Psychology Mathematics

Численное моделирование полевой эмиссии из полупроводникового катода в вакуум

Borisov V., Danilov V., Электросвязь 2025 № 12 С. 73–84

Представлены результаты математического моделирования процесса полевой эмиссии из катода малых размеров – одного из основных физических процессов, обеспечивающих работы многих электронных устройств, в частности FED-дисплеев (устройства, работающие на принципе полевой эмиссии), кантилеверов и т.д. Дается краткий обзор текущих результатов в области исследования, обосновывается актуальность задачи, приводятся примеры наиболее вероятного использования результатов решения задачи. Обсуждаются физические ...

Added: May 29, 2026

Методическая концепция развития навыков саморегуляции у студентов-музыкантов в классе вокала

Ван Г., Торопова А. В., Музыкальное искусство и образование 2025 Т. 13 № 4 С. 73–91

Contemporary vocal pedagogy faces a complex set of methodological challenges that encompass not only the search for ways to improve technical vocal skills for the realization of artistic vision, but also the development of self-control and self-regulation in future singers, as well as the entire creative process. A successful artist cannot function without the ability ...

Added: May 29, 2026

ИССЛЕДОВАНИЕ АССОЦИАЦИИ ГЕНЕТИЧЕСКИХ ВАРИАНТОВ С РАЗВИТИЕМ МУЗЫКАЛЬНЫХ СПОСОБНОСТЕЙ ЧЕЛОВЕКА

Kazantseva A. V., A.V. Toropova, Khusnutdinova E. K. et al., ВАВИЛОВСКИЙ ЖУРНАЛ ГЕНЕТИКИ И СЕЛЕКЦИИ, Федеральный исследовательский центр Институт цитологии и генетики Сибирского отделения Российской академии наук» (ИЦиГ СО РАН) (Новосибирск) 2025 Vol. 30 No. 3 P. 470–481

The development of musical abilities, including absolute pitch, musical memory, rhythm sense, and musicality, at a high degree is determined by a hereditary component (up to 68 %). The studies implementing a genome-wide linkage and association approach to musical aptitude have revealed more than 100 genetic loci. This spectrum is comprised of the genes encoding ...

Added: May 29, 2026

Electrical networks and data analysis in phylogenetics

Gorbounov Vassily, Kazakov A., Data Analytics and Topology 2025 Vol. 1 No. 1 P. 33–45

A classic problem in data analysis is studying the systems of subsets defined by either a similarity or a dissimilarity function on X which is either observed directly or derived from a data set. For an electrical network there are two functions on the set of the nodes defined by the resistance matrix and the response ...

Added: May 28, 2026

Почему растущие доходы не делают людей счастливее: эмоциональное объяснение парадокса Истерлина

Vorchik A., Вопросы экономики 2026

This work is devoted to a theoretical explanation of the Easterlin paradox, according to which long-term economic growth does not make average level of people's happiness increasing. By happiness, we mean the intensity of emotions people experience while comparing their new income with its expected value, or the target income with its original value. In the first case, ...

Added: May 28, 2026

Электростатически управляемый контроль диссипации в двумерном наноэлектромеханическом резонаторе через напряжение-амплитудный антагонизм с рекордной 928% подстройкой добротности.

Arutyunov K., JIANG1 Q., FANG J. et al., Science China Information Sciences 2026 Vol. 69 No. 6 P. 1–11

Resonators based on nanoelectromechanical systems (NEMS) using two-dimensional (2D) materials with high-quality factors and excellent electrical control are critical for tunable coherent phonon dynamics, resonant sensors and wireless communications. However, their performance is fundamentally limited by the lack of a unified framework governing energy dissipation mechanisms and their electrical tunability. Here, we synergistically modulate both ...

Added: May 28, 2026

Enhanced Terahertz Thermoelectricity Via Engineered Van Hove Singularities and Nernst Effect in Moiré Superlattices

Elesin L., Shilov A., Jana S. et al., Advanced Functional Materials 2026 P. 1–10

Thermoelectric materials, long explored for energy harvesting and thermal sensing, convert heat directly into electrical signals. Extending their application to the terahertz (THz) frequency range opens opportunities for low-noise, bias-free THz detection, yet conventional thermoelectrics lack the sensitivity required for practical devices. Thermoelectric coefficients can be strongly enhanced near van Hove singularities (VHS), though these ...

Added: May 28, 2026

Sweet-taste liking is associated with preference for less risky and immediate rewards in economic decision-making

Давидович А. С., Shestakova A., Arzumanyan N. et al., Frontiers in Psychology 2026 Vol. 17 - 2026 P. 1–18

Background: Delay discounting refers to the tendency to choose sooner, smaller rewards over larger, later rewards. Many previous studies link this tendency positively to reward sensitivity, yet the specific mechanisms behind this association remain poorly understood. Reward sensitivity may relate to delay discounting through at least three possible pathways: increased sensitivity to reward size, increased sensitivity ...

Added: May 27, 2026

Non-linear in-band interference cancellation on base of conjugate gradients method

Degtyarev A., Bakhurin S., Yudin N., DSPA 2026 P. 1–6

This paper investigates one possible solution to the problem of self-interference cancellation (SIC) arising in the design of in-band full-duplex (IBFD) communication systems. Self-interference cancellation is performed in the digital domain using multilayer nonlinear models adapted via gradient-based optimization. The presence of local minima and saddle points during the adaptation of multilayer models limits the ...

Added: May 26, 2026

New Numerical Invariants of an Unfolding of a Polycycle “Tears of the Heart”

Ilyashenko Y., Shilin I., Stanislav Minkov, Russian Journal of Mathematical Physics 2026 Vol. 33 No. 1 P. 89–106

In this paper, new numerical invariants of structurally unstable vector fields in the plane are found. One of the main tools is an improved asymptotics of sparkling saddle connections that occur when a separatrix loop of a hyperbolic saddle breaks. Another main tool is a new topological invariant of two arithmetic progressions, both perturbed and unperturbed, on the ...

Added: May 26, 2026

Heritability of Functional Literacy: Evidence from a Classical Twin Design

Kolachev N., Kovaleva G., Behavior Genetics 2026

Functional literacy—the ability to apply reading, mathematical, and scientific knowledge in authentic contexts as operationalized by the PISA framework—is a key predictor of educational attainment, labour-market outcomes, and economic growth. Despite extensive behavioral-genetic research on cognitive ability, the heritability of competency-based literacy measures remains largely unexamined, particularly outside Western populations. The present study addresses this ...

Added: May 26, 2026

ADDITIVE AUTOMORPHISMS OF REGULAR MATRIX GRAPH

Gusev I., Maksaev A., Promyslov V., Journal of Mathematical Sciences 2025 Vol. 299 No. 6

The regular graph of the space of n × m matrices over a field F is defined as the undirected graph whose vertices are matrices of rank min(n, m), and distinct matrices A and B are connected by an edge if and only if rk(A + B) < min(n, m). In this paper, for |F| ...

Added: May 25, 2026

How virtual urban green spaces influence stress and risk-taking

Dorri Sedeh S., Kosonogov V., Kerimova N. et al., Frontiers in Psychology 2026 Vol. 17 Article 1710257

Introduction: As cities continue to grow, access to natural environments is becoming more limited, contributing to increased stress levels in urban populations. Panoramic 360° videos provide a creative and scalable means of simulating natural environments, potentially reducing stress in city residents under controlled settings. Here, we examined whether short immersive experiences in different urban environments support ...

Added: May 25, 2026

Novelty, Category and Orientation Tuning for Printed Characters: A Magnetoencephalography Study with Fast Periodic Visual Stimulation

Kochetkova Ekaterina, Kostanian D., Martynova O. et al., Brain Topography 2026 Vol. 39 No. 4 Article 51

Letter recognition is assumed to involve several levels of analysis, including coarse tuning for category and novelty and more fine tuning for specific features, related to letter orientation. We employed an oddball fast periodic visual stimulation (FPVS) paradigm with magnetoencephalography (Elekta VectorView, 306 sensors) to study neural discrimination responses in the source space. Using contrasts ...

Added: May 24, 2026

Ising models on the hydrogen peroxide and other lattices

Qian X., Deng Y., Shchur L. et al., Physica A: Statistical Mechanics and its Applications 2026 Vol. 696 P. 1–13

We perform a Monte Carlo analysis of the Ising model on many three-dimensional lattices. By means of finite-size scaling we obtain the critical points and determine the scaling dimensions. As expected, the critical exponents agree with the three-dimensional Ising universality class for all models. The irrelevant field, as revealed by the correction-to-scaling amplitudes, appears to ...

Added: May 24, 2026

Coping with AI errors with provable guarantees

Tyukin I., Tyukina T., van Helden D. P. et al., Information Sciences 2024 Vol. 678 Article 120856

AI errors pose a significant challenge, hindering real-world applications. This work introduces a novel approach to cope with AI errors using weakly supervised error correctors that guarantee a specific level of error reduction. Our correctors have low computational cost and can be used to decide whether to abstain from making an unsafe classification. We provide ...

Added: May 23, 2026

Overcoming the Curse of Dimensionality with Synolitic AI

Zaikin A., Sviridov I., Sosedka A. et al., Technologies 2026 Vol. 14 No. 2 Article 84

High-dimensional tabular data are common in biomedical and clinical research, yet conventional machine learning methods often struggle in such settings due to data scarcity, feature redundancy, and limited generalization. In this study, we systematically evaluate Synolitic Graph Neural Networks (SGNNs), a framework that transforms high-dimensional samples into sample-specific graphs by training ensembles of low-dimensional pairwise ...

Added: May 23, 2026

Разработка микросервиса ADP для идентификации источников выбросов на основе машинного обучения с подкреплением

Kychkin A., Chernitsin I., Прикладная информатика 2026 № 1(121) С. 40–58

The results of the development of a software microservice embedded in atmospheric air quality monitoring systems to support the identification of industrial pollution sources are presented. The emission and subsequent spread of harmful substances in the lower layers of the atmosphere is dynamic and characterized by high uncertainty due to the specific features of technological ...

Added: April 23, 2026

Artificial Neural Networks and Machine Learning. ICANN 2025 International Workshops and Special Sessions: 34th International Conference on Artificial Neural Networks, Kaunas, Lithuania, September 9–12, 2025, Proceedings, Part V

Cham: Springer, 2025.

This book constitutes the refereed proceedings of 34th International Workshops which were held in conjunction with the 34th International Conference on Artificial Neural Networks and Machine Learning, ICANN 2025, held in Kaunas, Lithuania, September 9–12, 2025. The 20 full papers and 8 abstracts included in this workshop volume were carefully reviewed and selected from 42 submissions. ...

Added: September 29, 2025

Analysis of a Company Model in Conditions of Unstable Demand Using Reinforcement Learning Methods

Delev A., Semakov S., , in: 2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD).: IEEE, 2025. P. 318–322.

Profit is one of the most important economic indicators of a company’s performance, and for every company it is necessary to allocate resources in such a way as to obtain the maximum possible profit. The profit maximization problem is usually a dynamic optimization problem. This article discusses an approach to solving the production expansion problem ...

Added: August 25, 2025

Pseudo-collusion in a centralized algorithmic financial market

Pastushkov A., Boulatov A., Finance Research Letters 2025 Vol. 83 Article 107671

Recent studies have increasingly explored whether reinforcement learning algorithms can give rise to cooperative behavior that results in non-competitive pricing across various market settings. In financial markets, Cartea et al. (2022) show that market makers using multi-armed bandit (MAB) algorithms generally converge to competitive pricing in quote-driven over-the-counter (OTC) markets, barring some unlikely exceptions where ...

Added: June 19, 2025

Взрослость в эпоху нестабильности: вызовы и возможности

Бемлер Е. С., Социологические исследования 2025 № 7 С. 125–134

The article presents a comprehensive analysis of narratives about adulthood as an age identity and a period of life in the context of contemporary Russia. Particular attention is paid to the age group 40-60 years, which often finds itself on the margins of social research and policy. The study, conducted in a qualitative paradigm, includes ...

Added: May 8, 2025

The beer game bullwhip effect mitigation: a deep reinforcement learning approach

Rozhkov M., Alyamovskaya N., Zakhodiakin G., International Journal of Production Research 2025 Vol. 63 No. 18 P. 6630–6647

This article investigates the application of reinforcement learning (RL) methods to optimise a four-echelon linear supply chain model with stochastic demand. The proposed supply chain configuration is largely based on the production-distribution supply chain of the MIT Supply Chain Beer Game. We show that RL can significantly improve ordering efficiency and overall supply chain performance. ...

Added: March 24, 2025

Сомали: очередной виток напряженности

Хайруллин Т. Р., Коротаев А.В., Азия и Африка сегодня 2025 № 3 С. 14–21

The article examines another round of destabilization in Somalia caused by conflicts amid territorial and economic disputes over Somali ports, as well as contradictions between federal and regional authorities. It is shown that the deal concluded in early 2024 between Ethiopia and unrecognized Somaliland led to a serious round of tension that threatened to escalate ...

Added: March 19, 2025