• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Scale-free memory model for multiagent reinforcement learning. Mean field approximation and rock-paper-scissors dynamics
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 25, 2026
HSE Scientists Train Neural Network to 'Hear' Faults in Electric Motors
Researchers at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a new method—the Signature-Guided Data Augmentation (SGDA) framework—that achieves 99% accuracy in motor fault detection and 86% accuracy in fault classification. The application of this approach can reduce industrial equipment repair costs, minimise downtime, and improve production safety. The study results have been published in Engineering Applications of Artificial Intelligence.
May 25, 2026
'The Humanities Serve as a Conscience'
Maria Mizernaia studies Soviet literature and the history of book publishing. In this interview for the HSE Young Scientists project, she discusses plans to publish a novel about besieged Leningrad, AI-provoked reflections on what it means to be human, and how novels can help satisfy our dopamine hunger.
May 25, 2026
Is It Possible to Predict a Citys Life Based on the Shape of Its Neighbourhoods?
Is it possible to predict, based on the configuration of streets and buildings, where a café will open or where traffic congestion will occur? Participants in the Spatial Analysis and Modelling of Urban Processes research and study group use open data and machine learning to identify universal patterns. Alexander Sheludkov and Eduard Somov discuss the purpose of comparing cities, the need for new forms of urban statistics, and how open data is transforming approaches to urban studies.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Scale-free memory model for multiagent reinforcement learning. Mean field approximation and rock-paper-scissors dynamics

The European Physical Journal B. 2010. Vol. 76. No. 1. P. 69–85.
Lubashevsky I., Kanemoto S.

A continuous time model for multiagent systems governed by reinforcement learning with scale-free memory is developed. The agents are assumed to act independently of one another in optimizing their choice of possible actions via trial-and-error search. To gain awareness about the action value the agents accumulate in their memory the rewards obtained from taking a specific action at each moment of time. The contribution of the rewards in the past to the agent current perception of action value is described by an integral operator with a power-law kernel. Finally a fractional differential equation governing the system dynamics is obtained. The agents are considered to interact with one another implicitly via the reward of one agent depending on the choice of the other agents. The pairwise interaction model is adopted to describe this effect. As a specific example of systems with non-transitive interactions, a two agent and three agent systems of the rock-paper-scissors type are analyzed in detail, including the stability analysis and numerical simulation. Scale-free memory is demonstrated to cause complex dynamics of the systems at hand. In particular, it is shown that there can be simultaneously two modes of the system instability undergoing subcritical and supercritical bifurcation, with the latter one exhibiting anomalous oscillations with the amplitude and period growing with time. Besides, the instability onset via this supercritical mode may be regarded as “altruism self-organization”. For the three agent system the instability dynamics is found to be rather irregular and can be composed of alternate fragments of oscillations different in their properties.

Research target: Physics Psychology Mathematics
Language: English
Full text
DOI
Text on another site
Keywords: reinforcement learninginstabilityrock-paper-scissors gamescale-free memory
Similar publications
Численное моделирование полевой эмиссии из полупроводникового катода в вакуум
Borisov V., Danilov V., Электросвязь 2025 № 12 С. 73–84
Представлены результаты математического моделирования процесса полевой эмиссии из катода малых размеров – одного из основных физических процессов, обеспечивающих работы многих электронных устройств, в частности FED-дисплеев (устройства, работающие на принципе полевой эмиссии), кантилеверов и т.д. Дается краткий обзор текущих результатов в области исследования, обосновывается актуальность задачи, приводятся примеры наиболее вероятного использования результатов решения задачи. Обсуждаются физические ...
Added: May 29, 2026
Методическая концепция развития навыков саморегуляции у студентов-музыкантов в классе вокала
Ван Г., Торопова А. В., Музыкальное искусство и образование 2025 Т. 13 № 4 С. 73–91
Contemporary vocal pedagogy faces a complex set of methodological challenges that encompass not only the search for ways to improve technical vocal skills for the realization of artistic vision, but also the development of self-control and self-regulation in future singers, as well as the entire creative process. A successful artist cannot function without the ability ...
Added: May 29, 2026
ИССЛЕДОВАНИЕ АССОЦИАЦИИ ГЕНЕТИЧЕСКИХ ВАРИАНТОВ С РАЗВИТИЕМ МУЗЫКАЛЬНЫХ СПОСОБНОСТЕЙ ЧЕЛОВЕКА
Kazantseva A. V., A.V. Toropova, Khusnutdinova E. K. et al., ВАВИЛОВСКИЙ ЖУРНАЛ ГЕНЕТИКИ И СЕЛЕКЦИИ, Федеральный исследовательский центр Институт цитологии и генетики Сибирского отделения Российской академии наук» (ИЦиГ СО РАН) (Новосибирск) 2025 Vol. 30 No. 3 P. 470–481
The development of musical abilities, including absolute pitch, musical memory, rhythm sense, and musicality, at a high degree is determined by a hereditary component (up to 68 %). The studies implementing a genome-wide linkage and association approach to musical aptitude have revealed more than 100 genetic loci. This spectrum is comprised of the genes encoding ...
Added: May 29, 2026
Electrical networks and data analysis in phylogenetics
Gorbounov Vassily, Kazakov A., Data Analytics and Topology 2025 Vol. 1 No. 1 P. 33–45
A classic problem in data analysis is studying the systems of subsets defined by either a similarity or a dissimilarity function on X which is either observed directly or derived from a data set. For an electrical network there are two functions on the set of the nodes defined by the resistance matrix and the response ...
Added: May 28, 2026
Почему растущие доходы не делают людей счастливее: эмоциональное объяснение парадокса Истерлина
Vorchik A., Вопросы экономики 2026
This work is devoted to a theoretical explanation of the Easterlin paradox, according to which long-term economic growth does not make average level of people's happiness increasing. By happiness, we mean the intensity of emotions people experience while comparing their new income with its expected value, or the target income with its original value. In the first case, ...
Added: May 28, 2026
Электростатически управляемый контроль диссипации в двумерном наноэлектромеханическом резонаторе через напряжение-амплитудный антагонизм с рекордной 928% подстройкой добротности.
Arutyunov K., JIANG1 Q., FANG J. et al., Science China Information Sciences 2026 Vol. 69 No. 6 P. 1–11
Resonators based on nanoelectromechanical systems (NEMS) using two-dimensional (2D) materials with high-quality factors and excellent electrical control are critical for tunable coherent phonon dynamics, resonant sensors and wireless communications. However, their performance is fundamentally limited by the lack of a unified framework governing energy dissipation mechanisms and their electrical tunability. Here, we synergistically modulate both ...
Added: May 28, 2026
Enhanced Terahertz Thermoelectricity Via Engineered Van Hove Singularities and Nernst Effect in Moiré Superlattices
Elesin L., Shilov A., Jana S. et al., Advanced Functional Materials 2026 P. 1–10
Thermoelectric materials, long explored for energy harvesting and thermal sensing, convert heat directly into electrical signals. Extending their application to the terahertz (THz) frequency range opens opportunities for low-noise, bias-free THz detection, yet conventional thermoelectrics lack the sensitivity required for practical devices. Thermoelectric coefficients can be strongly enhanced near van Hove singularities (VHS), though these ...
Added: May 28, 2026
Sweet-taste liking is associated with preference for less risky and immediate rewards in economic decision-making
Давидович А. С., Shestakova A., Arzumanyan N. et al., Frontiers in Psychology 2026 Vol. 17 - 2026 P. 1–18
Background:  Delay discounting refers to the tendency to choose sooner, smaller rewards over larger, later rewards. Many previous studies link this tendency positively to reward sensitivity, yet the specific mechanisms behind this association remain poorly understood. Reward sensitivity may relate to delay discounting through at least three possible pathways: increased sensitivity to reward size, increased sensitivity ...
Added: May 27, 2026
Non-linear in-band interference cancellation on base of conjugate gradients method
Degtyarev A., Bakhurin S., Yudin N., DSPA 2026 P. 1–6
This paper investigates one possible solution to the problem of self-interference cancellation (SIC) arising in the design of in-band full-duplex (IBFD) communication systems. Self-interference cancellation is performed in the digital domain using multilayer nonlinear models adapted via gradient-based optimization. The presence of local minima and saddle points during the adaptation of multilayer models limits the ...
Added: May 26, 2026
New Numerical Invariants of an Unfolding of a Polycycle “Tears of the Heart”
Ilyashenko Y., Shilin I., Stanislav Minkov, Russian Journal of Mathematical Physics 2026 Vol. 33 No. 1 P. 89–106
In this paper, new numerical invariants of structurally unstable vector fields in the plane are found. One of the main tools is an improved asymptotics of sparkling saddle connections that occur when a separatrix loop of a hyperbolic saddle breaks. Another main tool is a new topological invariant of two arithmetic progressions, both perturbed and unperturbed, on the ...
Added: May 26, 2026
Heritability of Functional Literacy: Evidence from a Classical Twin Design
Kolachev N., Kovaleva G., Behavior Genetics 2026
Functional literacy—the ability to apply reading, mathematical, and scientific knowledge in authentic contexts as operationalized by the PISA framework—is a key predictor of educational attainment, labour-market outcomes, and economic growth. Despite extensive behavioral-genetic research on cognitive ability, the heritability of competency-based literacy measures remains largely unexamined, particularly outside Western populations. The present study addresses this ...
Added: May 26, 2026
ADDITIVE AUTOMORPHISMS OF REGULAR MATRIX GRAPH
Gusev I., Maksaev A., Promyslov V., Journal of Mathematical Sciences 2025 Vol. 299 No. 6
The regular graph of the space of n × m matrices over a field F is defined as the undirected graph whose vertices are matrices of rank min(n, m), and distinct matrices A and B are connected by an edge if and only if rk(A + B) < min(n, m). In this paper, for |F| ...
Added: May 25, 2026
How virtual urban green spaces influence stress and risk-taking
Dorri Sedeh S., Kosonogov V., Kerimova N. et al., Frontiers in Psychology 2026 Vol. 17 Article 1710257
Introduction:  As cities continue to grow, access to natural environments is becoming more limited, contributing to increased stress levels in urban populations. Panoramic 360° videos provide a creative and scalable means of simulating natural environments, potentially reducing stress in city residents under controlled settings. Here, we examined whether short immersive experiences in different urban environments support ...
Added: May 25, 2026
Novelty, Category and Orientation Tuning for Printed Characters: A Magnetoencephalography Study with Fast Periodic Visual Stimulation
Kochetkova Ekaterina, Kostanian D., Martynova O. et al., Brain Topography 2026 Vol. 39 No. 4 Article 51
Letter recognition is assumed to involve several levels of analysis, including coarse tuning for category and novelty and more fine tuning for specific features, related to letter orientation. We employed an oddball fast periodic visual stimulation (FPVS) paradigm with magnetoencephalography (Elekta VectorView, 306 sensors) to study neural discrimination responses in the source space. Using contrasts ...
Added: May 24, 2026
Ising models on the hydrogen peroxide and other lattices
Qian X., Deng Y., Shchur L. et al., Physica A: Statistical Mechanics and its Applications 2026 Vol. 696 P. 1–13
We perform a Monte Carlo analysis of the Ising model on many three-dimensional lattices. By means of finite-size scaling we obtain the critical points and determine the scaling dimensions. As expected, the critical exponents agree with the three-dimensional Ising universality class for all models. The irrelevant field, as revealed by the correction-to-scaling amplitudes, appears to ...
Added: May 24, 2026
Coping with AI errors with provable guarantees
Tyukin I., Tyukina T., van Helden D. P. et al., Information Sciences 2024 Vol. 678 Article 120856
AI errors pose a significant challenge, hindering real-world applications. This work introduces a novel approach to cope with AI errors using weakly supervised error correctors that guarantee a specific level of error reduction. Our correctors have low computational cost and can be used to decide whether to abstain from making an unsafe classification. We provide ...
Added: May 23, 2026
Overcoming the Curse of Dimensionality with Synolitic AI
Zaikin A., Sviridov I., Sosedka A. et al., Technologies 2026 Vol. 14 No. 2 Article 84
High-dimensional tabular data are common in biomedical and clinical research, yet conventional machine learning methods often struggle in such settings due to data scarcity, feature redundancy, and limited generalization. In this study, we systematically evaluate Synolitic Graph Neural Networks (SGNNs), a framework that transforms high-dimensional samples into sample-specific graphs by training ensembles of low-dimensional pairwise ...
Added: May 23, 2026
Разработка микросервиса ADP для идентификации источников выбросов на основе машинного обучения с подкреплением
Kychkin A., Chernitsin I., Прикладная информатика 2026 № 1(121) С. 40–58
The results of the development of a software microservice embedded in atmospheric air quality monitoring systems to support the identification of industrial pollution sources are presented. The emission and subsequent spread of harmful substances in the lower layers of the atmosphere is dynamic and characterized by high uncertainty due to the specific features of technological ...
Added: April 23, 2026
Artificial Neural Networks and Machine Learning. ICANN 2025 International Workshops and Special Sessions: 34th International Conference on Artificial Neural Networks, Kaunas, Lithuania, September 9–12, 2025, Proceedings, Part V
Cham: Springer, 2025.
This book constitutes the refereed proceedings of 34th International Workshops which were held in conjunction with the 34th International Conference on Artificial Neural Networks and Machine Learning, ICANN 2025, held in Kaunas, Lithuania, September 9–12, 2025.   The 20 full papers and 8 abstracts included in this workshop volume were carefully reviewed and selected from 42 submissions. ...
Added: September 29, 2025
Analysis of a Company Model in Conditions of Unstable Demand Using Reinforcement Learning Methods
Delev A., Semakov S., , in: 2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD).: IEEE, 2025. P. 318–322.
Profit is one of the most important economic indicators of a company’s performance, and for every company it is necessary to allocate resources in such a way as to obtain the maximum possible profit. The profit maximization problem is usually a dynamic optimization problem. This article discusses an approach to solving the production expansion problem ...
Added: August 25, 2025
Pseudo-collusion in a centralized algorithmic financial market
Pastushkov A., Boulatov A., Finance Research Letters 2025 Vol. 83 Article 107671
Recent studies have increasingly explored whether reinforcement learning algorithms can give rise to cooperative behavior that results in non-competitive pricing across various market settings. In financial markets, Cartea et al. (2022) show that market makers using multi-armed bandit (MAB) algorithms generally converge to competitive pricing in quote-driven over-the-counter (OTC) markets, barring some unlikely exceptions where ...
Added: June 19, 2025
Взрослость в эпоху нестабильности: вызовы и возможности
Бемлер Е. С., Социологические исследования 2025 № 7 С. 125–134
The article presents a comprehensive analysis of narratives about adulthood as an age identity and a period of life in the context of contemporary Russia. Particular attention is paid to the age group 40-60 years, which often finds itself on the margins of social research and policy. The study, conducted in a qualitative paradigm, includes ...
Added: May 8, 2025
The beer game bullwhip effect mitigation: a deep reinforcement learning approach
Rozhkov M., Alyamovskaya N., Zakhodiakin G., International Journal of Production Research 2025 Vol. 63 No. 18 P. 6630–6647
This article investigates the application of reinforcement learning (RL) methods to optimise a four-echelon linear supply chain model with stochastic demand. The proposed supply chain configuration is largely based on the production-distribution supply chain of the MIT Supply Chain Beer Game. We show that RL can significantly improve ordering efficiency and overall supply chain performance. ...
Added: March 24, 2025
Сомали: очередной виток напряженности
Хайруллин Т. Р., Коротаев А.В., Азия и Африка сегодня 2025 № 3 С. 14–21
The article examines another round of destabilization in Somalia caused by conflicts amid territorial and economic disputes over Somali ports, as well as contradictions between federal and regional authorities. It is shown that the deal concluded in early 2024 between Ethiopia and unrecognized Somaliland led to a serious round of tension that threatened to escalate ...
Added: March 19, 2025
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit