Scale-free memory model for multiagent reinforcement learning. Mean field approximation and rock-paper-scissors dynamics

?

Scale-free memory model for multiagent reinforcement learning. Mean field approximation and rock-paper-scissors dynamics

The European Physical Journal B. 2010. Vol. 76. No. 1. P. 69–85.

Lubashevsky I., Kanemoto S.

A continuous time model for multiagent systems governed by reinforcement learning with scale-free memory is developed. The agents are assumed to act independently of one another in optimizing their choice of possible actions via trial-and-error search. To gain awareness about the action value the agents accumulate in their memory the rewards obtained from taking a specific action at each moment of time. The contribution of the rewards in the past to the agent current perception of action value is described by an integral operator with a power-law kernel. Finally a fractional differential equation governing the system dynamics is obtained. The agents are considered to interact with one another implicitly via the reward of one agent depending on the choice of the other agents. The pairwise interaction model is adopted to describe this effect. As a specific example of systems with non-transitive interactions, a two agent and three agent systems of the rock-paper-scissors type are analyzed in detail, including the stability analysis and numerical simulation. Scale-free memory is demonstrated to cause complex dynamics of the systems at hand. In particular, it is shown that there can be simultaneously two modes of the system instability undergoing subcritical and supercritical bifurcation, with the latter one exhibiting anomalous oscillations with the amplitude and period growing with time. Besides, the instability onset via this supercritical mode may be regarded as “altruism self-organization”. For the three agent system the instability dynamics is found to be rather irregular and can be composed of alternate fragments of oscillations different in their properties.

Research target: Physics Psychology Mathematics

Unstable Dynamics of Adaptation in Unknown Environment due to Novelty Seeking

Lubashevsky I., Zgonnikov A., Advances in Complex Systems 2014 Vol. 17 No. 3-4 Article 1450013

Learning and adaptation play great role in emergent socio-economic phenomena. Complex dynamics has been previously found in the systems of multiple learning agents interacting via a simple game. Meanwhile, the single agent adaptation is considered trivially stable. We advocate the idea that adopting a more complex model of the individual behavior may result in a ...

Added: November 6, 2021

Self-Organized Criticality and Cognitive Control Reasoned by Effort Minimization

Lubashevskiy V., Lubashevsky I., Systems 2023 Vol. 11 No. 6 Article 271

We put forward a novel model for self-organized criticality in the dynamics of systems controlled by human actions. The model is based on two premises. First, without human control, the system in issue undergoes supercritical instability. Second, the subject’s actions are aimed at preventing the occurrence of critical fluctuations when the risk of control failure ...

Added: June 26, 2023

Время в комиксах

Каллендер К., Эдней Р., М.: Эксмо, 2019.

What is time and how to manage it? Does it really flow at different speeds? Are time travel real? The issues associated with this phenomenon are the most complex and at the same time the most interesting. After reading this comic, you will learn: • What is the fourth dimension? • What prevents an egg ...

Added: November 1, 2019

Physics of the Human Temporality: Complex Present

Lubashevsky I., Plavinska N., Springer, 2021.

This book presents a novel account of the human temporal dimension called the “human temporality” and develops a special mathematical formalism for describing such an object as the human mind. One of the characteristic features of the human mind is its temporal extent. For objects of physical reality, only the present exists, which may be ...

Added: October 22, 2021

Dynamical Traps Caused by Fuzzy Rationality as a New Emergence Mechanism

Lubashevsky I., Advances in Complex Systems 2012 Vol. 15 No. 8 Article 1250045

Tools Share Abstract A new emergence mechanism related to the human fuzzy rationality is considered. It assumes that individuals (operators) governing the dynamics of a certain system try to follow an optimal strategy in controlling its motion but fail to do this perfectly because similar strategies are indistinguishable for them. The main attention is focused on the systems ...

Added: November 6, 2021

Albert Galeev: The Problem of Metastability and Explosive Reconnection

Zelenyi L. M., Malova H. V., V. Yu. Popov et al., Plasma Physics Reports 2021 Vol. 47 P. 857–877

−Albert Abubakirovich Galeev is a Soviet and Russian expert in plasma physics who actively contributed to fusion research. In the early 1970s, he became a head of department at the Space Research Institute of the Academy of Sciences of USSR and began devoting most of his time to the problems of the physics of space ...

Added: October 2, 2021

Новая российская энциклопедия

М.: Энциклопедия, Инфра-М, 2017.

The New Russian Encyclopedia is a fundamental reference publication in 18 volumes that characterizes nature, population, economy, history, science, art, technology and other important aspects. Contains about 60,000 articles, about 30,000 biographies, about 15,000 color illustrations, maps, charts, diagrams, tables. Leaves since 2003. ...

Added: October 29, 2018

Fast reconfiguration of high frequency brain networks in response to surprising changes in auditory input

Nicol R., Chapman S., Vertes P. et al., Journal of Neurophysiology (США) 2012 Vol. 107 No. 5 P. 1421–1430

How do human brain networks react to dynamic changes in the sensory environment? We measured rapid changes in brain network organization in response to brief, discrete, salient auditory stimuli. We estimated network topology and distance parameters in the immediate central response period, <1 s following auditory presentation of standard tones interspersed with occasional deviant tones ...

Added: October 23, 2014

Proceedings of the 34th Annual Meeting of the International Society for Psychophysics

Leuphana University Lüneburg, 2019.

Added: November 5, 2021

Новая российская энциклопедия

М.: Энциклопедия, Инфра-М, 2018.

Added: October 26, 2018

Вестник молодых ученых ПГНИУ [Электронный ресурс]: сб. науч. тр.

Пермь: Пермский государственный национальный исследовательский университет, 2014.

В сборнике собраны статьи студентов и молодых ученых ПГНИУ, отражающие результаты научных исследований, выполняемых на базе университета. Статьи посвящены актуальным проблемам изучения естественных и гуманитарных наук. Сборник издается по итогам конкурса научно-исследовательских работ студентов ПГНИУ (апрель – ноябрь 2014 г.), в котором принимали участие все факультеты университета. ...

Added: December 30, 2014

Магнитоэнцефалография – новейший метод функционального картирования мозга человека

Shestakova A., Буторина А., Ossadtchi A. et al., Экспериментальная психология 2012 Т. 5 № 2 С. 119–134

Статья посвящена методу магнитоэнцефалографии (МЭГ) и его применению в когнитивных исследованиях. МЭГ – одна из современных технологий нейроимиджинга. Данный метод обладает уникальными характеристиками, позволяющими с высокой точностью локализовать источники активности нейронных популяций коры головного мозга человека в пространстве и времени. Наряду с исследованиями базовых сенсорных и моторных функций мозга, МЭГ является незаменимым инструментом исследования динамики ...

Added: October 23, 2014

Stochastic theory of the classical molecular dynamics method

Norman G., Stegailov V., Mathematical Models and Computer Simulations 2013 Vol. 5 No. 4 P. 305–333

The work is devoted to fundamental aspects of the classical molecular dynamics method, which was developed half a century ago as a means of solving computational problems in statistical physics and has now become one of the most important numerical methods in the theory of condensed state. At the same time, the molecular dynamics method ...

Added: March 19, 2014

Are mathematicians, physicists and biologists irrational? Mathematical and natural science studies vs. the transitivity axiom

Poddiakov A., / Series SSRN Working Paper Series "SSRN Working Paper Series". 2023.

An important and interesting phenomenon of the last few decades is the increasing number of mathematical studies of so-called intransitive dice with non-standard numbers on their faces and the popularization of them. The dice beat one another like in the rock-paper-scissors game. They violate the transitivity law (or axiom): “if it were true that whenever ...

Added: February 23, 2023

Handbook of Applications of Chaos Theory

CRC Press, 2016.

In addition to explaining and modeling unexplored phenomena in nature and society, chaos uses vital parts of nonlinear dynamical systems theory and established chaotic theory to open new frontiers and fields of study. Handbook of Applications of Chaos Theory covers the main parts of chaos theory along with various applications to diverse areas. Expert contributors ...

Added: October 26, 2021

Новая российская энциклопедия

М.: Энциклопедия, Инфра-М, 2018.

Added: October 26, 2018

Complexity of human response delay in intermittent control: The case of virtual stick balancing

Lubashevsky I., Suzuki T., Zgonnikov A., / Series arXiv "Neurons and Cognition (q-bio.NC)". 2018. No. 1808.05002.

Response delay is an inherent and essential part of human actions. In the context of human balance control, the response delay is traditionally modeled using the formalism of delay-differential equations, which adopts the approximation of fixed delay. However, experimental studies revealing substantial variability, adaptive anticipation, and non-stationary dynamics of response delay provide evidence against this approximation. In this ...

Added: November 3, 2021

Новая российская энциклопедия

М.: Энциклопедия, Инфра-М, 2017.

Added: October 31, 2018

Новая российская энциклопедия

М.: Энциклопедия, Инфра-М, 2018.

Added: October 28, 2018

Сборник Тезисов 2‐й Всероссийской интернет‐конференции «Грани науки 2013»

Каз.: СМУиС, 2013.

Вторая Всероссийская молодежная научная Интернет-конференция «Грани науки» проводится Казанским (Приволжским) федеральным университетом, Советом молодых ученых и специалистов города Казани (http://kznscience.ru) и Комитетом по делам детей и молодежи Исполкома Казани. ...

Added: July 7, 2016

XLIX итоговая студенческая научная конференция Удмуртского государственного университета: Материалы всероссийской конференции, (апрель 2021 г.)

Ижевск: Удмуртский университет, 2021.

В сборнике опубликованы материалы докладов XLIX Итоговой студенческой научной конференции (апрель 2021 г.). В конференции приняли участие студенты учебных институтов и филиалов УдГУ. Представ- лены материалы по гуманитарным, естественным и техническим специальностям: история, филология, пси- хология, педагогика, биология, химия, физика, математика, экономика, энергетика и др. Сборник предназначен для преподавателей и студентов вузов. ...

Added: August 14, 2023

To react or not to react? Intrinsic stochasticity of human control in virtual stick balancing

Lubashevsky I., Zgonnikov A., Kanemoto S. et al., Journal of the Royal Society Interface 2014 Vol. 11 No. 99 Article 20140636

Understanding how humans control unstable systems is central to many research problems, with applications ranging from quiet standing to aircraft landing. Increasingly, much evidence appears in favour of event-driven control hypothesis: human operators only start actively controlling the system when the discrepancy between the current and desired system states becomes large enough. The event-driven models ...

Added: November 6, 2021

Are Mathematicians, Physicists and Biologists Irrational? Intransitivity Studies vs. the Transitivity Axiom

Poddiakov A., Human Arenas. An Interdisciplinary Journal of Psychology, Culture, and Meaning 2024

The status of the axioms of transitivity of dominance (“if x dominates y and y dominates z, then x dominates z” and “if a person prefers A to B and B to C, then that person should prefer A to C”) as key components of rationality is discussed. The discussion is conducted in the context ...

Added: September 18, 2024

Новая российская энциклопедия

М.: Энциклопедия, Инфра-М, 2017.

Added: October 29, 2018