MAGNet: Multi-Agent Graph Network for Deep Multi-Agent Reinforcement Learning

A. Shpilman; Malysheva A.; Kudenko D.

doi:10.1109/REDUNDANCY48165.2019.9003345

Publications

?

MAGNet: Multi-Agent Graph Network for Deep Multi-Agent Reinforcement Learning

P. 171–176.

Shpilman A., Malysheva A., Kudenko D.

Over recent years, deep reinforcement learning has shown strong successes in complex single-Agent tasks, and more recently this approach has also been applied to multi-Agent domains. In this paper, we propose a novel approach, called MAGNet, to multi-Agent reinforcement learning that utilizes a relevance graph representation of the environment obtained by a self-Attention mechanism, and a message-generation technique. We applied our MAGnet approach to the synthetic predator-prey multi-Agent environment and the Pommerman game and the results show that it significantly outperforms state-of-the-art MARL solutions, including Multi-Agent Deep Q-Networks (MADQN), Multi-Agent Deep Deterministic Policy Gradient (MADDPG), and QMIX.

Language: English

DOI

Text on another site

Keywords: машинное обучение machine learning Deep Reinforcement Learning глубокое обучение с подкреплением

In book

Proceedings of 2019 XVI International Symposium "Problems of Redundancy in Information and Control Systems" (REDUNDANCY)

IEEE, 2019.

Исследование точности метода градиентного бустинга со случайными поворотами.

Kitov V. V., Экономика, статистика и информатика. Вестник УМО 2016 № 4 С. 22–26

Gradient boosting method with random rotations is considered, where before training each base learner random rotation is applied to the feature space. The accuracy metric of the given method is estimated for a broad range of generated problems of binary classification. Obtained results are evaluated and recommendations given for application of this method. ...

Added: August 23, 2016

Voting: a machine learning approach

Clemens Puppe, Burka D., Szepesváry L. et al., / Series ISSN 2190-9806 "KIT Working paper in Economics". 2020. No. 145.

Voting rules can be assessed from quite different perspectives: the axiomatic, the pragmatic, in terms of computational or conceptual simplicity, susceptibility to manipulation, and many others aspects. In this paper, we take the machine learning perspective and ask how ‘well’ a few prominent voting rules can be learned by a neural network. To address this ...

Added: October 31, 2021

Referential Choice: Predictability and Its Limits

Kibrik A. A., Khudyakova M., Dobrov G. B. et al., Frontiers in Psychology 2016 Vol. 7 No. 1429 P. 1–21

We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, ...

Added: September 28, 2016

Analysis of Images, Social Networks and Texts Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014, Revised Selected Papers

Berlin: Springer, 2014.

This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...

Added: November 13, 2014

A Deep Learning Method Study of User Interest Classification

Malafeev A., Nikolaev K., , in: Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers. Communications in Computer and Information ScienceVol. 1086. Springer, 2020. P. 154–159.

In this paper, a deep learning method study is conducted to solve a new multiclass text classification problem, identifying user interests by text messages. We used an original dataset of almost 90 thousand forum text messages, labeled for ten interests. We experimented with different modern neural network architectures: recurrent and convolutional, as well as simpler ...

Added: November 7, 2019

Bimodal Cross-Validation Approach for Recommender Systems Diagnostics

Ignatov D. I., Poelmans J., , in: Diagnostic Test Approaches to Machine Learning and Commonsense Reasoning Systems. Hershey: IGI Global, 2012. Ch. 8 P. 185–195.

Recommender systems are becoming an inseparable part of many modern Internet web sites and web shops. The quality of recommendations made may significantly influence the browsing experience of the user and revenues made by web site owners. Developers can choose between a variety of recommender algorithms; unfortunately no general scheme exists for evaluation of their ...

Added: December 3, 2012

ПРОЕКТНОЕ ПРЕДЛОЖЕНИЕ: АВТОМАТИЗИРОВАННЫЙ ПОДХОД К РЕКОМЕНДАТЕЛЬНЫМ СИСТЕМАМ

Сендерович М. А., В кн.: Межвузовская научно-техническая конференция студентов, аспирантов и молодых специалистов им. Е.В. Арменского. М.: МИЭМ НИУ ВШЭ, 2019. С. 223–224.

Данная работа посвящена актуальной теме автоматизации в машинном обучении на примере создания универсальной рекомендательной системы. В работе исследуются различные типы рекомендательных систем, акцент делается на подходы коллаборативной фильтрации. Изучаются методы автоматизации машинного обучения, на основе которых будет разработана данная рекомендательная система. ...

Added: October 31, 2020

Faster variational inducing input Gaussian process classification

Izmailov P., Kropotov D., Journal of machine learning and data analysis 2017 Vol. 3 No. 1 P. 20–35

Background: Gaussian processes (GP) provide an elegant and effective approach to learning in kernel machines. This approach leads to a highly interpretable model and allows using the Bayesian framework for model adaptation and incorporating the prior knowledge about the problem. The GP framework is successfully applied to regression, classification, and dimensionality reduction problems. Unfortunately, the ...

Added: December 6, 2018

Классификация коннектомов на основе локальных метрик на стохастических матрицах

Ivanov A., Petrov D., В кн.: Сборник статей конференции "Информационные технологии и системы" (ИТиС'16). М.: ИППИ РАН, 2016. С. 509–516.

Многие графовые метрики основаны на предположении, что веса графа представляют расстояния между вершинами, которые мы можем складывать. Если считать эти метрики для стохастических матриц случайного блуждания на графе, то физический смысл вероятностей перехода между вершинами теряется (поскольку вероятности переходов перемножаются, а не складываются). Мы предлагаем решать эту проблему использованием отрицательных логарифмов весов ребер. Используя этот ...

Added: December 15, 2016

Epileptogenic high-frequency oscillations present larger amplitude both in mesial temporal and neocortical regions

Karpychev V., Balatskaya A., Utyashev N. et al., Frontiers in Human Neuroscience 2022 No. 16 Article 984306

High-frequency oscillations (HFO) are a promising biomarker for the identification of epileptogenic tissue. While HFO rates have been shown to predict seizure outcome, it is not yet clear whether their morphological features might improve this prediction. We validated HFO rates against seizure outcome and delineated the distribution of HFO morphological features. We collected stereo-EEG recordings ...

Added: October 1, 2022

Pupillometry and autonomic nervous system responses to cognitive load and false feedback: an unsupervised machine learning approach

Evgeniia I. Alshanskaia, Portnova G., Liaukovich K. et al., Frontiers in Neuroscience 2024 Vol. 18 Article 1445697

Objectives: Pupil dilation is controlled both by sympathetic and parasympathetic nervous system branches. We hypothesized that the dynamic of pupil size changes under cognitive load with additional false feedback can predict individual behavior along with heart rate variability (HRV) patterns and eye movements reflecting specific adaptability to cognitive stress. To test this, we employed an ...

Added: September 2, 2024

Классификация коннектомов на основе локальных метрик на стохастических матрицах

Иванов А. Р., Petrov D., В кн.: 40-я междисциплинарная школа-конференция "Информационные технологии и системы". [б.и.], 2016. С. 509–516.

Графовые метрики – популярный подход для клас- сификации структурных коннектомов, графов опи- сывающих структурные связи между различными участками мозга. В нашей работе мы предлагаем считать эти метрики на стохастических матри- цах случайных блужданий этих графов. При этом часть этих метрик мы предлагаем считать на логарифмах элементов матриц, чтобы сохранить физический смысл вероятностей перехода меж- ду ...

Added: December 9, 2016

Prediction of Drug-like Compounds Solubility in Supercritical Carbon Dioxide: A Comparative Study between Classical Density Functional Theory and Machine Learning Approaches

Makarov D., Nikolai N. Kalikin, Yury A. Budkov, Industrial & Engineering Chemistry Research 2024 Vol. 63 No. 3 P. 1589–1603

Supercritical carbon dioxide (scCO2) plays an essential role in various technological procedures, making the solubility of drugs in scCO2 a crucial aspect of the drug formulation process. This study focuses on utilizing theoretical approaches to predict the solubility of drug-like compounds in scCO2 in order to select the optimum parameters for subsequent experimental procedures. Several machine ...

Added: January 16, 2024

Constructing a Lexical Resource of Russian Derivational Morphology

Kyjánek L., Lyashevskaya O., Nedoluzhko A. et al., , in: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022). Marseille: European Language Resources Association (ELRA), 2022. Ch. 298 P. 2788–2797.

Words of any language are to some extent related thought the ways they are formed. For instance, the verb ‘exempl-ify’ and the noun ‘example-s’ are both based on the word ‘example’, but the verb is derived from it, while the noun is inflected. In Natural Language Processing of Russian, the inflection is satisfactorily processed; however, ...

Added: February 22, 2023

Application of the Method of Multivariate Multi-stage Forecasting Based on the LSTM Deep Learning Model for Bitcoin Price Time Series

Natalia Sizykh, Said Dandamaev, Dmitry Sizykh, , in: 16th International Conference Management of large-scale system development (MLSD). IEEE, 2023. P. 1–5.

Forecasting data and research on cryptocurrency price forecasting methods are increasing in importance. So far, methods based on LSTM deep learning architecture have shown the best results in forecasting cryptocurrency prices. In order to improve the accuracy of forecasting data, this paper investigates the application of a multivariate multistep forecasting method based on the LSTM ...

Added: December 22, 2023

Исследовательский проект как инструмент обучения методам анализа текста: предсказание класса поста в социальной сети

Suvorova A., Смирнова К. Р., Будин Е. А. et al., Компьютерные инструменты в образовании 2018 № 3 С. 49–64

The article describes a student research project on predicting the class of a post on a social network based on its textual content. The features of the project are discussed as an integral part of the trajectory of teaching data analysis methods, including text analysis methods and tools that are often not included in machine ...

Added: January 28, 2019

Использование метода главных компонент для анализа надежности цепей поставок

Kuznetsov V. O., Логистика и управление цепями поставок 2018 № 4 (87) С. 27–33

One of the options for a more flexible approach to analyzing the reliability of supply chains is the principal component analysis (PCA). With a large number of variables describing supply chain, it is a difficult task to analyze the structure of variables in two-dimensional space. Within the analysis of the variables dependencies PCA allows to ...

Added: November 29, 2018

Что в профиле тебе моем: Данные «ВКонтакте» как инструмент изучения интересов современных подростков

Polivanova K. N., Smirnov I., Вопросы образования 2017 № 2 С. 134–152

Children’s interests play a key role in their psychological development. However, research in this field is associated with serious methodological problems, as it has traditionally used questionnaire surveys that cannot adequately describe the diverse and dynamic world of interests of a developing person. The article suggests using the information on VKontakte communities followed by teenagers, ...

Added: July 21, 2017

Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings

Springer, 2021.

This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...

Added: July 10, 2021

Predictive Analytics Approach for Steel Billets Quality Control System

Belov A. V., Ekaterina A. Melekhova, Vorontsova T., , in: 2022 International Conference on Quality Management, Transport and Information Security, Information Technologies (IT&QM&IS). St. Petersburg: IEEE, 2022. P. 219–223.

The paper deals with the problem of improving the quality of metal products. Nowadays destructive methods of quality control of the steel billets prevail at metallurgical enterprises. This approach to assessing the quality of the steel billets is wasteful, which increases its cost. One of the ways to reduce the cost of production of metal ...

Added: January 28, 2023

Предсказания, большие данные и новые измерители: о возможности технологий компьютерной лингвистики в теоретических лингвистических исследованиях

Bonch-Osmolovskaya A. A., Вопросы языкознания 2016 № 2 С. 100–120

Статья посвящена обзору работ последних лет, в которых теоретическая исследовательская задача решается с помощью методов или инструментов, используемых в компьютерной лингвистике. В обзоре проводится подробный анализ того, как именно с помощью применения того или иного инструмента или метода можно получить новые знания о природе языка. В частности, выделяются два основных направления, развитие которых в рамках ...

Added: April 14, 2015

Supernova search with active learning in ZTF DR3

Pruzhinskaya M., Ishida E. O., Novinskaya A. et al., Astronomy and Astrophysics 2023 Vol. 672 Article A111

Context. We provide the first results from the complete SNAD adaptive learning pipeline in the context of a broad scope of data from large-scale astronomical surveys. Aims. The main goal of this work is to explore the potential of adaptive learning techniques in application to big data sets. Methods. Our SNAD team used Active Anomaly Discovery (AAD) as ...

Added: June 6, 2023

МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ

Buzmakov A. V., В кн.: МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ. СПб.: Федеральное государственное автономное образовательное учреждение высшего образования "Санкт-Петербургский политехнический университет Петра Великого", 2020. С. 284–333.

In many practical tasks it is needed to estimate an effect of treatment on individual level. For example, in medicine it is essential to determine the patients that would benefit from a certain medicament. In marketing, knowing the persons that are likely to buy a new product would reduce the amount of spam. In this ...

Added: December 7, 2021

Link Prediction Regression for Weighted Co-authorship Networks

Gerasimova O., Makarov I., , in: Advances in Computational Intelligence. IWANN 2019. Berlin: Springer, 2019. P. 667–677.

In this paper, we study the problem of predicting quantity of collaborations in co-authorship network. We formulated our task in terms of link prediction problem on weighted co-authorship network, formed by authors writing papers in co-authorship represented by edges between authors in the network. Our task is formulated as regression for edge weights, for which ...

Added: July 29, 2019