• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 20, 2026
HSE University Opens First Representative Office of Satellite Laboratory in Brazil
HSE University-St Petersburg opened a representative office of the Satellite Laboratory on Social Entrepreneurship at the University of Campinas in Brazil. The platform is going to unite research and educational projects in the spheres of sustainable development, communications and social innovations.
May 18, 2026
The 'Second Shift' Is Not Why Women Avoid News
Women are more likely than men to avoid political and economic news, but the reasons for this behaviour are linked less to structural inequality or family-related stress than to personal attitudes and the emotional perception of news content. This conclusion was reached by HSE researchers after analysing data from a large-scale survey of more than 10,000 residents across 61 regions of Russia. The study findings have been published in Woman in Russian Society.
May 15, 2026
Preserving Rationality in a Period of Turbulence
The HSE International Laboratory for Logic, Linguistics and Formal Philosophy studies logic and rationality in a transformed world characterised by a diversity of logical systems and rational agents. The laboratory supports and develops academic ties with Russian and international partners. The HSE News Service spoke with the head of the laboratory, Prof. Elena Dragalina-Chernaya, about its work.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned

.
Nikulin A. M., Belousov Y., Svidchenko O., Shpilman A.

Reinforcement learning competitions advance the field by providing appropriate scope and support to develop solutions toward a specific problem. To promote the development of more broadly applicable methods, organizers need to enforce the use of general techniques, the use of sample-efficient methods, and the reproducibility of the results. While beneficial for the research community, these restrictions come at a cost—increased difficulty. If the barrier for entry is too high, many potential participants are demoralized. With this in mind, we hosted the third edition of the MineRL ObtainDiamond competition, MineRL Diamond 2021, with a separate track in which we permitted any solution to promote the participation of newcomers. With this track and more extensive tutorials and support, we saw an increased number of submissions. The participants of this easier track were able to obtain a diamond, and the participants of the harder track progressed the generalizable solutions in the same task.

Language: English
Full text
DOI
Text on another site
Keywords: Deep Reinforcement LearningImitation LearningSample-efficient learningDeep Learning

In book

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track
PMLR, 2022.
Similar publications
FiMMIA: scaling semantic perturbation-based membership inference across modalities
Kudrjashov S., Fenogenova A., Емельянов А., , in: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)Vol. 3: System Demonstrations.: Association for Computational Linguistics, 2026. Ch. 11 P. 139–153.
Membership Inference Attacks (MIAs) aim to determine whether a specific data point was included in the training set of a target model. Although there are have been numerous methods developed for detecting data contamination in large language models (LLMs), their performance on multimodal LLMs (MLLMs) falls short due to the instabilities introduced through multimodal component ...
Added: May 19, 2026
Knowledge Discovery, Knowledge Engineering and Knowledge Management: 15th International Joint Conference, IC3K 2023, Rome, Italy, November 13-15, 2023, Revised Selected Papers
Rome: Springer, 2025.
This book constitutes the refereed proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2023, held in Rome, Italy, during November 13-15, 2023. The 9 full papers and 8 short papers included in this book were carefully reviewed and selected from 166 submissions. They were organized in topical sections ...
Added: May 2, 2025
Weighted boxes fusion: Ensembling boxes from different object detection models
Соловьёв Р. А., Габрушева Т., Ванг В., Image and Vision Computing 2021 Vol. 107 P. 104117–0
We present a novel method for combining predictions in ensembles of different object detection models: weighted boxes fusion. This method significantly improves the quality of the fused predicted rectangles for an ensemble. We tested the method on several datasets and evaluated it in the context of the Open Images and COCO Object Detection challenges. It helped ...
Added: January 15, 2025
Fast Parametric Curve Matching (FPCM) Filters for Deep Learning-Based Automatic Spike Detection
Белокопытов А. С., Kleeva D., Ossadtchi A., , in: Advances in Neural Computation, Machine Learning, and Cognitive Research VIII.: Springer, 2024. P. 317–326.
Added: October 23, 2024
Soft Margin Spectral Normalization for GANs
Rogachev A., Ratnikov F., Computing and Software for Big Science 2024 Vol. 8 No. 1 Article 12
In this paper, we explore the use of Generative Adversarial Networks (GANs) to speed up the simulation process while ensuring that the generated results are consistent in terms of physics metrics. Our main focus is the application of spectral normalization for GANs to generate electromagnetic calorimeter (ECAL) response data, which is a crucial component of ...
Added: July 2, 2024
Real-time detection of hogweed: UAV platform empowered by deep learning
Menshchikov A., Shadrin D., Prutyanov V. et al., IEEE Transactions on Computers 2021 Vol. 70 No. 8 P. 1175–1188
The Hogweed of Sosnowskyi (lat. Heracleum sosnowskyi) is poisonous for humans, dangerous for farming crops, and local ecosystems. This plant is fast-growing and has already spread all over Eurasia: from Germany to the Siberian part of Russia, and its distribution expands year-by-year. In-situ detection of this harmful plant is a tremendous challenge for many countries. ...
Added: May 11, 2024
11th International Conference, AIST 2023, Yerevan, Armenia, September 28–30, 2023, Revised Selected Papers. Analysis of Images, Social Networks and Texts. Lecture Notes in Computer Science (LNCS, volume 14486)
Cham: Springer, 2024.
This book constitutes revised selected papers from the thoroughly refereed proceedings of the 11th International Conference on Analysis of Images, Social Networks and Texts, AIST 2023, held in Yerevan, Armenia, during September 28-30, 2023.   The 24 full papers included in this book were carefully reviewed and selected from 93 submissions. They were organized in topical sections ...
Added: March 25, 2024
Generative design of physical objects using modular framework
Nikita O. Starodubcev, Nikitin N., Andronova E. et al., Engineering Applications of Artificial Intelligence 2023 Vol. 119 Article 105715
In recent years generative design techniques have become firmly established in numerous applied fields, especially in engineering. These methods are crucial for automating the initial stages of the engineering design of various structures, which reduces the amount of routine work. However, existing approaches are limited by the specificity of the problem under consideration. In addition, ...
Added: March 5, 2024
Interaction models for remaining useful lifetime estimation
Zhevnenko D., Kazantsev M., Makarov I., Journal of Industrial Information Integration 2023 Vol. 33 Article 100444
The paper deals with the problem of controlling the state of industrial devices according to the readings of their sensors. The current methods are based on an approach to feature extraction in which the prediction occurs. We propose an interaction method of multiple blocks of different complexity, which aggregate information differently over time, to create ...
Added: February 15, 2024
TabDDPM: Modelling Tabular Data with Diffusion Models
Kotelnikov A., Baranchuk D., Ivan Rubachev et al., , in: Proceedings of the 40th International Conference on Machine Learning: Volume 202: International Conference on Machine Learning, 23-29 July 2023, Honolulu, Hawaii, USAVol. 202: International Conference on Machine Learning, 23-29 July 2023, Honolulu, Hawaii, USA.: PMLR, 2023. P. 17564–17579.
Denoising diffusion probabilistic models are becoming the leading generative modeling paradigm for many important data modalities. Being the most prevalent in the computer vision community, diffusion models have recently gained some attention in other domains, including speech, NLP, and graph-like data. In this work, we investigate if the framework of diffusion models can be advantageous ...
Added: February 11, 2024
A human learning optimization algorithm with reasoning learning
Zhang P., Du J., Wang L. et al., Applied Soft Computing Journal 2022 Vol. 122 Article 108816
Human Learning Optimization (HLO) is a simple yet powerful meta-heuristic developed based on a simplified human learning model. Many cognitive activities of humans contain an element of reasoning, and with reasoning, humans can gain deeper information on problems to boost learning performance. Inspired by this fact, this paper proposes a novel human learning optimization algorithm ...
Added: April 11, 2022
Deep Reinforcement Learning with DQN vs. PPO in VizDoom
Anton Zakharenkov, Makarov I., , in: Proceedings of IEEE 21st International Symposium on Computational Intelligence and Informatics (CINTI'21), 18-20 Nov. 2021.: NY: IEEE, 2021. P. 000131–000136.
Added: January 19, 2022
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Laurent F., Schneider M., Scheller C. et al., , in: Proceedings of Machine Learning ResearchVol. 133: Proceedings of the NeurIPS 2020: Competition and Demonstration Track.: PMLR, 2021. P. 275–301.
The Flatland competition aimed at finding novel approaches to solve the vehicle re-scheduling problem (VRSP). The VRSP is concerned with scheduling trips in traffic networks and the re-scheduling of vehicles when disruptions occur, for example the breakdown of a vehicle. While solving the VRSP in various settings has been an active area in operations research ...
Added: September 6, 2021
Deep Reinforcement Learning in VizDoom via DQN and Actor-Critic Agents
Maria Bakhanova, Ilya Makarov, , in: Advances in Computational Intelligence: 16th International Work-Conference on Artificial Neural Networks, IWANN 2021, Virtual Event, June 16–18, 2021, Proceedings, Part I* 1. Vol. 12861.: Springer, 2021. Ch. 12 P. 138–150.
In this work, we study the problem of learning reinforcement learning-based agents in a first-person shooter environment VizDoom. We compare several well-known architectures, such as DQN, DDQN, A3C, and Curiosity-driven model, while highlighting the main differences in learned policies of agents trained via these models. ...
Added: September 1, 2021
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Ivanov D., Egorov V., Shpilman A., , in: AAMAS'2021: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems.: IFAAMAS, 2021. P. 1536–1538.
Recent reinforcement learning studies extensively explore the interplay between cooperative and competitive behaviour in mixed environments. Unlike cooperative environments where agents strive towards a common goal, mixed environments are notorious for the conflicts of selfish and social interests. As a consequence, purely rational agents often struggle to maintain cooperation. A prevalent approach to induce cooperative ...
Added: May 29, 2021
AAMAS'2021: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems
IFAAMAS, 2021.
These are the proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-2021). They are published by the International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS). ...
Added: May 29, 2021
Workshop on AI for Autonomous Driving (AIAD)
[б.и.], 2020.
Self-driving cars and advanced safety features present one of today’s greatest challenges and opportunities for Artificial Intelligence (AI). Despite billions of dollars of investments and encouraging progress under certain operational constraints, there are no driverless cars on public roads today without human safety drivers. Autonomous Driving research spans a wide spectrum, from modular architectures -- ...
Added: December 28, 2020
MAGNet: Multi-Agent Graph Network for Deep Multi-Agent Reinforcement Learning
Shpilman A., Malysheva A., Kudenko D., , in: Proceedings of 2019 XVI International Symposium "Problems of Redundancy in Information and Control Systems" (REDUNDANCY).: IEEE, 2019. P. 171–176.
Over recent years, deep reinforcement learning has shown strong successes in complex single-Agent tasks, and more recently this approach has also been applied to multi-Agent domains. In this paper, we propose a novel approach, called MAGNet, to multi-Agent reinforcement learning that utilizes a relevance graph representation of the environment obtained by a self-Attention mechanism, and ...
Added: July 15, 2020
Learning to Route in Similarity Graphs
Баранчук Д. А., Persiyanov D., Sinitsin A. et al., , in: International Conference on Machine Learning (ICML 2019).: PMLR, 2019. P. 475–484.
Recently similarity graphs became the leading paradigm for efficient nearest neighbor search, outperforming traditional tree-based and LSH-based methods. Similarity graphs perform the search via greedy routing: a query traverses the graph and in each vertex moves to the adjacent vertex that is the closest to this query. In practice, similarity graphs are often susceptible to ...
Added: March 16, 2020
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit