Grid Path Planning with Deep Reinforcement Learning: Preliminary Results

A. I. Panov; K. Yakovlev; Suvorov R. E.

doi:10.1016/j.procs.2018.01.054

Publications

?

Grid Path Planning with Deep Reinforcement Learning: Preliminary Results

Procedia Computer Science. 2018. Vol. 123. P. 347–353.

Panov A. I., Yakovlev K., Suvorov R. E.

Single-shot grid-based path finding is an important problem with the applications in robotics, video games etc. Typically in AI community heuristic search methods (based on A And its variations) are used to solve it. In this work we present the results of preliminary studies on how neural networks can be utilized to path planning on square grids, e.g. how well they can cope with path finding tasks by themselves within the well-known reinforcement problem statement. Conducted experiments show that the agent using neural Q-learning algorithm robustly learns to achieve the goal on small maps and demonstrate promising results on the maps have ben never seen by him before.

Research target: Computer Science

Priority areas: IT and mathematics

Language: English

DOI

Text on another site

Keywords: reinforcement learning path planning convolution networks

Proceedings of IEMTRONICS 2025 International IoT, Electronics and Mechatronics Conference, Volume 1

Singapore: Springer Singapore, 2026.

This book gathers selected research papers presented at IEMTRONICS 2025 (International IoT, Electronics and Mechatronics Conference), held during 3–5 April 2025 in London, United Kingdom, in hybrid mode. This book presents a collection of state-of-the-art research work involving cutting-edge technologies in the field of IoT, electronics mechatronics, and related areas. The work is presented in ...

Added: April 11, 2026

Restricted inverse optimal value problem on linear programming under weighted l1 norm

Jia J., Guan X., Pardalos P. M., Journal of Computational and Applied Mathematics 2026 Vol. 486 Article 117687

We study the restricted inverse optimal value problem on linear programming under weighted l1 norm (RIOVLP1). Given a linear programming problem (LP with a feasible solution x0 and a value K, we aim to adjust the cost vector c to such that x0 becomes an optimal solution of the problem (LP) whose objective value equals K. The objective function is to minimize the distance under weighted l1 norm. First, we reformulate ...

Added: April 10, 2026

ОБУЧЕНИЕ РАСПОЗНАВАНИЮ ЭМОЦИЙ ПОСРЕДСТВОМ МОБИЛЬНОГО ПРИЛОЖЕНИЯ «ТРОПЭМО»

Shadrina E. V., Мохова В. О., Загоскин В. А. et al., Нижегородский психологический альманах 2024 № 2

The article considers the problem of learning of recognizing emotions from pictures. A review and analysis of domestic and foreign works of scientists dealing with the problem of emotional intelligence was carried out. Its formation, influence on human activity and existing variants of its structure were considered, and common features in the understanding of emotional ...

Added: April 9, 2026

Learning When to Personalize: LLM Based Playlist Generation via Query Taxonomy and Classification

Buzaev F., Пугачёва Д. В., Sukharev I. et al., Transactions of the Association for Computational Linguistics 2026 P. 51–57

Playlist generation based on textual queries using large language models (LLMs) is becoming an important interaction paradigm for music streaming platforms. User queries span a wide spectrum from highly personalized intent to essentially catalog-style requests. Existing systems typically rely on non-personalized retrieval/ranking or apply a fixed level of preference conditioning to every query, which can ...

Added: April 7, 2026

On the Optimal Decomposition of the U-UV Codes

Kuvshinov A., Fominykh A., Ivanov F., IEEE Access 2026 Vol. 14 P. 50549–50557

The recursive (U|U+V) construction, a generalization of which includes polar codes, provides a powerful framework for building complex codes from simpler components. However, existing approaches predominantly rely on fixed or symmetric tree architectures, overlooking the critical impact of decomposition choice on code performance. This paper addresses the challenge of optimal tree decomposition selection by presenting a framework ...

Added: April 7, 2026

Using predefined vector systems to speed up neural network multimillion class classification

Gabdullin N., Androsov I., / Series Computer Science "arxiv.org". 2026.

Label prediction in neural networks (NNs) has O(n) complexity proportional to the number of classes. This holds true for classification using fully connected layers and cosine similarity with some set of class prototypes. In this paper we show that if NN latent space (LS) geometry is known and possesses specific properties, label prediction complexity can ...

Added: April 2, 2026

Математическое и компьютерное моделирование в экономике, страховании и управлении рисками: сборник статей. Выпуск 10. Материалы XIV Научно-практической конференции

Саратов: Саратовский университет, 2025.

В сборнике представлены материалы XIV Научно-практической конференции «Математическое и компьютерное моделирование в экономике, страховании и управлении рисками». Тематика статей затрагивает круг вопросов, связанных с экономикоматематическим и компьютерным моделированием и управлением рисками в финансовой деятельности, страховании, банковском деле, инвестировании, государственном управлении экономикой, бизнес-информатике и других разделах экономикоматематических знаний. Для сотрудников банков, финансовых и страховых компаний, экономических отделов организаций, служб управления ...

Added: March 31, 2026

МОДИФИЦИРОВАННАЯ ГРАВИТАЦИОННАЯ МОДЕЛЬ ОЦЕНКИ ДОСТУПНОСТИ МЕДИЦИНСКИХ УСЛУГ: ЗАДАЧА, АЛГОРИТМ И РЕАЛИЗАЦИЯ

Begicheva A., Бегичева С. В., Прикладная информатика 2025 Т. 20 № 5 (119) С. 4–21

Territorial inequality in access to healthcare remains a pressing issue for the healthcare system of the Russian Federation. Significant disparities in transport accessibility, staffing levels, and the spatial distribution of medical facilities complicate evidence-based decision-making, especially in regions with uneven population density and fragmented infrastructure. This creates the need for formalized and reproducible approaches to ...

Added: March 30, 2026

A framework for text mining on Twitter: a case study on joint comprehensive plan of action (JCPOA)- between 2015 and 2019

Behzadidoost R., Quality and Quantity 2021 Vol. 56 No. 5 P. 3053–3084

In the big data era, there is a necessity for effective frameworks to collect, retrieve, and manage data. As not all tweets are hashtagged by users, retrieving them is a complicated task. To address this issue, we present a rule-based expert system classifier that uses the well-known concept of fingerprint in the judicial sciences. This ...

Added: March 27, 2026

О задаче построения децентрализованной интеллектуальной транспортной системы на основе протокола RAFT и кластеризации по сетевому расстоянию.

Kaperko A., Городничев М. Г., Саксонов Е. А. et al., Вестник Рязанского государственного радиотехнического университета, Российская Федерация 2025 № 94 С. 59–67

The article is devoted to the development and experimental evaluation of a decentralized architecture for an intelligent transport system (ITS) based on the Raft consensus protocol and the network distance metric (RTT) server clustering method. It is shown that existing solutions either require manual configuration and centralized coordination, or are not optimized for latency with ...

Added: March 25, 2026

Потенциал терапевтического применения спектроскопии в ближней инфракрасной области после инсульта (обзор)

Mokienko O., Современные технологии в медицине 2025 Т. 17 № 2 С. 73–85

The advancement of novel technologies for the rehabilitation of post-stroke patients represents a significant challenge for a range of interdisciplinary fields. Near-infrared spectroscopy (NIRS) is an optical neuroimaging technique based on recording local hemodynamic changes at the cerebral cortex level. The technology is typically employed in post-stroke patients for diagnostic purposes, including the assessment of ...

Added: March 18, 2026

О СЛОЖНОСТИ ПРОБЛЕМЫ ТОТАЛЬНОЙ ВЫВОДИМОСТИ В НЕУКОРАЧИВАЮЩИХ И КОНТЕКСТНО-СВОБОДНЫХ ГРАММАТИКАХ

Dudakov S., Карлов Б. Н., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 524 № 1 С. 11–18

In this paper we study the problem of total derivability in context-free, noncontracting, and context-sensitive grammars. Given a grammar and a terminal word, one has to determine whether there exists a derivation of this word which uses each production no less than a given number of times. It is proved that the problem of total ...

Added: March 18, 2026

Тренировки представления движения и интерфейс мозг-компьютер в когнитивной реабилитации

Лабор В. В., Mokienko O., Черкасова А. Н. et al., Журнал неврологии и психиатрии им. С.С. Корсакова 2025 Т. 125 № 11 С. 27–35

В статье представлен обзор исследований, посвященных применению тренировок представления движения и интерфейсов мозг-компьютер (ИМК) для когнитивной реабилитации пациентов с неврологическими заболеваниями. На основе анализа исследований, опубликованных с 2004 по 2025 г., проведена оценка эффективности данных методов в восстановлении когнитивных функций у пациентов с инсультом (13 исследований), болезнью Паркинсона (4) и рассеянным склерозом (2). Большинство исследований демонстрирует положительное влияние тренировок представления движения на когнитивные функции пациентов с неврологическими заболеваниями и когнитивным дефицитом средней ...

Added: March 18, 2026

Diagnostic Accuracy of AI for Opportunistic Screening of Abdominal Aortic Aneurysm in CT: A Systematic Review and Narrative Synthesis

Kodenko M., Vasilev Y., Vladzymyrskyy A. et al., Diagnostics 2022 Vol. 12 No. 12 Article 3197

In this review, we focused on the applicability of artificial intelligence (AI) for opportunistic abdominal aortic aneurysm (AAA) detection in computed tomography (CT). We used the academic search system PubMed as the primary source for the literature search and Google Scholar as a supplementary source of evidence. We searched through 2 February 2022. All studies ...

Added: March 18, 2026

Аналоги GitHub из Китая … и их готовность к международному участию

Silakov D., Системный администратор 2026 № 1-2 С. 46–51

События последних лет показали, что подобно многим другим сферам международного сотрудничества, разработка свободного ПО подвержена влиянию политических веяний. Доступность кода тех или иных открытых проектов, не говоря уже о возможности участия в их разработке, может в одно мгновение оказаться под вопросом. Как обезопасить себя от подобных сценариев на уровне государства и при этом не замкнуться ...

Added: March 17, 2026

Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - (Volume 1) ICINCO 2024

SciTePress, 2025.

This book contains the proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics. This year, ICINCO is held in Porto, Portugal, on November 18-20, 2024. It was sponsored by the Institute for Systems and Technologies of Information, Control and Communication (INSTICC), and technically co-sponsored by the IEEE Systems, Man and Cybernetics ...

Added: March 17, 2026

11th International Conference on Automation, Robotics, and Applications (ICARA 2025)

IEEE, 2025.

On behalf of the organizing committee, it is our great privilege to present this compendium of research articles for the 11th International Conference on Automation, Robotics, and Applications (ICARA 2025), which will be held in the vibrant city of Zagreb, Croatia from February 12 to 14, 2025. The proceedings encapsulate the latest advancements and innovative research in the ...

Added: March 17, 2026

16th International Conference "Intelligent Systems" 2024 (INTELS'2024), Vol 2603

Springer, 2025.

The three-volume set CCIS 2603, 2604 and 2605 constitutes the refereed proceedings of the 16th International Conference on Intelligent Systems, INTELS 2024, held in Moscow, Russia, during December 2–4, 2024. The 72 papers included in these proceedings were carefully reviewed and selected from 140 submissions. They focus on areas of intelligent systems and artificial intelligence and their application ...

Added: March 17, 2026

Decision-Making in Computational Intelligence-Based Systems. Studies in Systems, Decision and Control, vol 628

Cham: Springer, 2025.

This book delivers actionable insights through 21 peer-reviewed chapters featuring new methods, models, and applications based on computational intelligence. Discover cutting-edge tools to support smart, efficient decision-making in complex, real-world scenarios. Organized into three parts—prescriptive analytics, soft computing models, and practical case studies—it spans domains such as healthcare, energy, mobility, finance, and public services. Readers ...

Added: March 17, 2026

Formation control of unmanned aerial vehicle swarms for outdoor monitoring in search and rescue tasks[Управление роем беспилотных летательных аппаратов для мониторинга открытой местности при поисково-спасательных операциях]

Frolov O., Safin R., Tsoy T. et al., Ученые записки Казанского университета. Серия: Физико-математические науки 2025 Vol. 167 No. 4 P. 786–805

Advancements in robotics have expanded a use of unmanned aerial vehicle (UAV) swarms in critical tasks such as disaster response, including search and rescue operations during floods, hurricanes, landsliding, and earthquakes. Swarm formation control stands as a critical challenge in UAV swarm control. In this article, a simple and resource-efficient method for addressing collisions within swarm formations during outdoor ...

Added: March 17, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al., / arxiv.org. Серия cs.SI "Social and Information Networks ". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A., / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: original packets are encoded into coded packets, and the message is reconstructed after the first successful deliveries, effectively shifting latency from the maximum packet delay to the -th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

Меньшиков И. А., Бернадотт А. К., Elvimov N. S., / Series arXie "Statistical mechanics". 2025.

Accurate segmentation of blood vessels in brain magnetic resonance angiography (MRA) is essential for successful surgical procedures, such as aneurysm repair or bypass surgery. Currently, annotation is primarily performed through manual segmentation or classical methods, such as the Frangi filter, which often lack sufficient accuracy. Neural networks have emerged as powerful tools for medical image ...

Added: December 1, 2025

Implementation of Rev1 and Rev2 Bug Family Algorithms in ROS Noetic

Roslavtsev M., Eryomin A., Safin R. et al., , in: 2024 8th International Conference on Information, Control, and Communication Technologies (ICCT).: IEEE, 2024. P. 1–5.

Modern map-dependent algorithms for mobile robot navigation typically overload a CPU and memory with a gradually increasing amount of environmental data. In contrast, Bug family local path planning algorithms operate without mapping and have significantly lower hardware requirements. Bug algorithms use real-time measurements from visual and touch sensors to make immediate decisions on direction of ...

Added: November 25, 2025