Random forests with parametric entropy-based information gains for classification and regression problems
The random forest algorithm is one of the most popular and commonly used algorithms
for classification and regression tasks. It combines the output of multiple decision trees
to produce a single result. Random forests often achieve the highest accuracy on
tabular data among competing algorithms in various applications. However, random
forests, or more precisely their constituent decision trees, are usually built using the
classic Shannon entropy as the split criterion. In this article, we consider the potential of deformed entropies,
which have been applied successfully in the study of complex systems, to increase the
prediction accuracy of random forest algorithms. We develop and introduce information
gains based on the Rényi, Tsallis, and Sharma-Mittal entropies for classification and
regression random forests. We evaluate the proposed modifications on six benchmark
datasets: three for classification and three for regression problems. For classification
problems, the Rényi entropy improves the prediction accuracy of the random forest by
19-96% depending on the dataset, the Tsallis entropy improves it by 20-98%, and the
Sharma-Mittal entropy improves it by 22-111%, compared to the classical algorithm.
For regression problems, the deformed entropies improve the prediction by 2-23% in
terms of R² depending on the dataset.
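As an illustration of the idea (not the authors' implementation), the deformed entropies and the resulting split criterion can be sketched as follows. The function names and the parameterizations (`alpha`, `q`, `r`) are my own choices; the Sharma-Mittal form shown is one standard two-parameter convention that recovers Tsallis entropy at r = q and Rényi entropy in the limit r → 1.

```python
import numpy as np

def shannon_entropy(p):
    """Classic Shannon entropy H = -sum_i p_i * ln(p_i)."""
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def renyi_entropy(p, alpha):
    """Renyi entropy H_alpha = ln(sum_i p_i**alpha) / (1 - alpha), alpha != 1."""
    p = p[p > 0]
    return float(np.log(np.sum(p ** alpha)) / (1.0 - alpha))

def tsallis_entropy(p, q):
    """Tsallis entropy S_q = (1 - sum_i p_i**q) / (q - 1), q != 1."""
    p = p[p > 0]
    return float((1.0 - np.sum(p ** q)) / (q - 1.0))

def sharma_mittal_entropy(p, q, r):
    """Sharma-Mittal entropy (one standard convention), q != 1, r != 1.
    Reduces to Tsallis entropy at r = q and to Renyi entropy as r -> 1."""
    p = p[p > 0]
    s = np.sum(p ** q)
    return float((s ** ((1.0 - r) / (1.0 - q)) - 1.0) / (1.0 - r))

def class_probs(y):
    """Empirical class probabilities of a label vector."""
    _, counts = np.unique(y, return_counts=True)
    return counts / counts.sum()

def information_gain(parent, left, right, entropy):
    """Entropy-based information gain of splitting `parent` into `left`/`right`:
    H(parent) minus the size-weighted average of the child entropies."""
    n = len(parent)
    return (entropy(class_probs(parent))
            - len(left) / n * entropy(class_probs(left))
            - len(right) / n * entropy(class_probs(right)))

# Example: a perfect split of a balanced two-class node.
parent = np.array([0, 0, 0, 1, 1, 1])
left, right = parent[:3], parent[3:]
g_shannon = information_gain(parent, left, right, shannon_entropy)
g_tsallis = information_gain(parent, left, right,
                             lambda p: tsallis_entropy(p, q=2.0))
```

Swapping the entropy callable changes only the split criterion, so any of the deformed entropies can be dropped into a standard decision-tree induction loop, with the entropic parameters treated as additional hyperparameters to tune.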