• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Random forests with parametric entropy-based information gains for classification and regression problems
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
April 30, 2026
HSE Researchers Compile Scientific Database for Studying Childrens Eating Habits
The database created at HSE University can serve as a foundation for studying children’s eating habits. This is outlined in the study ‘The Influence of Age, Gender, and Social-Role Factors on Children’s Compliance with Age-Based Nutritional Norms: An Experimental Study Using the Dish-I-Wish Web Application.’ The work has been carried out as part of the HSE Basic Research Programme and was presented at the XXVI April International Academic Conference named after Evgeny Yasin.
April 30, 2026
New Foresight Centre Study Identifies the Most Destructive Global Trends for Humankind
A team of researchers from the HSE International Research and Educational Foresight Centre has examined how global trends affect the quality of human life—from life expectancy to professional fulfilment. The findings of the study titled ‘Human Capital Transformation under the Influence of Global Trends’ were published in Foresight.
April 28, 2026
Scientists Develop Algorithm for Accurate Financial Time Series Forecasting
Researchers at the HSE Faculty of Computer Science benchmarked more than 200,000 model configurations for predicting financial asset prices and realised volatility, showing that performance can be improved by filtering out noise at specific frequencies in advance. This technique increased accuracy in 65% of cases. The authors also developed their own algorithm, which achieves accuracy comparable to that of the best models while requiring less computational power. The study has been published in Applied Soft Computing.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Random forests with parametric entropy-based information gains for classification and regression problems

PeerJ Computer Science. 2024. Vol. 10. Article e1775.
Ignatenko V., Surkov A., Sergei Koltcov

The random forest algorithm is one of the most popular and commonly used algorithms
for classification and regression tasks. It combines the output of multiple decision trees
to form a single result. Random forest algorithms demonstrate the highest accuracy on
tabular data compared to other algorithms in various applications. However, random
forests and, more precisely, decision trees, are usually built with the application of
classic Shannon entropy. In this article, we consider the potential of deformed entropies,
which are successfully used in the field of complex systems, to increase the prediction
accuracy of random forest algorithms. We develop and introduce the information gains
based on Renyi, Tsallis, and Sharma-Mittal entropies for classification and regression
random forests. We test the proposed algorithm modifications on six benchmark
datasets: three for classification and three for regression problems. For classification
problems, the application of Renyi entropy allows us to improve the random forest
prediction accuracy by 19-96% in dependence on the dataset, Tsallis entropy improves
the accuracy by 20-98%, and Sharma-Mittal entropy improves accuracy by 22-111%
compared to the classical algorithm. For regression problems, the application of
deformed entropies improves the prediction by 2-23% in terms of R2 in dependence
on the dataset.

Research target: Computer Science
Language: English
Full text
DOI
Keywords: Renyi entropyTsallis entropySharma-Mittal entropyrandom forestClassification regression
Publication based on the results of:
Modelling information and communication behaviour in computer-mediated environments and improving algorithms for behavioural data analysis (2024)
Similar publications
On the minimum number of maximal distance-k independent sets in trees
Taletskii D., / Series arXiv "math". 2026.
A vertex subset of a graph is called a \textit{distance-$k$ independent set} if the distance between any two of its distinct vertices is at least $k + 1$. For all $n,k \geq 1$, we determine the minimum possible number of inclusion-wise maximal distance-$k$ independent sets among all $n$-vertex trees. It equals~$n$ if $n \leq k ...
Added: May 1, 2026
Proceedings of the 2026 8th International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)
Dayoub A., Suleiman E., IEEE, 2026.
2026 8th International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE) 1-3 April 2026 ...
Added: April 30, 2026
Интеллектуальный анализ данных в нефтегазовой отрасли
М.: ООО «Геомодель Развитие», 2024.
Интелшектуальный анализ данных в нефтегазовой отрасли, Калининград, Россия, 2024, ООО «Геомодель Развитие» ...
Added: April 29, 2026
Bioinspired Method of Agent Redistribution between Groups
Karpova Irina Petrovna, Pattern Recognition and Image Analysis 2025 Vol. 35 No. 4 P. 1138–1144
A solution to the problem of redistributing agents between groups based on simulating a form of social parasitism in ants known as slave-making is considered. To provide a comprehensive solution, the problem is integrated with a method of orientation based on visual landmarks and a compass, including route memorization and return. The models and mechanisms ...
Added: April 29, 2026
Natural hazard database from Internet publications: text mining with a large language model
Derkacheva A., Sakirkina M., Kraev G. et al., /. 2026.
Comprehensive data on natural hazards and their consequences are crucial for effective for risk assessment, adaptation planning, and emergency response. However, many countries face challenges with fragmented, inconsistent, and inaccessible data, particularly regarding local-scale events. To address this data gap in Russia, we developed an end-to-end processing pipeline that scrapes news from various online sources, ...
Added: April 28, 2026
Influence of the Normal Magnetic Component to Magnetotail Current Sheet Forma
Domrin V. I., Malova H. V., V. Yu. Popov et al., Cosmic Research 2026 Vol. 64 No. 2 P. 238–252
During magnetospheric perturbations a relatively thin current sheet with thickness about several proton gyroradii forms in the Earth’s magnetotail. In a framework of the kinetic model describing current sheet thinning in the magnetotail, the processes of its formation are investigated depending on the normal magnetic field magnitude which affects both the current sheet structure and particle dynamics within ...
Added: April 27, 2026
Asymmetric Equilibrium Structures of Superthin Current Sheets: The Asymmetry of Plasma Sources
Tsareva O. O., Malova H. V., V. Yu. Popov et al., Plasma Physics Reports 2026 Vol. 52 No. 2 P. 179–185
The influence of asymmetry of plasma sources on the structure and spatial localization of a superthin current sheet (STCS) supported by demagnetized electrons is studied using a self-consistent model. The simulation takes into account the presence of a single plasma source in the northern hemisphere, which makes the plasma flow asymmetric. It is demonstrated that the asymmetry of ...
Added: April 27, 2026
WWW '26: The ACM Web Conference 2026
NY: Association for Computing Machinery (ACM), 2026.
It is our great pleasure to welcome you to the 35th edition of the Web Conference to be held on June 29 – July 3, 2026, in Dubai, United Arab Emirates. Following discussions with our partners and key stakeholders, we have taken the decision to postpone the ACM Web Conference 2026, initially planned for April 2026. ...
Added: April 23, 2026
Разработка микросервиса ADP для идентификации источников выбросов на основе машинного обучения с подкреплением
Kychkin A., Chernitsin I., Прикладная информатика 2026 Т. 21 № 1 С. 40–58
The results of the development of a software microservice embedded in atmospheric air quality monitoring systems to support the identification of industrial pollution sources are presented. The emission and subsequent spread of harmful substances in the lower layers of the atmosphere is dynamic and characterized by high uncertainty due to the specific features of technological ...
Added: April 23, 2026
2026 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)
IEEE, 2026.
Added: April 21, 2026
What Drives Multi-Chain Crypto Forecasting: Model Choice, Feature Selection, and Transferability
Wang M., Xiao Y., Braslavski P. et al., Mathematics 2026 Vol. 14 No. 8 Article 1286
Increasingly shaped by heterogeneous on-chain activity rather than a single shared market process, this study investigates 7-day-ahead forecasting using 147 market and on-chain indicators across eight major blockchain ecosystems from October 2023 to April 2025. We benchmark statistical, deep-learning, and foundation-model baselines under multiple feature-selection pipelines using both error metrics and Diebold–Mariano tests. TiRex achieves ...
Added: April 20, 2026
Cross-influence of two societies in deterministic evolutionary game
Shchur L., Antonov D., Burovski E., International Journal of Bifurcation and Chaos in Applied Sciences and Engineering 2026 P. 1–9
We present a simple model that simulates the possible influence of one society on another. Specifically, two societies evolve deterministically according to the well-known Nowak-May spatial game with the addition of mutual influence through connections that reflect the current states of the societies. This may be related to the influence of a global information resource ...
Added: April 20, 2026
Проектирование сети Интернета вещей на основе многокритериальной оптимизации и информационного моделирования здания
Ebraheem A., Информационные процессы 2025 Т. 25 № 4 С. 787–798
The article proposes a method for planning the placement of access points and gateways inside buildings for constructing Internet of Things networks. The basis of the method is the use of information from a building information model, which makes it possible to easily take into account both the geometry and the physical and technical characteristics ...
Added: April 19, 2026
Modeling cosolvent effects on solubility in supercritical CO2 using data-driven approaches
Makarov D. M., Kalikin N., Gurikov P. et al., Journal of Supercritical Fluids 2026 Vol. 235 Article 106979
Supercritical CO2 (scCO2 ) is an environmentally friendly solvent, but its low polarity limits the solubility of polar compounds. Cosolvents are commonly used to enhance solvation capability, yet comprehensive datadriven studies are scarce. We compiled the largest dataset to date — 4401 experimental solubility records with 22 cosolvents for 93 nonionic solutes, plus 4855 records ...
Added: April 19, 2026
2026 28th International Conference on Digital Signal Processing and its Applications (DSPA)
IEEE, 2026.
A.S. Popov Russian Science and Technical Society with support from V. A. Trapeznikov Institute of Control Sciences, V.A. Kotelnikov Institute of Radio Engineering and Electronics, Autex Ltd. is leading the ХХVIII International Conference «Digital Signal Processing and its Applications — DSPA-2026» ...
Added: April 18, 2026
Построение системы опережающих индикаторов для прогнозирования валютного кризиса
Shchepeleva M., Финансы: теория и практика 2025 Т. 29 № 4 С. 146–162
This research is devoted to the analysis of financial crises. We examine different classifications of crises, methods of forecasting, approaches to building systems of early warning indicators. To better understand the potential for predicting financial crises, we conduct our own empirical research, comparing Logit model and random forest to predict currency crises in developing countries. ...
Added: February 12, 2026
Классификации и классификаторы в науке и аналитике
Isakov V., Юридическая техника 2024 № 18 С. 17–31
This consultation is devoted to two closely interrelated issues: the first part examines the logical and methodological foundations of the classification approach in analytics, the second part applies this approach to the analytics itself as an object of classification, considers its types and types. ...
Added: December 15, 2025
Automated Identification of Business Models
Milei P., Votintseva N., Barajas A., Information Processing and Management 2025 Vol. 62 No. 1 Article 103893
As business data grows in volume and complexity, there is an increasing demand for efficient, accurate, and scalable methods to analyse and classify business models. This study introduces and validates a novel approach for the automated identification of business models through content analysis of company reports. Our method builds on the semantic operationalisation of the ...
Added: September 22, 2025
Прогнозирование цен на золото с использованием алгоритмов нейросетей
Soldatova A., Финансы, деньги, инвестиции 2023 № 4 С. 9–15
The price of gold is the most important economic indicator. Expectations of rising inflation and higher key rates from central banks are driving investor interest in gold around the world. Given the increasing number of factors influencing the dynamics of the gold rate in the world, forecasting gold prices requires new methods and modern technological ...
Added: July 8, 2025
Predicting Systemic Risk in the Russian Financial Sector with Boosting Techniques
Shchepeleva M., Procedia Computer Science 2024 Vol. 242 P. 51–56
We test the predictive performance of different ensemble methods for forecasting systemic risk in Russia for the period 2008-2024. In contrast to the existing research on machine learning ensemble techniques, we find that conventional random forest works better for the Russian data. Based on this model, we additionally conduct variable importance analysis. We identify that ...
Added: June 17, 2025
Forecasting Stadium Attendance Using Machine Learning Models: A Case of the National Football League
Пан Ю., Wang F., Studia Sportiva 2024 Vol. 18 No. 2 P. 147–164
Added: May 16, 2025
Thermodynamic Parameters of Khubsugul Mountain Forests (Khordol-Sardag, Mongolia)
R. B. Sandlerskiy, Petrzhik N. M., Jargalsaikhan T. et al., Biology Bulletin 2023 Vol. 50 No. S2 P. S226–S238
The results of using a thermodynamic approach to study the functioning of mountain forest biogeocenoses based on Landsat 8 OLI TIRS multispectral scanner survey for the landscapes of the northwestern Khubsugul region are presented. Using the example of a section of the Khordol-Sardag ridge, the spatiotemporal variation of thermodynamic characteristics calculated within the framework of ...
Added: February 26, 2025
Thermodynamic Properties of Landscape Cover
Robert Sandlersky, , in: Reference Module in Earth Systems and Environmental Sciences.: Oxford: Elsevier, 2025. P. 1–11.
Added: February 19, 2025
Сравнение ансамблевых и корреляционных графов в задаче классификации состояний мозга на основе фМРТ данных
Vlasenko D., Ушаков В. Г., Zaikin A. et al., Известия высших учебных заведений. Прикладная нелинейная динамика 2025 Т. 33 № 4 С. 557–566
The study of functional brain networks that support cognitive processes is one of the central goals of modern neuroscience. Functional magnetic resonance imaging (fMRI) is widely used to obtain data on brain activity. However, the high dimensionality and dynamic nature of fMRI data makes their processing challenging. Network-based methods of data representation offer a promising approach to describe ...
Added: January 14, 2025
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit