?
Attribution of Customers’ Actions Based on Machine Learning Approach
P. 77-88.
Timur Kadyrov, Ignatov D. I.
A multichannel attribution model based on gradient boost- ing over trees is proposed, which was compared with the state of the art models: bagged logistic regression, Markov chains approach, shapely value. Experiments on digital advertising datasets showed that the pro- posed model is better than the solutions considered by ROC AUC metric. In addition, the problem of probability prediction of conversion by the consumer using the ensemble of the analyzed algorithms was solved, the meta-features obtained were enriched with consumers and offline activities of the advertising campaign data.
Keywords: цифровая рекламаGradient boostingградиентный бустингMulti-touch attributionDigital advertisingData-driven marketingМногоканальная атрибуция рекламымаркетинг на основе данных
Publication based on the results of:
In book
Vol. 2479. , CEUR Workshop Proceedings, 2019
Kuralenok I., Ershov V., Лабутин И. Н., , in : Advances in Neural Information Processing Systems 32 (NeurIPS 2019). : [б.и.], 2019. P. 1-10.
In this work, we introduce a new decision tree ensemble representation framework: instead of using a graph model we transform each tree into a well-known polynomial form. We apply the new representation to three tasks: theoretical analysis, model reduction, and interpretation. The polynomial form of a tree ensemble allows a straightforward interpretation of the original ...
Added: December 27, 2019
Bukina T. V., Kashin D., Экономический журнал Высшей школы экономики 2024 Т. 28 № 1 С. 81-107
The paper reveals the forecasts for regional inflation based on the regions of the Privolzhskiy Federal District (PFD). The purpose of the study is to determine the model that most accurately predicts regional inflation. The paper compares the tools of machine learning – support vector machines, gradient boosting, and random forest – with econometric models ...
Added: February 13, 2024
Нужный А. С., Стариков А. С., Левченко Е. Н. et al., Нефтехимия 2022
The problem of constructing virtual analyzers of laboratory indicators for light fractions of hydrocracking of oil residues is considered. In the course of performing this task, the main indicators of the state of the installation associated with the simulated laboratory values were identified. The task of constructing virtual analyzers was reduced to the classical task ...
Added: March 31, 2022
Bulychev A., Сомов О. Д., В кн. : Информатика, управление и системный анализ: Труды V Всероссийской научной конференции молодых ученых с международным участием. : Ростов н/Д : Ростовский государственный экономический университет "РИНХ", 2018. С. 94-102.
In the process of developing an information system for logistics transportation, there is a need to determine the initial rating of the new carrier within the parent company. The presence of the rating helps to more accurately carry out the formation of orders and build forecasts of its interaction with the parent company in the ...
Added: September 3, 2019
Shurmina I., Interactive Entertainment Law Review 2020 No. 3 P. 59-66
Added: November 16, 2021
Малинин А. А., Prokhorenkova L., Ustimenko A., , in : Proceedings of the 9th International Conference on Learning Representations (ICLR 2021). ICLR, 2021. : ICLR, 2021.
Added: August 2, 2021
Kulagin M., Sidorenko V., , in : Proceedings of the Third International Scientific Conference “Intelligent Information Technologies for Industry” (IITI’18) Volume 2. Vol. 2.: Springer, 2019. P. 308-316.
This article describes modern methods of data processing regarding the task of assessing activities of transportation employees. The main purpose was to find dependencies in data and construct an algorithm for predicting the probability of transport safety violation by employee. The research was conducted for locomotive drivers. The following algorithms were used: neural networks, gradient boosting over decision trees and ...
Added: January 19, 2019
Antipov E. A., Pokryshevskaya E. B., Journal of Revenue and Pricing Management 2020 No. 19 P. 355-364
Forecasting demand and understanding sales drivers are one of the most important tasks in retail analytics. However, traditionally, linear models and/or models with a small number of predictors have been predominantly used in sales modeling. Taking into account that real-world demand is naturally determined by complex substitution and complementation patterns among a large number of ...
Added: October 31, 2020
Muratova A., Mitrofanova E., Islam R., , in : Procedia Computer Science: 11th International Young Scientist Conference on Computational Science. Vol. 212.: Elsevier, 2022. P. 358-367.
The article presents a case study on demographic sequences analysis through modern machine learning (ML)
techniques. The studied data contains demographic and socioeconomic events, where the events are presented
as sequences of statuses. The involved demographers are interested in applications of advanced ML techniques
and interpretable patterns for their needs. We show how Shapley value-based explanations can be ...
Added: September 10, 2022
Folberth C., Baklanov A., Balkovič J. et al., Agricultural and Forest Meteorology 2019 Vol. 264 P. 1-15
Global gridded crop models (GGCMs) are essential tools for estimating agricultural crop yields and externalities at large scales, typically at coarse spatial resolutions. Higher resolution estimates are required for robust agricultural assessments at regional and local scales, where the applicability of GGCMs is often limited by low data availability and high computational demand. An approach ...
Added: January 23, 2019
Liudmila Prokhorenkova, Gusev G., Vorobev A. et al., , in : Advances in Neural Information Processing Systems 31 (NeurIPS 2018). : Neural Information Processing Systems Foundation, 2018. P. 6638-6648.
This paper presents the key algorithmic techniques behind CatBoost, a new gradient boosting toolkit. Their combination leads to CatBoost outperforming other publicly available boosting implementations in terms of quality on a variety of datasets. Two critical algorithmic advances introduced in CatBoost are the implementation of ordered boosting, a permutation-driven alternative to the classic algorithm, and ...
Added: May 1, 2020
Соколов А. П., Прохоренкова Л. А., Интеллектуальные системы. Теория и приложения 2023 Т. 27 № 1 С. 18-23
Решающие деревья широко применяются в машинном обуче
нии, статистике и анализе данных. Предиктивные модели, осно
ванные на решающих деревьях, показывают отличные результаты
в терминах точности и времени обучения, особенно на гетерогенных
табличных датасетах. Производительность, простота и надежность
делают это семейство алгоритмов одним из наиболее популярных
в машинном обучении и науке о данных.
Одним из важных гиперпараметров алгоритмов, основанных на
решающих деревьях, является максимальная ...
Added: February 11, 2024
Ustimenko A., Prokhorenkova L., , in : Proceedings of the 38th International Conference on Machine Learning (ICML 2021). Vol. 139.: PMLR, 2021. P. 1-10.
This paper introduces Stochastic Gradient Langevin Boosting (SGLB) - a powerful and efficient machine learning framework that may deal with a wide range of loss functions and has provable generalization guarantees. The method is based on a special form of the Langevin diffusion equation specifically designed for gradient boosting. This allows us to theoretically guarantee ...
Added: August 6, 2021
Нужный А. С., Глухов А. Ю., Левченко Е. Н. et al., Нефтехимия 2021
Рассматривается задача построения виртуальных анализаторов лабораторных показателей для легких фракций гидрокрекинга нефтяных остатков. В ходе выполнения данной задачи были выделены основные показатели состояния установки, связанные с моделируемыми лабораторными величинами. Задача построения виртуальных анализаторов была сведена к классической задаче «обучения с учителем» путем смещения показателей установки на правильно подобранные временные лаги. Рассматривается подход к оценке значений ...
Added: April 11, 2021
Ivanov S., Prokhorenkova L., , in : Proceedings of the 9th International Conference on Learning Representations (ICLR 2021). ICLR, 2021. : ICLR, 2021.
Added: August 2, 2021
Alexandrovskiy S., Trundova Olga, International Journal of Internet Marketing and Advertising 2022 Vol. 16 No. 1/2 P. 19-37
The authors aim to offer a probability approach for measuring media contribution to online conversions in e-commerce. The authors reviewed literature on attribution modelling with application of heuristics (Google Analytics) and probability (Markov chains) models. The survey used the data of customer journeys from 134132 users to build up the attribution model. As a result, ...
Added: March 10, 2020
Liudmila Prokhorenkova, Ustimenko A., , in : International Conference on Machine Learning (ICML 2020). Vol. 119.: PMLR, 2020. P. 9669-9679.
In this paper, we introduce a powerful and efficient framework for direct optimization of ranking metrics. The problem is ill-posed due to the discrete structure of the loss, and to deal with that, we introduce two important techniques: stochastic smoothing and novel gradient estimate based on partial integration. We show that classic smoothing approaches may ...
Added: January 14, 2021