Machine Learning for Subgroup Discovery under Treatment Effect

А. В. Бузмаков

?

Machine Learning for Subgroup Discovery under Treatment Effect

Cornell University , 2019. № arXiv:1902.10327.

In many practical tasks it is needed to estimates effect of a treatemnt on individual level. For example in medicine it is essential to determine the patients that would benifit from a certain medicament. In marketing knowning the persons that are likely to buy a new product would reduce the amount of spam. In this chapter we review the methods to estimate individulize treatment effect from a randomized trial, i.e., an experiment when a part of individuals recieves a new treatment, while the others does not. Finally, it is shown that new efficient methods are needed in this domain.

Research target: Computer Science

Priority areas: IT and mathematics

Language: Russian

Full text

Keywords: машинное обучение ансамблевые методы machine learning ensemble methods randomized controlled trials treatment effect estimation Оценка эффекта от воздействия Рандомизированный эксперимент

Ансамблевый метод машинного обучения, основанный на рекомендации классификаторов

Kashnitsky Y., Ignatov D. I., Интеллектуальные системы. Теория и приложения 2015 Т. 19 № 4 С. 37–55

The paper makes a brief introduction into multiple classifier systems and describes a particular algorithm which improves classification accuracy by making a recommendation of an algorithm to an object. This recommendation is done under a hypothesis that a classifier is likely to predict the label of the object correctly if it has correctly classified its ...

Added: December 7, 2015

МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ

Buzmakov A. V., В кн.: МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ. СПб.: Федеральное государственное автономное образовательное учреждение высшего образования "Санкт-Петербургский политехнический университет Петра Великого", 2020. С. 284–333.

In many practical tasks it is needed to estimate an effect of treatment on individual level. For example, in medicine it is essential to determine the patients that would benefit from a certain medicament. In marketing, knowing the persons that are likely to buy a new product would reduce the amount of spam. In this ...

Added: December 7, 2021

Proceedings of the Fifthteenth International Conference on Concept Lattices and Their Applications

CEUR-WS.org, 2020.

The CLA conference is an international forum for researchers, practitioners and students dedicated to the practice of Formal Concept Analysis (FCA) and areas closely related to it, including data analysis and mining, information retrieval, knowledge management, knowledge engineering, logic, algebra and lattice theory. The 15th of CLA, CLA 2020, was going to be held in Tallinn, Estonia ...

Added: October 30, 2020

Fairness of Machine Learning Algorithms in Demography

Emmanuel I. C., Mitrofanova E., / Series 4064475 "ArXiv Preprint". 2022.

The paper is devoted to the study of the model fairness and process fairness of the Russian demographic dataset by making predictions of divorce of the 1st marriage, religiosity, 1st employment and completion of education. Our goal was to make classifiers more equitable by reducing their reliance on sensitive features while increasing or at least ...

Added: May 31, 2022

Supplementary Proceedings of the 3rd International Conference on Analysis of Images, Social Networks and Texts (AIST 2014)

Ekaterinburg: CEUR Workshop Proceedings, 2014.

AIST'2014 is an international data science conference on Analysis of Images, Social Networks, and Texts. Traditionally, the conference is held annually in Yekaterinburg, Russia. The conference is intended for computer scientists and practitioners whose research interests involve Internet mathematics and other related fields of data science. LIST OF TOPICS (NON EXHAUSTIVE) Applications of Data Mining and Machine ...

Added: August 28, 2014

Применение методов машинного обучения для решения задачи автоматической рубрикации статей по УДК

Romanov A., Ломотин К. Е., Козлова Е. С., Информационные технологии 2017 Т. 23 № 6 С. 418–423

The paper deals with the applicability of modern machine learning methods to the problem of automatic generation of UDC for scientific articles. As the classifiers, such models as artificial neural networks, logistic regression and boosting are considered. Graph algorithms and a prototype software module to generate UDC are designed. ...

Added: July 30, 2017

Разработка интеллектуального голосового ассистента и исследование обучающей способности алгоритмов распознавания естественного языка

Polyakov E. V., Мажанов М. С., Качалова М. В. et al., Системный администратор 2017 № 12 С. 80–85

The development of cognitive technologies contributes to the effective introduction of Artificial Intelligence into the everyday life of a person. New interfaces for device-human interaction appear. Understanding the natural language of human is one of the most promising areas of the development of Artificial Intelligence. Voice assistants are a striking example of such systems, they ...

Added: December 10, 2017

Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings

Springer, 2021.

This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...

Added: July 10, 2021

Proceedings 2018 Global Smart Industry Conference (GloSIC)

Chelyabinsk: IEEE, 2018.

The 2018 Global Smart Industry Conference is organized in order to exchange experience, promote discussion and presentation of research papers, and summarize results in development of innovative models, methods and technologies for the digital industry in universities, scientific and industrial associations of the Russian Federation as well as in foreign companies, and the experience of ...

Added: November 25, 2019

Методы борьбы с омонимией

Рысаков С. В., Системный администратор 2015 № 10(155) С. 92–95

The article provides a review of modern methods of morphological ambiguity resolution. We considered such methods as statistical disambiguation, Brill’s automatically generated rules, decision trees and their modifications. For the comparison, the article provides numerical results obtained on two open corpora: OpenCorpora and SynTagRus. ...

Added: November 25, 2015

Математические основы машинного обучения и прогнозирования

V'yugin V., М.: МЦНМО, 2013.

Книга предназначена для первоначлаьного знакомства с математическими основами современной теории машинного обучения (Machine Learning) и теории игр на предсказания. В первой части излагаются основы статистической теории машинного обучения, рассматриваются задачи классификации и регрессии с опорными векторами, теория обобщения и алгоритмы построения разделяющих гиперплоскостей. Во второй и третьей частях рассматриваются задачи адаптивного прогнозирования в нестохастических теоретико-игровой ...

Added: July 9, 2014

Оптимизация содержания седиментов в процессе гидрокрекинга гудрона с использованием методов машинного обучения

Нужный А. С., Однолько И. С., Глухов А. Ю. et al., Прикладная математика и вопросы управления 2021 № 1 С. 7–22

The paper proposes a mathematical model to optimize the operation of the tar hydrocracking unit. The purpose of modeling is to improve the economic effect of product output by selecting optimal parameters, such as hydrogen flow rate and reactor temperature. Hot Filtered Precipitation (HFT) is used as a target. The model involves the search for the minimum value ...

Added: April 11, 2021

Universality classes and machine learning

Chertenkov V., Shchur L., Journal of Physics: Conference Series 2021 Vol. 1740 P. 1–5

We formulate the problem of the universality class investigation using machine learning. We chose an example of the universality class of the two-dimensional 4-state Potts model. There are four known models within the universality class – the 4-state Potts model, the Baxter-Wu model, the Ashkin-Teller model, and the Turban model. All four of them together ...

Added: February 19, 2021

Machine learning application for support for automated control systems users

Хромов С. К., Кулагин М. А., Sidorenko V., Journal of Physics: Conference Series 2020 No. 1680 (1) Article 012019

The article presents the results of the analysis of determining the possibility of Machine Learning (ML) using for solving the problems of incident classification of users on the example of enterprise resource planning (ERP) systems of JSC Russian Railways and choosing a rational method for solving this problem. The presented problem is a special case ...

Added: April 16, 2021

9th Russian Summer School in Information Retrieval (RuSSIR 2015)

Braslavski P. undefined., Markov I., Pardalos P. M. et al., ACM SIGIR Forum 2016 Vol. 49 No. 2 P. 72–79

This paper provides the reader with a report on 9th Russian Summer School in Information Retrieval (RuSSIR 2015). ...

Added: February 27, 2017

Proceedings of IEEE International Russian Automation Conference (RusAutoCon 2020)

IEEE, 2020.

Added: October 3, 2020

Использование метода главных компонент для анализа надежности цепей поставок

Kuznetsov V. O., Логистика и управление цепями поставок 2018 № 4 (87) С. 27–33

One of the options for a more flexible approach to analyzing the reliability of supply chains is the principal component analysis (PCA). With a large number of variables describing supply chain, it is a difficult task to analyze the structure of variables in two-dimensional space. Within the analysis of the variables dependencies PCA allows to ...

Added: November 29, 2018

Faster variational inducing input Gaussian process classification

Izmailov P., Kropotov D., Journal of machine learning and data analysis 2017 Vol. 3 No. 1 P. 20–35

Background: Gaussian processes (GP) provide an elegant and effective approach to learning in kernel machines. This approach leads to a highly interpretable model and allows using the Bayesian framework for model adaptation and incorporating the prior knowledge about the problem. The GP framework is successfully applied to regression, classification, and dimensionality reduction problems. Unfortunately, the ...

Added: December 6, 2018

Analysis of Images, Social Networks and Texts Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014, Revised Selected Papers

Berlin: Springer, 2014.

This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...

Added: November 13, 2014

2019 International Russian Automation Conference (RusAutoCon)

IEEE, 2019.

Added: October 21, 2019

Texterra: инфраструктура для анализа текстов

Денис Турдаков, Астраханцев Н. А., Недумов Я. Р. et al., Труды Института системного программирования РАН 2014 Т. 26 С. 421–438

he paper presents a framework for fast text analytics developed during the Texterra project. Texterra is a technology for multilingual text mining based on novel text processing methods that exploit knowledge extracted from user-generated content. It delivers a fast scalable solution for text mining without the expensive customization. Depending on use-cases Texterra could be utilized ...

Added: November 6, 2017

Machine-Learning for electro-magnetic showers reconstruction in emulsion cloud chambers

V.Belavin, A.Filatov, A.Ustyuzhanin et al., Journal of Physics: Conference Series 2018 Vol. 1085 No. 4 P. 042025-1–042025-6

Traces of electro-magnetic showers in the neutrino experiments may be considered as signals of dark-matter particles. For example, SHiP experiment is going to use emulsion film detectors similar to the ones designed for OPERA experiment from dark matter search. The goal of this research is to develop an algorithm that can identify traces of electro-magnetic ...

Added: December 8, 2017

Breaking Sticks and Ambiguities with Adaptive Skip-gram

Bartunov S., Кондрашкин Д. А., Osokin A. et al., / Series arXiv:1502.07257 "Computation and language". 2015.

Recently proposed Skip-gram model is a powerful method for learning high-dimensional word representations that capture rich semantic relationships between words. However, Skip-gram as well as most prior work on learning word representations does not take into account word ambiguity and maintain only single representation per word. Although a number of Skip-gram modifications were proposed to ...

Added: November 5, 2015

Использование искусственного интеллекта в вопросах выявления и противодействия коррупции: обзор международного опыта

Krylova D., Maksimenko A., Государственное управление. Электронный вестник 2021 № 84 С. 241–255

In this article, the authors, using the example of several foreign publications, analyze the trends in the use of artificial intelligence and machine learning in discernment of corruption. Based on the international review, the authors make the conclusion that the mechanisms for detecting corruption, based on the use of artificial intelligence, described in foreign sources, ...

Added: February 25, 2021