Reverse kl-divergence training of prior networks: Improved uncertainty and adversarial robustness

?

Reverse kl-divergence training of prior networks: Improved uncertainty and adversarial robustness

Cornell University , 2019.

Малинин А. А., Gales M.

Ensemble approaches for uncertainty estimation have recently been applied to the tasks of misclassification detection, out-of-distribution input detection and adversarial attack detection. Prior Networks have been proposed as an approach to efficiently \emph{emulate} an ensemble of models for classification by parameterising a Dirichlet prior distribution over output distributions. These models have been shown to outperform alternative ensemble approaches, such as Monte-Carlo Dropout, on the task of out-of-distribution input detection. However, scaling Prior Networks to complex datasets with many classes is difficult using the training criteria originally proposed. This paper makes two contributions. First, we show that the appropriate training criterion for Prior Networks is the \emph{reverse} KL-divergence between Dirichlet distributions. This addresses issues in the nature of the training data target distributions, enabling prior networks to be successfully trained on classification tasks with arbitrarily many classes, as well as improving out-of-distribution detection performance. Second, taking advantage of this new training criterion, this paper investigates using Prior Networks to detect adversarial attacks and proposes a generalized form of adversarial training. It is shown that the construction of successful \emph{adaptive} whitebox attacks, which affect the prediction and evade detection, against Prior Networks trained on CIFAR-10 and CIFAR-100 using the proposed approach requires a greater amount of computational effort than against networks defended using standard adversarial training or MC-dropout.

Language: English

Text on another site

Keywords: machine learning

Proceedings of the 2023 CLASP Conference on Learning with Small Data (LSD)

Gothenburg : Association for Computational Linguistics, 2023

Current deep learning systems require large amounts of data in order to yield optimal results. Despite ever-increasing model and data size, these systems have achieved remarkable success across a wide range of tasks in NLP, and AI in general. However, these systems possess a number of limitations. Firstly, the models require a significant amount of ...

Added: December 5, 2023

Modeling global real economic activity: Evidence from variable selection across quantiles

Stolbov M., Shchepeleva M., Journal of Economic Asymmetries 2022 Vol. 25 Article e00238

We conduct an open search of predictors of global real economic activity. To this end, we apply a predictive quantile regression framework, using four alternative proxies of global real economic activity during February 1997–August 2019 and building on a combination of machine learning algorithms to identify their predictors out of 23 candidate explanatory variables. The contemporaneous level ...

Added: February 10, 2022

ПРОЕКТНОЕ ПРЕДЛОЖЕНИЕ: АВТОМАТИЗИРОВАННЫЙ ПОДХОД К РЕКОМЕНДАТЕЛЬНЫМ СИСТЕМАМ

Сендерович М. А., В кн. : Межвузовская научно-техническая конференция студентов, аспирантов и молодых специалистов им. Е.В. Арменского. : М. : МИЭМ НИУ ВШЭ, 2019. С. 223-224.

Данная работа посвящена актуальной теме автоматизации в машинном обучении на примере создания универсальной рекомендательной системы. В работе исследуются различные типы рекомендательных систем, акцент делается на подходы коллаборативной фильтрации. Изучаются методы автоматизации машинного обучения, на основе которых будет разработана данная рекомендательная система. ...

Added: October 31, 2020

МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ

Buzmakov A. V., В кн. : МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ. : СПб. : Федеральное государственное автономное образовательное учреждение высшего образования "Санкт-Петербургский политехнический университет Петра Великого", 2020. С. 284-333.

In many practical tasks it is needed to estimate an effect of treatment on individual level. For example, in medicine it is essential to determine the patients that would benefit from a certain medicament. In marketing, knowing the persons that are likely to buy a new product would reduce the amount of spam. In this ...

Added: December 7, 2021

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at ECAI 2014)

Prague : CEUR Workshop Proceedings, 2014

The first and the second edition of the FCA4AI Workshop showed that many researchers working in Artificial Intelligence are indeed interested by a well-founded method for classi- fication and mining such as Formal Concept Analysis (see http://www.fca4ai.hse.ru/). The first edition of FCA4AI was co-located with ECAI 2012 in Montpellier and published as http://ceur-ws.org/Vol-939/ while the ...

Added: September 12, 2014

Human knowledge models: Learning applied knowledge from the data

Dudyrev E., Semenkov Ilia, Kuznetsov S. et al., Plos One 2022 Vol. 17 No. 10 Article e0275814

Artificial intelligence and machine learning have demonstrated remarkable results in science and applied work. However, present AI models, developed to be run on computers but used in human-driven applications, create a visible disconnect between AI forms of processing and human ways of discovering and using knowledge. In this work, we introduce a new concept of ...

Added: October 29, 2022

Application of the Method of Multivariate Multi-stage Forecasting Based on the LSTM Deep Learning Model for Bitcoin Price Time Series

Natalia Sizykh, Said Dandamaev, Dmitry Sizykh, , in : 16th International Conference Management of large-scale system development (MLSD). : IEEE, 2023. P. 1-5.

Forecasting data and research on cryptocurrency price forecasting methods are increasing in importance. So far, methods based on LSTM deep learning architecture have shown the best results in forecasting cryptocurrency prices. In order to improve the accuracy of forecasting data, this paper investigates the application of a multivariate multistep forecasting method based on the LSTM ...

Added: December 22, 2023

Belief Functions: Theory and Applications

Dordrecht, L., Heidelberg, NY : Springer, 2014

This book constitutes the thoroughly refereed proceedings of the Third International Conference on Belief Functions, BELIEF 2014, held in Oxford, UK, in September 2014. The 47 revised full papers presented in this book were carefully selected and reviewed from 56 submissions. The papers are organized in topical sections on belief combination; machine learning; applications; theory; ...

Added: October 1, 2014

Style transfer in NLP: a framework and multilingual analysis with Friends TV series

Tikhonova M., Elina Telesheva, Mirzoev S. et al., , in : 2021 International Conference Engineering and Telecommunication (En&T). : IEEE, 2022. P. 1-6.

Style transfer is an important and a rapidly developing of Natural Language Processing. This days more and more methods and models are proposed which allow us to generate text in predefined style. In this paper we propose a framework for style transfer of “Friends” TV series. The trained models are able to mimic one of ...

Added: May 21, 2022

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track

PMLR, 2022

Added: July 27, 2022

Hidden Feedback Loops in Machine Learning Systems: A Simulation Model and Preliminary Results

Anton Khritankov, , in : Software Quality: Future Perspectives on Software Engineering Quality: 13th International Conference, SWQD 2021, Vienna, Austria, January 19–21, 2021, Proceedings. : Springer, 2021. P. 54-65.

In this concept paper, we explore some of the aspects of quality of continuous learning artificial intelligence systems as they interact with and influence their environment. We study an important problem of implicit feedback loops that occurs in recommendation systems, web bulletins and price estimation systems. We demonstrate how feedback loops intervene with user behavior ...

Added: September 23, 2021

Learning and Intelligent Optimization. 8th International Conference, Lion 8, Gainesville, FL, USA, February 16-21, 2014. Revised Selected Papers

Springer, 2014

This book constitutes the thoroughly refereed post-conference proceedings of the 8th International Conference on Learning and Optimization, LION 8, which was held in Gainesville, FL, USA, in February 2014. The 33 contributions presented were carefully reviewed and selected for inclusion in this book. A large variety of topics are covered, such as algorithm configuration; multiobjective ...

Added: September 15, 2014

SimVec: predicting polypharmacy side effects for new drugs

Lukashina N., Kartysheva E., Spjuth O. et al., Journal of Cheminformatics 2022 Vol. 14 Article 49

Polypharmacy refers to the administration of multiple drugs on a daily basis. It has demonstrated effectiveness in treating many complex diseases , but it has a higher risk of adverse drug reactions. Hence, the prediction of polypharmacy side effects is an essential step in drug testing, especially for new drugs. This paper shows that the ...

Added: October 11, 2022

Использование метода главных компонент для анализа надежности цепей поставок

Kuznetsov V. O., Логистика и управление цепями поставок 2018 № 4 (87) С. 27-33

One of the options for a more flexible approach to analyzing the reliability of supply chains is the principal component analysis (PCA). With a large number of variables describing supply chain, it is a difficult task to analyze the structure of variables in two-dimensional space. Within the analysis of the variables dependencies PCA allows to ...

Added: November 29, 2018

Proceedings of the Fifth International Workshop on Experimental Economics and Machine Learning (EEML 2019),Perm, Russia, September 26, 2019

CEUR Workshop Proceedings, 2019

Proceedings of the Fifth Workshop on Experimental Economics and Machine Learning at the National Research Univeristy Higher School of Economics co-located with the Seventh International Conference on Applied Research in Economics (iCare7) ...

Added: October 23, 2019

Исследовательский проект как инструмент обучения методам анализа текста: предсказание класса поста в социальной сети

Suvorova A., Смирнова К. Р., Будин Е. А. et al., Компьютерные инструменты в образовании 2018 № 3 С. 49-64

The article describes a student research project on predicting the class of a post on a social network based on its textual content. The features of the project are discussed as an integral part of the trajectory of teaching data analysis methods, including text analysis methods and tools that are often not included in machine ...

Added: January 28, 2019

WIMS 2020: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics

Association for Computing Machinery (ACM), 2020

On behalf of the conference chairs, we welcome you to the 10th International Conference on Web Intelligence, Mining and Semantics (WIMS'20) hosted by LIUPPA Lab of the University de Pau & Pays de l'Adour, Biarritz-France. The 1st WIMS conference was organized in Sogndal Norway. Since then, it has always been published by ACM. After 10 ...

Added: August 28, 2020

Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics

Кузнецов А. С., Shvechikov P., Grishin A. et al., , in : International Conference on Machine Learning (ICML 2020). Vol. 119.: PMLR, 2020. P. 5556-5566.

Added: October 17, 2020

Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings

Springer, 2021

This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...

Added: July 10, 2021

Predictive Analytics Approach for Steel Billets Quality Control System

Belov A. V., Ekaterina A. Melekhova, Vorontsova T., , in : 2022 International Conference on Quality Management, Transport and Information Security, Information Technologies (IT&QM&IS). : St. Petersburg : IEEE, 2022. P. 219-223.

The paper deals with the problem of improving the quality of metal products. Nowadays destructive methods of quality control of the steel billets prevail at metallurgical enterprises. This approach to assessing the quality of the steel billets is wasteful, which increases its cost. One of the ways to reduce the cost of production of metal ...

Added: January 28, 2023

Proceedings of the international conference on Uncertainty in Artificial Intelligence (UAI 2018)

[б.и.], 2018

Proceedings of the international conference on Uncertainty in Artificial Intelligence (UAI 2018) ...

Added: October 29, 2018

Application of Artificial Intelligence Methods for Improvement of Strategic Decision-Making in Logistics

Kitzmann H., Strimovskaya A., Serova E., , in : Transfer, Diffusion and Adoption of Next-Generation Digital Technologies. IFIP WG 8.6 International Working Conference on Transfer and Diffusion of IT, TDIT 2023 Nagpur, India, December 15–16, 2023 Proceedings, Part II. Vol. 698.: Springer, 2024. P. 132-143.

Highly evolving economic environment requires from logistics companies fast response and agile solutions. Recently development of digital technologies gives significant advantages to logistics business. Hence many optimized processes belong to operational management level. At the same time the importance of digital technologies adoption to strategic management level should not be underestimated, as it allows gaining competitive advantages alongside the supply chain. ...

Added: January 12, 2024

Search for CP violation through an amplitude analysis of D0 → K+K−π+π− decays

Derkach D., Hushchyn M., Kazeev N. et al., Journal of High Energy Physics 2019 Vol. 2019 No. 2 P. 1-33

A search for CP violation in the Cabibbo-suppressed D0 → K+K−π+π− decay mode is performed using an amplitude analysis. The measurement uses a sample of pp collisions recorded by the LHCb experiment during 2011 and 2012, corresponding to an integrated luminosity of 3.0 fb−1. The D0 mesons are reconstructed from semileptonic b-hadron decays into D0μ−X final states. The selected sample contains more than 160 000 signal decays, allowing ...

Added: March 17, 2019