Variance Networks: When Expectation Does Not Meet Your Expectations

K. O. Neklyudov; D. Molchanov; A. Ashukha; D. Vetrov

АБВ
АБВ
АБВ

Обычная версия сайта

Priority areas

by year

Subject

News

July 9, 2024

'I've Always Been Keen to Engage in Experiments and Operate Scientific Instruments'

During his early years at university, physicist Ivan Makhov worried that he might be dismissed, but today he is heading a study supported by a grant from the Russian Science Foundation. In this interview with the HSE Young Scientists project, he shares his work experience using a closed-loop cryostat, his dream of conversing with Einstein, and favourite location in his hometown of St Petersburg.

July 4, 2024

Reinforcement Learning Enhances Performance of Generative Flow Networks

Scientists at the AI Research Centre and the AI and Digital Science Institute of the HSE Faculty of Computer Science applied classical reinforcement learning algorithms to train generative flow networks (GFlowNets). This enabled significant performance improvements in GFlowNets, which have been employed for three years in tackling the most complex scientific challenges at modelling, hypothesis generation, and experimental design stages. The results of their work achieved a top 5% ranking among publications at the International Conference on Artificial Intelligence and Statistics AISTATS, held on May 2-4, 2024, in Valencia, Spain.

July 3, 2024

‘I Came Up with the Idea to Create an Application Useful for Practicing Physicians

Dmitry Ryabtsev, a 2024 graduate of the master's programme at the HSE Faculty of Computer Science, created an AI-powered software service for ophthalmology during his two years of study. This product is now entering the market, and its developer plans to participate in establishing a working group on software engineering for medical applications at the HSE Faculty of Computer Science, with the goal of promoting more genuinely useful domestic projects. In an interview with HSE News Service, Dr Ryabtsev shared his story of how a professional doctor turned into a programmer.

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications

?

Variance Networks: When Expectation Does Not Meet Your Expectations

P. 1-16.

Neklyudov K. O., Molchanov D., Ashukha A., Vetrov D.

Ordinary stochastic neural networks mostly rely on the expected values of their weights to make predictions, whereas the induced noise is mostly used to capture the uncertainty, prevent overfitting and slightly boost the performance through test-time averaging. In this paper, we introduce variance layers, a different kind of stochastic layers. Each weight of a variance layer follows a zero-mean distribution and is only parameterized by its variance. We show that such layers can learn surprisingly well, can serve as an efficient exploration tool in reinforcement learning tasks and provide a decent defense against adversarial attacks. We also show that a number of conventional Bayesian neural networks naturally converge to such zero-mean posteriors. We observe that in these cases such zero-mean parameterization leads to a much better training objective than conventional parameterizations where the mean is being learned.

Language: English

Full text

Text on another site

Keywords: bayesian neural networks

In book

Proceedings of the 7th International Conference on Learning Representations (ICLR 2019)

ICLR, 2019

Variational Dropout via Empirical Bayes

Kharitonov V., Molchanov D., Vetrov D., / Cornell University. Series arxiv.org "stat.ML". 2018.

We study the Automatic Relevance Determination procedure applied to deep neural networks. We show that ARD applied to Bayesian DNNs with Gaussian approximate posterior distributions leads to a variational bound similar to that of variational dropout, and in the case of a fixed dropout rate, objectives are exactly the same. Experimental results show that the ...

Added: November 27, 2018