Newton Method over Networks is Fast up to the Statistical Precision

Daneshmand A., Scutari G., Dvurechensky P., Gasnikov A.

Ch. 139. P. 2398-2409.

We propose a distributed cubic regularization of the Newton method for solving (constrained) empirical risk minimization problems over a network of agents, modeled as an undirected graph. The algorithm employs an inexact, preconditioned Newton step at each agent's side: the gradient of the centralized loss is iteratively estimated via a gradient-tracking consensus mechanism, and the Hessian is subsampled over the local data sets. No Hessian matrices are thus exchanged over the network. We derive global complexity bounds for convex and strongly convex losses. Our analysis reveals an interesting interplay between sample and iteration/communication complexity: statistically accurate solutions are achievable in roughly the same number of iterations as the centralized cubic Newton method, with a communication cost per iteration of the order of $\widetilde{\mathcal{O}}\big(1/\sqrt{1-\rho}\big)$, where $\rho$ characterizes the connectivity of the network. This represents a significant communication saving with respect to that of existing, statistically oblivious, distributed Newton-based methods over networks.
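The two ingredients named in the abstract, a cubic-regularized Newton step built from a local Hessian and a gradient-tracking consensus update, can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names, the bisection solver for the cubic subproblem, the particular ordering of the consensus and tracking updates, and the parameters `M`, `W`, and `iters` are all assumptions made for illustration. The sketch assumes positive semidefinite local Hessians and a doubly stochastic mixing matrix `W`.

```python
import numpy as np

def cubic_newton_step(g, H, M, tol=1e-10, max_iter=200):
    """Minimize <g,h> + 0.5*h^T H h + (M/6)*||h||^3 for PSD H and M > 0.

    The minimizer satisfies (H + (M/2)*r*I) h = -g with r = ||h||;
    r is found by bisection (an illustrative choice of solver).
    """
    n = g.shape[0]
    if np.linalg.norm(g) == 0.0:
        return np.zeros(n)
    I = np.eye(n)

    def h_of(r):
        return -np.linalg.solve(H + 0.5 * M * r * I, g)

    lo, hi = 0.0, 1.0
    while np.linalg.norm(h_of(hi)) > hi:   # grow until the root is bracketed
        hi *= 2.0
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if np.linalg.norm(h_of(mid)) > mid:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return h_of(hi)

def gradient_tracking_cubic_newton(grads, hessians, W, x0, M, iters):
    """Hypothetical simplified loop: local cubic step, then consensus.

    grads[i], hessians[i]: local gradient/Hessian oracles of agent i.
    W: doubly stochastic mixing matrix of the network (assumed given).
    Only vectors (iterates and gradient trackers) are mixed; Hessians
    stay local.
    """
    m = len(grads)
    x = np.tile(np.asarray(x0, dtype=float), (m, 1))
    g_old = np.array([grads[i](x[i]) for i in range(m)])
    y = g_old.copy()                       # tracker of the average gradient
    for _ in range(iters):
        # Each agent takes a cubic Newton step using its tracker y[i]
        # in place of the (unavailable) centralized gradient.
        x_half = np.array([
            x[i] + cubic_newton_step(y[i], hessians[i](x[i]), M)
            for i in range(m)
        ])
        x = W @ x_half                     # consensus on the iterates
        g_new = np.array([grads[i](x[i]) for i in range(m)])
        y = W @ y + g_new - g_old          # gradient-tracking update
        g_old = g_new
    return x, y
```

With a doubly stochastic `W`, the tracking update keeps the average of the trackers `y[i]` equal to the average of the current local gradients, which is what lets each agent approximate the centralized gradient without exchanging data or Hessians.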

Language: English

Keywords: Newton method

Publication based on the results of:

Uncertainty quantification in machine learning algorithms (2021)

In book

Proceedings of the 38th International Conference on Machine Learning (ICML 2021)

Vol. 139. PMLR, 2021

Randomized Block Cubic Newton Method

Doikov Nikita, Richtarik P., Proceedings of Machine Learning Research 2018 No. 80 P. 1290-1298

We study the problem of minimizing the sum of three convex functions: a differentiable, twice-differentiable and a non-smooth term in a high dimensional setting. To this effect we propose and analyze a randomized block cubic Newton (RBCN) method, which in each iteration builds a model of the objective function formed as the sum of the ...

Added: October 31, 2018

A Superlinearly-Convergent Proximal Newton-Type Method for the Optimization of Finite Sums

Rodomanov A., Kropotov D., Journal of Machine Learning Research 2016 Vol. 48 P. 2597-2605

We consider the problem of optimizing the strongly convex sum of a finite number of convex functions. Standard algorithms for solving this problem in the class of incremental/stochastic methods have at most a linear convergence rate. We propose a new incremental method whose convergence rate is superlinear – the Newton-type incremental method (NIM). The idea ...

Added: March 11, 2017

Generation of integral rating by statistical processing of the test results

Kibzun A. I., Panarin S. I., Avtomatika i Telemekhanika 2012 No. 6 P. 119-139

The problem of building the rating of a remote training system by processing the results of a run of tests was considered. The Rasch model extended to a run of tests was used. A recurrent algorithm based on the maximum-likelihood procedure and the Newton method was proposed to calculate the rating. ...

Added: December 5, 2013

On the description of parabolic Newton maps

Mamayusupov K. / Cornell University. Series arXiv "math". 2019.

A description of rational Newton maps in terms of the partial fraction decomposition of rational functions is obtained. Dynamics on parabolic immediate basins for rational Newton maps of entire functions have been studied. It is proved that every parabolic immediate basin contains invariant accesses to the parabolic fixed point at infinity. Moreover, among these accesses ...

Added: February 4, 2019

Improving the effectiveness of training students of aerospace specialties by means of a specialized rating

Panarin S. I., Trudy MAI 2011 No. 44 P. 5-25

In the aerospace industry, one of the main issues is the education of qualified specialists. During the learning process, positive incentives improve the effectiveness of education. One such incentive is a rating system. In this work, the construction and evaluation of a specialized rating system is considered, with examples on the ...

Added: December 5, 2013

Generation of integral rating by statistical processing of the test results

Kibzun A. I., Panarin S. I., Automation and Remote Control 2012 Vol. 73 No. 6 P. 1029-1045

Added: December 5, 2013

Newton maps of complex exponential functions and parabolic surgery

Mamayusupov K., Fundamenta Mathematicae 2018 Vol. 241 No. 3 P. 265-290

The paper deals with Newton maps of complex exponential functions and a surgery tool developed by P. Haissinsky. The concept of "postcritically minimal" Newton maps of complex exponential functions is introduced, analogous to postcritically finite Newton maps of polynomials. A dynamics-preserving mapping is constructed between the space of postcritically finite Newton maps of polynomials ...

Added: January 11, 2018