A Stochastic Newton Algorithm for Distributed Convex Optimization

Bullins, Brian; Patel, Kshitij; Shamir, Ohad; Srebro, Nathan; Woodworth, Blake E.

Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

Abstract

We propose and analyze a stochastic Newton algorithm for homogeneous distributed stochastic convex optimization, where each machine can calculate stochastic gradients of the same population objective, as well as stochastic Hessian-vector products (products of an independent unbiased estimator of the Hessian of the population objective with arbitrary vectors), with many such stochastic computations performed between rounds of communication. We show that our method can reduce the number and frequency of required communication rounds compared to existing methods, without hurting performance. We support this claim by proving convergence guarantees for quasi-self-concordant objectives (e.g., logistic regression) and by providing empirical evidence.
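To make the two oracles named in the abstract concrete, the following is a minimal sketch of a stochastic gradient and a stochastic Hessian-vector product for logistic regression, the example objective the abstract mentions. It is an illustration only, not the paper's algorithm: the per-sample loss log(1 + exp(-y <w, x>)) with labels y in {-1, +1}, and the function names `stochastic_gradient` and `stochastic_hvp`, are assumptions introduced here. In the setting the abstract describes, the sample used for the Hessian-vector product would be drawn independently of the one used for the gradient.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def dot(a, b):
    # Inner product of two equal-length lists.
    return sum(ai * bi for ai, bi in zip(a, b))


def stochastic_gradient(w, x, y):
    # Gradient at w of the assumed per-sample loss log(1 + exp(-y * <w, x>)),
    # with y in {-1, +1}: equals -y * sigmoid(-y * <w, x>) * x.
    coeff = -y * sigmoid(-y * dot(w, x))
    return [coeff * xi for xi in x]


def stochastic_hvp(w, x, v):
    # Product of the per-sample Hessian with v. For this loss the Hessian is
    # s * (1 - s) * x x^T with s = sigmoid(<w, x>), so the product is
    # s * (1 - s) * <x, v> * x. The d-by-d matrix is never formed.
    s = sigmoid(dot(w, x))
    scale = s * (1.0 - s) * dot(x, v)
    return [scale * xi for xi in x]
```

Each call costs time linear in the dimension, which is why a machine can afford many such computations between communication rounds. For this loss the per-sample Hessian does not depend on the label, because y squared equals 1.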
