On optimal probabilities in stochastic coordinate descent methods
Abstract: We propose and analyze `NSync, a new parallel coordinate descent method in which, at each iteration, a random subset of coordinates is updated in parallel, with the subsets allowed to be chosen non-uniformly. We derive convergence rates under a strong convexity assumption and comment on how to assign probabilities to the sets so as to optimize the bound. In both complexity and practical performance, the method can outperform its uniform variant by an order of magnitude. Surprisingly, the strategy of updating a single randomly selected coordinate per iteration, with optimal probabilities, may require fewer iterations, both in theory and in practice, than the strategy of updating all coordinates at every iteration.
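To make the non-uniform sampling idea concrete, here is a minimal, hypothetical Python sketch (not the paper's `NSync method itself, which updates arbitrary random subsets in parallel): serial coordinate descent on a strongly convex quadratic, where coordinate i is sampled with probability proportional to its coordinate Lipschitz constant L_i = A_ii, a standard non-uniform choice in this setting. The objective, variable names, and constants are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Illustrative sketch only: serial non-uniformly sampled coordinate descent.
# Objective: f(x) = 0.5 * x^T A x - b^T x with A symmetric positive definite,
# so the i-th partial derivative is (A x - b)_i and L_i = A[i, i].

rng = np.random.default_rng(0)

n = 50
M = rng.standard_normal((n, n))
A = M @ M.T + n * np.eye(n)   # symmetric positive definite (strongly convex f)
b = rng.standard_normal(n)
L = np.diag(A).copy()         # coordinate Lipschitz constants

# Non-uniform probabilities p_i proportional to L_i (illustrative choice;
# the paper discusses how to optimize such probabilities).
p = L / L.sum()

x = np.zeros(n)
r = A @ x - b                 # maintain the gradient A x - b incrementally
for _ in range(20000):
    i = rng.choice(n, p=p)    # sample one coordinate non-uniformly
    step = r[i] / L[i]        # exact coordinate step: (1 / L_i) * partial derivative
    x[i] -= step
    r -= step * A[:, i]       # gradient update after changing x_i

print("residual norm:", np.linalg.norm(A @ x - b))
```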
Recommendations
- Semi-stochastic coordinate descent
- Optimal survey schemes for stochastic gradient descent with applications to \(M\)-estimation
- Random Coordinate Descent Methods for \(\ell_{0}\) Regularized Convex Optimization
- Stochastic primal-dual coordinate method for regularized empirical risk minimization
- Probabilistic optimization via approximate \(p\)-efficient points and bundle methods
- Stochastic proximal gradient methods for nonconvex problems in Hilbert spaces
- Stochastic (Approximate) Proximal Point Methods: Convergence, Optimality, and Adaptivity
- A Stochastic Approximation Framework for a Class of Randomized Optimization Algorithms
Cites work
- scientific article; zbMATH DE number 6253925
- Accelerated proximal stochastic dual coordinate ascent for regularized loss minimization
- Coordinate descent with arbitrary sampling. II: Expected separable overapproximation
- Distributed coordinate descent method for learning with big data
- Efficiency of coordinate descent methods on huge-scale optimization problems
- Efficient serial and parallel coordinate descent methods for huge-scale truss topology design
- Inexact coordinate descent: complexity and preconditioning
- Iteration complexity of randomized block-coordinate descent methods for minimizing a composite function
- On the complexity analysis of randomized block-coordinate descent methods
- Parallel coordinate descent methods for big data optimization
- Random Coordinate Descent Algorithms for Multi-Agent Convex Optimization Over Networks
- Randomized iterative methods for linear systems
- Separable approximations and decomposition methods for the augmented Lagrangian
- Smooth minimization of nonsmooth functions with parallel coordinate descent methods
- Stochastic block mirror descent methods for nonsmooth and stochastic optimization
- Stochastic dual coordinate ascent methods for regularized loss minimization
Cited in (28)
- High-performance statistical computing in the computing environments of the 2020s
- Fully asynchronous stochastic coordinate descent: a tight lower bound on the parallelism achieving linear speedup
- Coordinate descent with arbitrary sampling. I: Algorithms and complexity
- A randomized coordinate descent method with volume sampling
- Faster convergence of a randomized coordinate descent method for linearly constrained optimization problems
- Parallel random block-coordinate forward-backward algorithm: a unified convergence analysis
- Adaptive coordinate sampling for stochastic primal–dual optimization
- Efficient exponential tilting with applications
- Stochastic Primal-Dual Hybrid Gradient Algorithm with Arbitrary Sampling and Imaging Applications
- scientific article; zbMATH DE number 7370629
- On the complexity of parallel coordinate descent
- Proximal gradient methods with adaptive subspace sampling
- Fastest rates for stochastic mirror descent methods
- Accelerated, parallel, and proximal coordinate descent
- Mini-batch stochastic subgradient for functional constrained optimization
- Parallel stochastic asynchronous coordinate descent: tight bounds on the possible parallelism
- A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares)
- Batched Stochastic Gradient Descent with Weighted Sampling
- Stochastic average model methods
- A generic coordinate descent solver for non-smooth convex optimisation
- scientific article; zbMATH DE number 6982318
- Distributed block coordinate descent for minimizing partially separable functions
- Stochastic reformulations of linear systems: algorithms and convergence theory
- scientific article; zbMATH DE number 7370566
- Parallel coordinate descent methods for big data optimization
- Optimization in high dimensions via accelerated, parallel, and proximal coordinate descent
- Stochastic quasi-Fejér block-coordinate fixed point iterations with random sweeping. II: Mean-square and linear convergence
- Distributed Learning with Sparse Communications by Identification