Convergence Rates Analysis of The Quadratic Penalty Method and Its Applications to Decentralized Distributed Optimization

From MaRDI portal
Publication:6294571

arXiv1711.10802MaRDI QIDQ6294571FDOQ6294571


Authors: Huan Li, Cong Fang, Zhouchen Lin Edit this on Wikidata


Publication date: 29 November 2017

Abstract: In this paper, we study a variant of the quadratic penalty method for linearly constrained convex problems, which has already been widely used but actually lacks theoretical justification. Namely, the penalty parameter steadily increases and the penalized objective function is minimized inexactly rather than exactly, e.g., with only one step of the proximal gradient descent. For such a variant of the quadratic penalty method, we give counterexamples to show that it may not give a solution to the original constrained problem. By choosing special penalty parameters, we ensure the convergence and further establish the convergence rates of Oleft(frac1sqrtKight) for the generally convex problems and Oleft(frac1Kight) for strongly convex ones, where K is the number of iterations. Furthermore, by adopting Nesterov's extrapolation we show that the convergence rates can be improved to Oleft(frac1Kight) for the generally convex problems and Oleft(frac1K2ight) for strongly convex ones. When applied to the decentralized distributed optimization, the penalty methods studied in this paper become the widely used distributed gradient method and the fast distributed gradient method. However, due to the totally different analysis framework, we can improve their Oleft(fraclogKsqrtKight) and Oleft(fraclogKKight) convergence rates to Oleft(frac1sqrtKight) and Oleft(frac1Kight) with fewer assumptions on the network topology for general convex problems. Using our analysis framework, we also extend the fast distributed gradient method to a communication efficient version, i.e., finding an varepsilon solution in Oleft(frac1varepsilonight) communications and Oleft(frac1varepsilon2+deltaight) computations for the non-smooth problems, where delta is a small constant.













This page was built for publication: Convergence Rates Analysis of The Quadratic Penalty Method and Its Applications to Decentralized Distributed Optimization

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6294571)