First-order methods almost always avoid strict saddle points

From MaRDI portal
Publication:2425175




Abstract: We establish that first-order methods avoid saddle points for almost all initializations. Our results apply to a wide variety of first-order methods, including gradient descent, block coordinate descent, mirror descent and variants thereof. The connecting thread is that such algorithms can be studied from a dynamical systems perspective in which appropriate instantiations of the Stable Manifold Theorem allow for a global stability analysis. Thus, neither access to second-order derivative information nor randomness beyond initialization is necessary to provably avoid saddle points.



Cites work


Cited in
(42)


Describes a project that uses

Uses Software





This page was built for publication: First-order methods almost always avoid strict saddle points

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2425175)