One-dimensional system arising in stochastic gradient descent
From MaRDI portal
Publication:5022277
Abstract: We consider SDEs of the form , where behaves comparably to in a neighborhood of the origin, for . We show that there exists a threshold value for , depending on , such that when then , and for the rest of the permissible values . The previous results extend for discrete processes that satisfy . Here, are martingale differences that are a.s. bounded. This result shows that for a function , whose second derivative at degenerate saddle points is of polynomial order, it is always possible to escape saddle points via the iteration for a suitable choice of .
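As a rough illustration of the kind of discrete process the abstract describes, the sketch below simulates a one-dimensional iteration with a polynomially decaying step size and bounded, mean-zero noise. Everything specific here is an assumption made for illustration and is not taken from the publication: the update `x += (|x|**k + noise) / n**gamma`, the uniform noise on [-1, 1], the escape radius, and the name `escape_step`.

```python
import random


def escape_step(gamma, k=2.0, radius=1.0, max_steps=100_000, seed=0):
    """Return the first step n at which |x| exceeds `radius`, or None.

    Assumed (hypothetical) update, starting from x = 0:
        x_{n+1} = x_n + (|x_n|**k + y_n) / n**gamma
    where y_n is bounded, mean-zero noise.
    """
    rng = random.Random(seed)  # seeded so that runs are reproducible
    x = 0.0
    for n in range(1, max_steps + 1):
        noise = rng.uniform(-1.0, 1.0)  # bounded martingale-difference stand-in
        x += (abs(x) ** k + noise) / n ** gamma
        if abs(x) > radius:
            return n  # left the neighborhood of the origin
    return None  # no escape observed within max_steps


if __name__ == "__main__":
    # Compare escape behaviour for a few step-size exponents.
    for gamma in (0.6, 0.8, 0.95):
        steps = [escape_step(gamma, seed=s, max_steps=10_000) for s in range(20)]
        escaped = sum(s is not None for s in steps)
        print(f"gamma={gamma}: escaped in {escaped}/20 runs")
```

Such a finite simulation can only suggest how the step-size exponent influences escape from a neighborhood of the origin; it does not establish the almost-sure statements that the abstract refers to.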
Recommendations
- On the fast convergence of random perturbations of the gradient flow
- On the diffusion approximation of nonconvex stochastic gradient descent
- Analysis of stochastic gradient descent in continuous time
- Stochastic gradient descent with noise of machine learning type. I: Discrete time analysis
- Publication:4862190
Cites work
- scientific article; zbMATH DE number 5819433
- scientific article; zbMATH DE number 5957196
- scientific article; zbMATH DE number 4020069
- scientific article; zbMATH DE number 1972910
- A Stochastic Approximation Method
- A strong law for some generalized urn processes
- A survey of random processes with reinforcement
- Acceleration of Stochastic Approximation by Averaging
- An explicit bound for the Łojasiewicz exponent of real polynomials
- Complete Dictionary Recovery Over the Sphere I: Overview and the Geometric Picture
- Finding approximate local minima faster than gradient descent
- Guaranteed Matrix Completion via Non-Convex Factorization
- Nonconvergence to unstable points in urn models and stochastic approximations
- On the law of the iterated logarithm for martingales
- Recursive Stochastic Algorithms for Global Optimization in $\mathbb{R}^d$
- Statistical inference for model parameters in stochastic gradient descent
- When are Touchpoints Limits for Generalized Pólya Urns?