The effective noise of stochastic gradient descent
From MaRDI portal
Publication:5043083
Recommendations
- Stochastic gradient descent with noise of machine learning type. I: Discrete time analysis
- On the diffusion approximation of nonconvex stochastic gradient descent
- Stochastic gradient descent with noise of machine learning type. II: Continuous time analysis
- Stochastic gradient descent: where optimization meets machine learning
- Phase diagram of stochastic gradient descent in high-dimensional two-layer neural networks
Cites work
- scientific article; zbMATH DE number 1273988
- scientific article; zbMATH DE number 1569102
- A mean field view of the landscape of two-layer neural networks
- On the diffusion approximation of nonconvex stochastic gradient descent
- Out-of-equilibrium dynamical mean-field equations for the perceptron model
- Ridge Regression: Biased Estimation for Nonorthogonal Problems
- Robustness and regularization of support vector machines
- The effective temperature
- The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima
- Theory of Simple Glasses
Cited in (5)
- Statistical physics of learning in high-dimensional chaotic systems
- Self-consistent dynamical field theory of kernel evolution in wide neural networks
- On weak ergodicity breaking in mean-field spin glasses
- Dynamical mean field theory for models of confluent tissues and beyond
- Stochastic gradient descent with noise of machine learning type. II: Continuous time analysis