Entropic gradient descent algorithms and wide flat minima*
Publication:5020063
DOI: 10.1088/1742-5468/ac3ae8 | OpenAlex: W4205795626 | MaRDI QID: Q5020063
Christoph Feinauer, Gabriele Perugini, Elizaveta Demyanenko, Riccardo Zecchina, Carlo Baldassi, Carlo Lucibello, Fabrizio Pittorino
Publication date: 3 January 2022
Published in: Journal of Statistical Mechanics: Theory and Experiment
Full work available at URL: https://arxiv.org/abs/2006.07897
Cites Work
- Deep relaxation: partial differential equations for optimizing deep neural networks
- Flat Minima
- Local entropy as a measure for sampling solutions in constraint satisfaction problems
- Information, Physics, and Computation
- Entropy-SGD: biasing gradient descent into wide valleys
- Shaping the learning landscape in neural networks around wide flat minima