Dynamics of stochastic gradient descent for two-layer neural networks in the teacher–student setup*
DOI: 10.1088/1742-5468/abc61e
OpenAlex: W3113714439
MaRDI QID: Q5857458
Sebastian Goldt, Madhu S. Advani, Andrew M. Saxe, Florent Krzakala, Lenka Zdeborová
Publication date: 1 April 2021
Published in: Journal of Statistical Mechanics: Theory and Experiment
Full work available at URL: https://arxiv.org/abs/1906.08632
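The record itself is bibliographic, but the title names a concrete setting: online stochastic gradient descent for a two-layer ("soft committee machine") student network trained on inputs labelled by a fixed teacher network. As a rough orientation, the following minimal NumPy sketch simulates that setting; the sizes N, M, K, the tanh activation, the 1/N learning-rate scaling, and the frozen second layer are illustrative assumptions, not the authors' exact protocol.

```python
import numpy as np

rng = np.random.default_rng(0)

N, M, K = 500, 2, 2      # input dimension, teacher and student hidden units (illustrative)
lr = 0.5 / N             # learning rate scaled as 1/N, the usual scaling in online analyses
steps = 20_000

g = np.tanh              # illustrative activation; the paper treats e.g. erf-type networks

def g_prime(x):
    return 1.0 - np.tanh(x) ** 2

# Teacher: a fixed random two-layer network that generates the labels.
W_teacher = rng.standard_normal((M, N))
v_teacher = np.ones(M)

# Student: same architecture, small random initialisation; second layer kept fixed here.
W = rng.standard_normal((K, N)) * 0.1
v = np.ones(K)

for t in range(steps):
    x = rng.standard_normal(N)                      # fresh Gaussian sample: one-pass SGD
    y = v_teacher @ g(W_teacher @ x / np.sqrt(N))   # teacher label
    pre = W @ x / np.sqrt(N)                        # student pre-activations
    err = v @ g(pre) - y                            # prediction error
    # Gradient step on the squared loss 0.5 * err**2 w.r.t. the first-layer weights
    W -= lr * err * np.outer(v * g_prime(pre), x) / np.sqrt(N)

# Generalisation error estimated on fresh test inputs
X_test = rng.standard_normal((1000, N))
y_teacher = g(X_test @ W_teacher.T / np.sqrt(N)) @ v_teacher
y_student = g(X_test @ W.T / np.sqrt(N)) @ v
print("test MSE:", 0.5 * np.mean((y_student - y_teacher) ** 2))
```

In the high-dimensional limit studied in the paper (N large with M, K fixed), such simulations are summarised by closed equations of motion for order parameters such as the teacher–student overlaps, from which the generalisation error follows.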
Related Items
- Align, then memorise: the dynamics of learning with feedback alignment*
- Towards interpreting deep neural networks via layer behavior understanding
- Free dynamics of feature learning processes
- High-dimensional limit theorems for SGD: Effective dynamics and critical scaling
- Symmetry & critical points for a model shallow neural network
Cites Work
- Gradient descent optimizes over-parameterized deep ReLU networks
- Mean field analysis of neural networks: a central limit theorem
- Statistical Mechanics of Learning
- Generalization in a linear perceptron in the presence of noise
- On-line backpropagation in two-layered neural networks
- DOI: 10.1162/153244303321897690
- Learning by on-line gradient descent
- A mean field view of the landscape of two-layer neural networks