Gradient descent provably escapes saddle points in the training of shallow ReLU networks
Publication: 6655804
DOI: 10.1007/s10957-024-02513-3
MaRDI QID: Q6655804
Authors: Patrick Cheridito, Arnulf Jentzen, Florian Rossmannek
Publication date: 27 December 2024
Published in: Journal of Optimization Theory and Applications
Recommendations
- Gradient descent optimizes over-parameterized deep ReLU networks
- The global optimization geometry of shallow linear neural networks
- Landscape analysis for shallow neural networks: complete classification of critical points for affine target functions
- Non-differentiable saddle points and sub-optimal local minima exist for deep ReLU networks
Mathematics Subject Classification:
- Numerical optimization and variational techniques (65K10)
- Artificial neural networks and deep learning (68T07)
- Nonconvex programming, global optimization (90C26)
Cites Work
- Measure theory and fine properties of functions
- Title not available
- Convergence of the Iterates of Descent Methods for Analytic Cost Functions
- Splitting methods with variable metric for Kurdyka-Łojasiewicz functions and general convergence rates
- Nonconvergence to unstable points in urn models and stochastic approximations
- Complete Dictionary Recovery Over the Sphere I: Overview and the Geometric Picture
- Nonconvex Robust Low-Rank Matrix Recovery
- A comparative analysis of optimization and generalization properties of two-layer neural network and random feature models under gradient descent dynamics
- A geometric analysis of phase retrieval
- Theoretical Insights Into the Optimization Landscape of Over-Parameterized Shallow Neural Networks
- Gradient descent only converges to minimizers: non-isolated critical points and invariant regions
- First-order methods almost always avoid strict saddle points
- Stochastic subgradient method converges on tame functions
- Gradient descent optimizes over-parameterized deep ReLU networks
- The nonsmooth landscape of phase retrieval
- A proof of convergence for gradient descent in the training of artificial neural networks for constant target functions
- Landscape analysis for shallow neural networks: complete classification of critical points for affine target functions
- Learning deep linear neural networks: Riemannian gradient flows and convergence to global minimizers
- Spurious valleys in one-hidden-layer neural network optimization landscapes
- Behavior of accelerated gradient methods near critical points of nonconvex functions
Cited In (4)