Landscape analysis for shallow neural networks: complete classification of critical points for affine target functions


DOI: 10.1007/s00332-022-09823-8
zbMATH Open: 1491.68179
arXiv: 2103.10922
OpenAlex: W4284712609
MaRDI QID: Q2156337
FDO: Q2156337

Authors: Patrick Cheridito, Florian Rossmannek, Arnulf Jentzen

Publication date: 18 July 2022

Published in: Journal of Nonlinear Science

Abstract: In this paper, we analyze the landscape of the true loss of neural networks with one hidden layer and ReLU, leaky ReLU, or quadratic activation. In all three cases, we provide a complete classification of the critical points in the case where the target function is affine and one-dimensional. In particular, we show that there exist no local maxima and clarify the structure of saddle points. Moreover, we prove that non-global local minima can only be caused by "dead" ReLU neurons; consequently, they do not appear in the case of leaky ReLU or quadratic activation. Our approach is of a combinatorial nature and builds on a careful analysis of the different types of hidden neurons that can occur.
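
The following minimal sketch probes the dead-neuron phenomenon numerically. It is an illustration, not the paper's exact construction: the parameterization N(x) = v·relu(wx + b) + c with a single hidden neuron, the target f(x) = x, and the domain [0, 1] are all assumptions made here for concreteness. The "true loss" is the population risk ∫₀¹ (N(x) − f(x))² dx, and a dead neuron, one whose pre-activation wx + b is strictly negative on all of [0, 1], sits at a critical point with strictly positive loss.

```python
import numpy as np

# Illustrative sketch (not taken from the paper): a shallow network
# N(x) = v * relu(w*x + b) + c with one hidden ReLU neuron, measured
# against the affine target f(x) = x on [0, 1]. The true loss is the
# population risk L = \int_0^1 (N(x) - f(x))^2 dx, approximated on a
# uniform grid (the domain has length 1, so the mean equals the integral).

def true_loss(params, n_grid=10_001):
    w, b, v, c = params
    x = np.linspace(0.0, 1.0, n_grid)
    residual = v * np.maximum(w * x + b, 0.0) + c - x
    return np.mean(residual**2)

def grad(params, eps=1e-5):
    # Central finite differences; good enough to probe criticality.
    p = np.asarray(params, dtype=float)
    g = np.zeros_like(p)
    for i in range(p.size):
        e = np.zeros_like(p)
        e[i] = eps
        g[i] = (true_loss(p + e) - true_loss(p - e)) / (2.0 * eps)
    return g

# A dead neuron: w*x + b = -x - 0.5 < 0 on [0, 1], so the ReLU never
# activates and N(x) = c. With c = 1/2 (the best constant fit to
# f(x) = x), all four partial derivatives vanish, yet the loss is
# 1/12 > 0: a non-global local minimum caused solely by the dead neuron.
dead = (-1.0, -0.5, 1.0, 0.5)
print("loss at dead neuron:", true_loss(dead))   # ~ 1/12
print("gradient:", grad(dead))                   # ~ (0, 0, 0, 0)

# For comparison: relu acts as the identity on [0, 1] when w = 1, b = 0,
# so N(x) = x reproduces the target exactly and the loss is 0.
exact = (1.0, 0.0, 1.0, 0.0)
print("loss at exact fit:", true_loss(exact))    # ~ 0
```

Small perturbations of (w, b, v) leave the dead neuron's loss unchanged, since its pre-activation stays negative on the whole domain, while any move of c away from 1/2 increases the loss, which is why this critical point is a genuine (non-global) local minimum.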


Full work available at URL: https://arxiv.org/abs/2103.10922






Cited in 7 documents.




