A proof of convergence for gradient descent in the training of artificial neural networks for constant target functions
DOI: 10.1016/j.jco.2022.101646
zbMath: 1502.65037
arXiv: 2102.09924
OpenAlex: W3132264265
Wikidata: Q113871711 (Scholia: Q113871711)
MaRDI QID: Q2145074
Patrick Cheridito, Adrian Riekert, Florian Rossmannek, Arnulf Jentzen
Publication date: 17 June 2022
Published in: Journal of Complexity
Full work available at URL: https://arxiv.org/abs/2102.09924
Keywords: nonsmooth optimization, nonconvex optimization, gradient methods, artificial neural networks, machine learning
MSC classes: Artificial neural networks and deep learning (68T07); Numerical optimization and variational techniques (65K10); Approximation by other special function classes (41A30)
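For orientation, the following is a minimal, self-contained sketch of the setting named in the title: plain gradient descent applied to the mean-squared error of a one-hidden-layer ReLU network fitted to a constant target function. The architecture, learning rate, and all variable names below are assumptions chosen for illustration only; this is not the algorithm analysed in the paper, nor its proof.

# Illustrative sketch only: gradient descent on the mean-squared error of a
# one-hidden-layer ReLU network with a constant target function. All
# hyperparameters (width, learning rate, target value) are assumptions.
import numpy as np

rng = np.random.default_rng(0)

# Training inputs on [0, 1] and a constant target value (assumed c = 1.0).
x = rng.uniform(0.0, 1.0, size=(128, 1))
c = 1.0
y = np.full((128, 1), c)

# One hidden layer with ReLU activation.
n_hidden = 16
W1 = rng.normal(0.0, 1.0, size=(1, n_hidden))
b1 = np.zeros(n_hidden)
W2 = rng.normal(0.0, 1.0, size=(n_hidden, 1))
b2 = np.zeros(1)

lr = 0.05  # assumed constant learning rate
for step in range(2000):
    # Forward pass.
    z = x @ W1 + b1              # pre-activations, shape (128, n_hidden)
    a = np.maximum(z, 0.0)       # ReLU
    out = a @ W2 + b2            # network output, shape (128, 1)

    # Mean-squared error against the constant target.
    err = out - y
    loss = np.mean(err ** 2)

    # Backward pass: gradients of the mean-squared error.
    grad_out = 2.0 * err / len(x)        # (128, 1)
    grad_W2 = a.T @ grad_out             # (n_hidden, 1)
    grad_b2 = grad_out.sum(axis=0)       # (1,)
    grad_a = grad_out @ W2.T             # (128, n_hidden)
    grad_z = grad_a * (z > 0.0)          # ReLU subgradient
    grad_W1 = x.T @ grad_z               # (1, n_hidden)
    grad_b1 = grad_z.sum(axis=0)         # (n_hidden,)

    # Gradient-descent update with constant step size.
    W1 -= lr * grad_W1
    b1 -= lr * grad_b1
    W2 -= lr * grad_W2
    b2 -= lr * grad_b2

    if step % 500 == 0:
        print(f"step {step:4d}  loss {loss:.6f}")

Running the sketch prints the training loss every 500 steps, which lets one observe its behaviour on this toy constant-target problem.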
Related Items (4)
Cites Work
- Non-convergence of stochastic gradient descent in the training of deep neural networks
- Gradient descent optimizes over-parameterized deep ReLU networks
- A comparative analysis of optimization and generalization properties of two-layer neural network and random feature models under gradient descent dynamics
- Lower error bounds for the stochastic gradient descent optimization algorithm: sharp convergence rates for slowly and fast decaying learning rates
- Strong error analysis for stochastic gradient descent optimization algorithms
- Full error analysis for the training of deep neural networks
- Dying ReLU and Initialization: Theory and Numerical Examples
- Breaking the Curse of Dimensionality with Convex Neural Networks