Gradient descent on infinitely wide neural networks: global convergence and generalization (Q6200217)

From MaRDI portal
scientific article; zbMATH DE number 7822598
Language: English

    Statements

    Gradient descent on infinitely wide neural networks: global convergence and generalization (English)
    Publication date: 22 March 2024
    Summary: Many supervised machine learning methods are naturally cast as optimization problems. For prediction models that are linear in their parameters, this often leads to convex problems, for which many mathematical guarantees exist. Models that are nonlinear in their parameters, such as neural networks, lead to nonconvex optimization problems, for which guarantees are harder to obtain. In this paper, we consider two-layer neural networks with homogeneous activation functions where the number of hidden neurons tends to infinity, and show how qualitative convergence guarantees may be derived. For the entire collection see [Zbl 07816361].
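    To illustrate the setting described in the summary, here is a minimal numerical sketch in Python/NumPy: full-batch gradient descent on an overparameterized two-layer network with a ReLU activation (ReLU is positively homogeneous, matching the class of activations considered). The toy regression task, the width, the learning rate, and all variable names are illustrative assumptions rather than details from the paper; the 1/m factor in the gradients is absorbed into the learning rate, the usual time rescaling under the mean-field parameterization.

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy 1-D regression task (illustrative, not from the paper):
    # inputs with an appended bias coordinate, targets y = sin(3x).
    n, m = 64, 1024                       # samples, hidden width
    X = np.hstack([rng.uniform(-1.0, 1.0, (n, 1)), np.ones((n, 1))])
    y = np.sin(3.0 * X[:, 0])

    # Two-layer network with mean-field scaling:
    #   f(x) = (1/m) * sum_j a_j * relu(w_j . x),
    # where relu is positively 1-homogeneous.
    W = rng.normal(size=(m, 2))           # hidden-layer weights
    a = rng.normal(size=m)                # output weights
    lr = 0.05                             # per-neuron step size (lr/m on the raw gradient)

    for step in range(3000):
        h = np.maximum(X @ W.T, 0.0)      # hidden activations, shape (n, m)
        err = h @ a / m - y               # residuals of the squared loss
        # Per-neuron gradients of (1/2n) * ||f(X) - y||^2, with the 1/m factor
        # folded into the learning rate (mean-field time rescaling).
        grad_a = h.T @ err / n
        grad_W = ((err[:, None] * (h > 0.0) * a[None, :]).T @ X) / n
        a -= lr * grad_a
        W -= lr * grad_W

    print("final training loss:",
          0.5 * np.mean((np.maximum(X @ W.T, 0.0) @ a / m - y) ** 2))

    Under this scaling, each hidden unit acts as a particle in parameter space, and as m tends to infinity the empirical distribution of units follows a Wasserstein gradient flow; this is the optimal-transport viewpoint reflected in the keywords below.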
    Keywords: machine learning; neural networks; gradient descent; gradient flow; optimal transport

    Identifiers

    MaRDI item: Q6200217
    zbMATH DE number: 7822598