Accelerating gradient descent and Adam via fractional gradients (Q6057934)
From MaRDI portal
scientific article; zbMATH DE number 7755615
Language | Label | Description | Also known as
---|---|---|---
English | Accelerating gradient descent and Adam via fractional gradients | scientific article; zbMATH DE number 7755615 |
Statements
Accelerating gradient descent and Adam via fractional gradients (English)
0 references
26 October 2023
0 references
The paper proposes a general class of fractional-order optimization algorithms based on Caputo fractional derivatives. Theorem 2.5 serves as the theoretical motivation for using Caputo fractional derivatives in optimization. Building on this theorem, the authors define the Caputo fractional-based gradient, which generalizes the standard integer-order gradient, and develop an efficient implementation of it. By replacing integer-order gradients with Caputo fractional-based ones, they extend gradient descent (GD) and Adam to the Caputo fractional gradient descent (CfGD) and the Caputo fractional Adam (CfAdam), respectively. The superiority of CfGD and CfAdam is demonstrated on several large-scale optimization problems arising from scientific machine learning applications, such as an ill-conditioned least-squares problem on real-world data and the training of neural networks with non-convex objective functions. Numerical examples show that both CfGD and CfAdam yield acceleration over GD and Adam, respectively.
0 references
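For orientation, the Caputo fractional derivative of order \(\alpha \in (0,1)\) with lower terminal \(a\) is the standard definition recalled below; the update rule that follows is only a schematic sketch of how a fractional gradient might replace the integer-order gradient in GD, not necessarily the paper's exact construction (the notation \(\nabla^{\alpha} f\), the step size \(\eta\), and the terminal \(a\) are assumptions made here for illustration).
$$
{}^{C}D_{a}^{\alpha} f(x) \;=\; \frac{1}{\Gamma(1-\alpha)} \int_{a}^{x} \frac{f'(t)}{(x-t)^{\alpha}}\, dt, \qquad 0 < \alpha < 1,
$$
$$
x_{k+1} \;=\; x_k - \eta\, \nabla^{\alpha} f(x_k),
$$
where \(\nabla^{\alpha} f\) collects componentwise Caputo derivatives of order \(\alpha\) and reduces to the ordinary gradient as \(\alpha \to 1\). CfAdam is obtained analogously by feeding such a fractional gradient into Adam's moment estimates in place of the integer-order gradient.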
Caputo fractional derivative
0 references
non-local calculus
0 references
optimization
0 references
Adam
0 references
neural networks
0 references