Categorical foundations of gradient-based learning

DOI 10.1007/978-3-030-99336-8_1

MaRDI QID Q6166781

Authors G. S. H. Cruttwell, Bruno Gavranović, Neil Ghani, Paul W. Wilson, Fabio Zanasi

Publication date 3 August 2023

Published in Programming Languages and Systems

Copyright license Creative Commons Attribution 4.0 International

Full work available at URL https://arxiv.org/abs/2103.01931

Learning and adaptive systems in artificial intelligence (68T05); Categorical semantics of formal languages (18C50)

Abstract: We propose a categorical semantics of gradient-based machine learning algorithms in terms of lenses, parametrised maps, and reverse derivative categories. This foundation provides a powerful explanatory and unifying framework: it encompasses a variety of gradient descent algorithms such as ADAM, AdaGrad, and Nesterov momentum, as well as a variety of loss functions such as MSE and Softmax cross-entropy, shedding new light on their similarities and differences. Our approach to gradient-based learning has examples generalising beyond the familiar continuous domains (modelled in categories of smooth maps) and can be realized in the discrete setting of boolean circuits. Finally, we demonstrate the practical significance of our framework with an implementation in Python.
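
Illustrative sketch (not the authors' implementation): the abstract's ingredients can be pictured with a small Python example. A lens is taken here to be a pair of a forward map and a backward map, a parametrised map is a lens whose input is a (parameter, input) pair, and the backward map plays the role of a reverse derivative. The names Lens, linear, squared_error, and fit are hypothetical and chosen only for this example; the loss is a single-sample squared error and the update rule is plain gradient descent, both assumed for simplicity.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class Lens:
    """A forward map a -> b paired with a backward map (a, db) -> da."""
    forward: Callable[[Any], Any]
    backward: Callable[[Any, Any], Any]

    def then(self, other: "Lens") -> "Lens":
        """Compose lenses: forward runs left to right, backward right to left."""
        def fwd(a):
            return other.forward(self.forward(a))

        def bwd(a, dc):
            b = self.forward(a)          # recompute the intermediate value
            db = other.backward(b, dc)   # pull the change back through `other`
            return self.backward(a, db)  # then back through `self`

        return Lens(fwd, bwd)


def linear() -> Lens:
    """Parametrised map f(p, x) = p * x together with its reverse derivative."""
    return Lens(
        forward=lambda px: px[0] * px[1],
        backward=lambda px, dy: (px[1] * dy, px[0] * dy),  # (d/dp, d/dx)
    )


def squared_error(target: float) -> Lens:
    """Loss 0.5 * (y - target)^2 as a lens; backward scales by (y - target)."""
    return Lens(
        forward=lambda y: 0.5 * (y - target) ** 2,
        backward=lambda y, dl: (y - target) * dl,
    )


def fit(x: float, target: float, lr: float = 0.1, steps: int = 100) -> float:
    """Plain gradient descent on the single parameter p, starting from 0."""
    p = 0.0
    model_with_loss = linear().then(squared_error(target))
    for _ in range(steps):
        dp, _dx = model_with_loss.backward((p, x), 1.0)
        p = p - lr * dp  # gradient descent update on the parameter only
    return p


if __name__ == "__main__":
    print(fit(x=2.0, target=6.0))
```

In this sketch the optimiser only touches the parameter component of the backward pass; swapping the update line for a momentum or adaptive rule would change the learning algorithm without changing the model or the loss.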

Cited in (20)
