Deep neural networks, generic universal interpolation, and controlled ODEs

Publication:5037577

DOI: 10.1137/19M1284117
zbMATH Open: 1485.93062
arXiv: 1908.07838
OpenAlex: W3089885363
MaRDI QID: Q5037577
FDO: Q5037577


Authors: Christa Cuchiero, Martin Larsson, Josef Teichmann


Publication date: 1 March 2022

Published in: SIAM Journal on Mathematics of Data Science

Abstract: A recent paradigm views deep neural networks as discretizations of certain controlled ordinary differential equations, sometimes called neural ordinary differential equations. We use this perspective to link the expressiveness of deep networks to the controllability of dynamical systems. Using this connection, we study an expressiveness property that we call universal interpolation, and show that it is generic in a certain sense. The universal interpolation property is slightly weaker than universal approximation, and disentangles supervised learning on finite training sets from generalization properties. We also show that universal interpolation holds for certain deep neural networks even if large numbers of parameters are left untrained and are instead chosen randomly. This lends theoretical support to the observation that training with random initialization can be successful even when most parameters remain largely unchanged during training. Our results also indicate how small the number of trainable parameters in a neural ordinary differential equation can be without giving up expressiveness.


Full work available at URL: https://arxiv.org/abs/1908.07838
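
Illustrative sketch (not taken from the paper or this record): the abstract's view of a deep network as a discretization of a controlled ODE dx/dt = sum_i u_i(t) V_i(x) can be made concrete with an explicit Euler scheme, where each time step acts like a residual layer, the vector fields V_i are fixed at random, and only the controls u_i would be trained. All names, dimensions, and the tanh vector fields below are hypothetical choices made for the example, written in Python with NumPy.

import numpy as np

rng = np.random.default_rng(0)

d = 4            # state dimension (hypothetical choice)
n_fields = 3     # number of fixed random vector fields V_i
n_layers = 20    # Euler steps, playing the role of network depth
dt = 1.0 / n_layers

# Random, untrained vector fields V_i(x) = tanh(A_i x + b_i)
A = rng.standard_normal((n_fields, d, d)) / np.sqrt(d)
b = rng.standard_normal((n_fields, d))

def flow(x, controls):
    """Explicit Euler scheme for the controlled ODE.

    `controls` has shape (n_layers, n_fields); controls[k, i] is the value of
    the control u_i on the k-th time step.
    """
    for k in range(n_layers):
        drift = sum(controls[k, i] * np.tanh(A[i] @ x + b[i]) for i in range(n_fields))
        x = x + dt * drift   # one residual-style layer
    return x

# Here the controls are sampled at random just to run the forward pass; in the
# viewpoint described in the abstract they would be the trainable parameters,
# while A and b stay fixed at their random initialization.
controls = rng.standard_normal((n_layers, n_fields))
x0 = rng.standard_normal(d)
print(flow(x0, controls))

In this toy setting only the n_layers * n_fields control values would be trained, echoing the abstract's point that expressiveness can survive even when most parameters are left at random values.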






Cited In (13)





