On the rate of convergence of fully connected deep neural network regression estimates

From MaRDI portal (Publication: 2054491)

DOI: 10.1214/20-AOS2034
zbMATH Open: 1486.62112
arXiv: 1908.11133
MaRDI QID: Q2054491
FDO: Q2054491


Authors: Michael Kohler, Sophie Langer


Publication date: 3 December 2021

Published in: The Annals of Statistics

Abstract: Recent results in nonparametric regression show that deep learning, i.e., neural network estimates with many hidden layers, can circumvent the so-called curse of dimensionality provided that suitable restrictions on the structure of the regression function hold. One key feature of the neural networks used in these results is that their architecture is subject to a further constraint, namely network sparsity. In this paper we show that similar results can also be obtained for least squares estimates based on simple fully connected neural networks with ReLU activation function. Here either the number of neurons per hidden layer is fixed and the number of hidden layers tends to infinity suitably fast as the sample size tends to infinity, or the number of hidden layers is bounded by some logarithmic factor in the sample size and the number of neurons per hidden layer tends to infinity suitably fast as the sample size tends to infinity. The proof is based on new approximation results concerning deep neural networks.
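For orientation, a minimal formal sketch of the estimation setup described in the abstract (the symbols $\mathcal{F}_n$, $L_n$ and $r_n$ are chosen here for illustration and need not match the paper's own notation): given i.i.d. data $(X_1, Y_1), \dots, (X_n, Y_n)$ with regression function $m(x) = \mathbb{E}[Y \mid X = x]$, let $\mathcal{F}_n$ denote the class of fully connected feedforward neural networks with ReLU activation $\sigma(x) = \max\{x, 0\}$, $L_n$ hidden layers and $r_n$ neurons per hidden layer. The least squares estimate is

\[
  \tilde{m}_n = \arg\min_{f \in \mathcal{F}_n} \frac{1}{n} \sum_{i=1}^{n} \bigl| Y_i - f(X_i) \bigr|^2 ,
\]

with the architecture scaled in one of the two regimes named in the abstract: either $r_n$ is fixed and $L_n \to \infty$ suitably fast as $n \to \infty$, or $L_n = O(\log n)$ and $r_n \to \infty$ suitably fast as $n \to \infty$.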


Full work available at URL: https://arxiv.org/abs/1908.11133








Cites Work


Cited In (33)





