ReLU deep neural networks and linear finite elements

From MaRDI portal
Publication:3385667

DOI10.4208/JCM.1901-M2018-0160zbMATH Open1463.68072arXiv1807.03973OpenAlexW3099849883WikidataQ114021207 ScholiaQ114021207MaRDI QIDQ3385667FDOQ3385667


Authors: Juncai He, Lin Li, Chunyue Zheng, Jinchao Xu Edit this on Wikidata


Publication date: 14 January 2021

Published in: Journal of Computational Mathematics (Search for Journal in Brave)

Abstract: In this paper, we investigate the relationship between deep neural networks (DNN) with rectified linear unit (ReLU) function as the activation function and continuous piecewise linear (CPWL) functions, especially CPWL functions from the simplicial linear finite element method (FEM). We first consider the special case of FEM. By exploring the DNN representation of its nodal basis functions, we present a ReLU DNN representation of CPWL in FEM. We theoretically establish that at least 2 hidden layers are needed in a ReLU DNN to represent any linear finite element functions in OmegasubseteqmathbbRd when dge2. Consequently, for d=2,3 which are often encountered in scientific and engineering computing, the minimal number of two hidden layers are necessary and sufficient for any CPWL function to be represented by a ReLU DNN. Then we include a detailed account on how a general CPWL in mathbbRd can be represented by a ReLU DNN with at most lceillog2(d+1)ceil hidden layers and we also give an estimation of the number of neurons in DNN that are needed in such a representation. Furthermore, using the relationship between DNN and FEM, we theoretically argue that a special class of DNN models with low bit-width are still expected to have an adequate representation power in applications. Finally, as a proof of concept, we present some numerical results for using ReLU DNNs to solve a two point boundary problem to demonstrate the potential of applying DNN for numerical solution of partial differential equations.


Full work available at URL: https://arxiv.org/abs/1807.03973




Recommendations





Cited In (62)





This page was built for publication: ReLU deep neural networks and linear finite elements

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3385667)