Convergence rate analysis for deep Ritz method

From MaRDI portal
Publication:5077692

DOI: 10.4208/CICP.OA-2021-0195 | zbMATH Open: 1491.65117 | arXiv: 2103.13330 | OpenAlex: W4220826888 | MaRDI QID: Q5077692


Authors: Chenguang Duan, Y. M. Lai, Dingwei Li, Xiliang Lu, Yu Ling Jiao, Jerry Zhijian Yang


Publication date: 19 May 2022

Published in: Communications in Computational Physics

Abstract: Using deep neural networks to solve PDEs has attracted a lot of attention recently. However, the theoretical understanding of why deep learning methods work lags far behind their empirical success. In this paper, we provide a rigorous numerical analysis of the deep Ritz method (DRM) [wan11] for second-order elliptic equations with Neumann boundary conditions. We establish the first nonasymptotic convergence rate in the H^1 norm for DRM using deep networks with ReLU^2 activation functions. In addition to providing a theoretical justification of DRM, our study also sheds light on how to set the depth and width hyper-parameters to achieve the desired convergence rate in terms of the number of training samples. Technically, we derive bounds on the approximation error of deep ReLU^2 networks in the H^1 norm and on the Rademacher complexity of the non-Lipschitz composition of the gradient norm and a ReLU^2 network, both of which are of independent interest.
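To illustrate the variational principle that DRM discretizes, here is a minimal sketch (not the paper's code; the 1D model problem, the functions `f`, `ritz_energy`, and the sample size are illustrative assumptions). DRM replaces the PDE by minimization of a Ritz energy, estimated by Monte Carlo sampling over the domain; the sketch checks that the exact solution attains a lower empirical energy than a perturbed competitor.

```python
import numpy as np

# Illustrative 1D Neumann model problem: -u'' + u = f on (0, 1), u'(0) = u'(1) = 0.
# Ritz energy: E(v) = ∫_0^1 ( 1/2 v'(x)^2 + 1/2 v(x)^2 - f(x) v(x) ) dx.
# With f(x) = (pi^2 + 1) cos(pi x), the exact minimizer is u*(x) = cos(pi x).

def f(x):
    return (np.pi**2 + 1.0) * np.cos(np.pi * x)

def ritz_energy(v, dv, n=200_000, seed=0):
    """Monte Carlo estimate of E(v) from n uniform samples on (0, 1)."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(0.0, 1.0, n)
    integrand = 0.5 * dv(x)**2 + 0.5 * v(x)**2 - f(x) * v(x)
    return integrand.mean()

# Exact solution and a perturbed competitor (both satisfy the Neumann condition).
u_star  = lambda x: np.cos(np.pi * x)
du_star = lambda x: -np.pi * np.sin(np.pi * x)
v_pert  = lambda x: np.cos(np.pi * x) + 0.3 * np.cos(2 * np.pi * x)
dv_pert = lambda x: -np.pi * np.sin(np.pi * x) - 0.6 * np.pi * np.sin(2 * np.pi * x)

E_star = ritz_energy(u_star, du_star)
E_pert = ritz_energy(v_pert, dv_pert)
print(E_star, E_pert)  # the exact solution attains the smaller energy
```

In DRM proper, the candidate `v` is a deep network with ReLU^2 activations, its derivative comes from automatic differentiation, and the empirical energy is minimized over the network parameters; the paper's analysis bounds the gap between this empirical minimizer and the true solution in the H^1 norm.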


Full work available at URL: https://arxiv.org/abs/2103.13330




Cited In (30)



This page was built for publication: Convergence rate analysis for deep Ritz method
