Deep learning architectures for nonlinear operator functions and nonlinear inverse problems

DOI: 10.4171/MSL/28; zbMATH Open: 1485.35420; arXiv: 1912.11090; MaRDI QID: Q2113263; FDO: Q2113263

Authors: Maarten V. De Hoop, M. Lassas, Christopher A. Wong

Publication date: 11 March 2022

Published in: Mathematical Statistics and Learning

Abstract: We develop a theoretical analysis for special neural network architectures, termed operator recurrent neural networks, for approximating nonlinear functions whose inputs are linear operators. Such functions commonly arise in solution algorithms for inverse boundary value problems. Traditional neural networks treat input data as vectors, and thus they do not effectively capture the multiplicative structure associated with the linear operators that correspond to the data in such inverse problems. We therefore introduce a new family that resembles a standard neural network architecture, but where the input data acts multiplicatively on vectors. Motivated by compact operators appearing in boundary control and the analysis of inverse boundary value problems for the wave equation, we promote structure and sparsity in selected weight matrices in the network. After describing this architecture, we study its representation properties as well as its approximation properties. We furthermore show that an explicit regularization can be introduced that can be derived from the mathematical analysis of the mentioned inverse problems, and which leads to certain guarantees on the generalization properties. We observe that the sparsity of the weight matrices improves the generalization estimates. Lastly, we discuss how operator recurrent networks can be viewed as a deep learning analogue to deterministic algorithms such as boundary control for reconstructing the unknown wavespeed in the acoustic wave equation from boundary measurements.
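
To make the phrase "the input data acts multiplicatively on vectors" concrete, the following is a minimal sketch and not the paper's exact definition. It assumes a simplified layer of the form h_next = relu(W h + B (Λ h) + bias), where Λ is the input operator (here a small matrix) and W, B, bias are trainable weights. The names `operator_layer` and `operator_network` are hypothetical and chosen only for this illustration.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def matvec(m: Matrix, v: Vector) -> Vector:
    """Plain matrix-vector product."""
    return [sum(a * x for a, x in zip(row, v)) for row in m]


def relu(v: Vector) -> Vector:
    """Componentwise rectifier."""
    return [max(0.0, x) for x in v]


def operator_layer(op: Matrix, h: Vector, w: Matrix, b: Matrix, bias: Vector) -> Vector:
    """One layer of the assumed form relu(W h + B (op h) + bias).

    The data `op` is not flattened into a vector; it multiplies the
    hidden state h, which is the multiplicative structure the abstract refers to.
    """
    op_h = matvec(op, h)          # data operator acts on the hidden state
    w_h = matvec(w, h)            # ordinary weight term
    b_op_h = matvec(b, op_h)      # weight applied after the operator
    return relu([p + q + r for p, q, r in zip(w_h, b_op_h, bias)])


def operator_network(
    op: Matrix, h0: Vector, layers: Sequence[Tuple[Matrix, Matrix, Vector]]
) -> Vector:
    """Apply the same input operator `op` in every layer (the recurrent part)."""
    h = h0
    for w, b, bias in layers:
        h = operator_layer(op, h, w, b, bias)
    return h


if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    zero_bias = [0.0, 0.0]
    op = [[2.0, 0.0], [0.0, 3.0]]
    layers = [(identity, identity, zero_bias)] * 2
    print(operator_network(op, [1.0, 1.0], layers))
```

With identity weights and zero bias, each layer in this sketch computes (I + Λ)h on nonnegative inputs, so two layers yield (I + Λ)²h. Stacking layers therefore produces higher powers of the input operator, which a network that only sees Λ as a flattened vector does not obtain so directly.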

Full work available at URL: https://arxiv.org/abs/1912.11090

zbMATH Keywords

sparse matrices; inverse problems; neural networks; wave equation

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05); Neural nets and related approaches to inference from stochastic processes (62M45); Inverse problems for PDEs (35R30)

Cited In (8)
