Traveling wave solutions of partial differential equations via neural networks (Q1983171): Difference between revisions

Review Report: The investigation of travelling wave solutions of coupled PDEs in recent years has been found in chemical physics, mathematical biology and host of applied sciences. Over the past three decades the problem has been the subject of much interest and become an important area of research. So it is a matter of great significance to investigate the travelling wave solutions and the associated speed, thereby gaining insight into natural and bio-chemical phenomena. In the present paper, the authors propose a novel method for approximating travelling wave solutions via deep learning networks and apply them to three selected problems of current interest, namely the Keller-Segel model of chemotaxis, the Allen-Cahn model of chemical kinetics and the Lotka-Volterra model of population regulation modelled by coupled PDEs, nonlinear in nature. The motivation towards analysing such equations arises from the methodological treatment conducted by other researchers in favour of exploring the aforementioned models for their dynamics and applications. Each of these models is known to have a unique solution and the solutions are widely studied (the authors). The motivation for analysing them using ANN stems from the related previous work from [\textit{H. J. Hwang} et al., J. Comput. Phys. 419, Article ID 109665, 25 p. (2020; Zbl 07507228); \textit{H. Jo} et al., Netw. Heterog. Media 15, No. 2, 247--259 (2020; Zbl 1442.35474); \textit{J. Sirignano} and \textit{K. Spiliopoulos}, J. Comput. Phys. 375, 1339--1364 (2018; Zbl 1416.65394)] and the universal approximation theorem (UAT) of neural networks. Content and structure: The material in this paper is structured as follows. Section 2 is devoted to the introduction of an abstract model that attains travelling wave solutions in the general context and a detailed description of the proposed methodology for finding approximations to the travelling wave solutions and the corresponding speed using the corresponding neural network model. Loss functions using the $L2$ error of the governing equations are defined. Due to the difficulty in imposing the boundary conditions, loss functions are defined appropriately. Similarly to estimate the speed of the wave more accurately, Neumann boundary conditions are added at the boundaries of the truncated interval at which the solution satisfies asymptotically and losses are defined using mean of the limits to overcome the effect of translation. A training procedure consisting of two parts: feed-forward and back-propagation is briefly explained. The optimization process is employed to reduce the total loss created by combining all the losses defined above. A schematic diagram (overall architecture) is presented. The optimization problem can be solved by the gradient descent (GD) algorithm. Partial derivatives of loss functions can be computed easily by automatic differentiation (AD) [\textit{A. Paszke} et al., Automatic differentiation in pytorch. NeurIPS, Autodiff Workshop (2017)] and ADAM an optimizer is employed [\textit{D. P. Kingma} and \textit{J. Ba}, ``Adam: a method for stochastic optimization'', Preprint, \url{arXiv:1412.6980}]. Section 3 discusses the modalities of the deep neural network applied to the KS model for the approximation of travelling wave solutions and the corresponding speed. Primarily, the authors deal with the classical KS model. By imposing the travelling wave ansatz, a coupled ODE is obtained with boundary conditions. For the existence and uniqueness of the solution, a proposition [\textit{T. Li} and \textit{Z.-A. Wang}, Math. Biosci. 240, No. 2, 161--168 (2012; Zbl 1316.92013)] is invoked which gives expression to compute wave speed explicitly. To represent the set of functions that the neural network can approximate, the authors refer to the definition and theorem from [\textit{X. Li}, Neurocomputing 12, No. 4, 327--343 (1996; Zbl 0861.41013)]. As part of the theoretical background, two theorems are stated with proofs which ensure convergence of value of loss function to zero and estimated speed to the correct value. Numerical experiments of the KS model are conducted with sufficiently small to guarantee the existence and uniqueness of solutions and varying model parameters. With few modifications, the extension of the classical model with domain $\mathbb{R}$ to the multi-dimensional model in $\mathbb{R}^n$ is demonstrated choosing $n=4$. Overall it is observed, according to the authors, that the proposed method can be used to approximate travelling wave solutions in higher dimensions. Section 4 deals with an application of the proposed method to the Allen-Cahn model with relaxation. As in the previous section, the original domain, real line, is truncated interval $[-a, a]$ and learning is done within it. Numerical results are obtained choosing $a=200$ and varying model parameters. Experiments are conducted to estimate the width of the interval to obtain a reasonably good approximation of solution for the model under investigation. Section 5 focuses on an application of the proposed method to the LV competition model with two species. The existence and uniqueness of the solution is established in [\textit{Y. Kan-on}, SIAM J. Math. Anal. 26, No. 2, 340--363 (1995; Zbl 0821.34048)]. To the best of the authors' knowledge, the only known fact about speed in this model is its sign. The first experiment is aimed at approximating standing wave (wave front), the only case in which the exact speed is known. Finally, Section 6 corresponds to the conclusion where the area of further research is explored. The difficulty arising due to unboundedness of the domain is addressed by truncating the real line. Moreover, to improve the accuracy of the approximate solution addition of Neumann boundary condition at the end points of the truncated interval is justified. However, there remain some unresolved issues to be addressed forming part of authors' future work. Observations and comments: 1) (O) In the present paper, the authors apply ANNs to physical, chemical and biological phenomena modelled by coupled nonlinear PDEs admitting travelling wave solutions. (C) ANNs provide an ideal representation tool for PDE solutions because they are characterized by adjustable parameters that can be modified by incremental training algorithms. 2) (O) In this paper, the authors show that deep neural networks have powerful function-fitting and approximating capabilities and have great potential in the study of partial differential equations. (C) ANN solutions of PDEs are characterized by other advantages over FDM and FEM solutions that are especially important in non-stationary environments. 3) (O) This paper provides a natural paradigm for solving PDEs via ANNs, because the ANN can be adapted to minimize the appropriately defined loss functions by the governing equations and boundary conditions. (C) Nowadays, there has been a growing number of researchers while using deep learning methods to study partial differential equations. 4) (O) The authors employe the most straightforward and popular optimization algorithm ADAM based on gradient descent. (C) The optimization algorithm determines how the adjustment of the parameters in the neural network takes place. 5) (O) Plots of estimated wave speed and trajectories of the total loss in training epochs for different model parameters of the A-C and L-V models are provided. (C) Accuracy of an ANN model depends on a large number of parameters such as weights, bias, number of hidden layers, different kinds of activation functions and hyper-parameters. Epochs is a form of hyper-parameter which plays an integral part in the training process of a model. 6) (O) Sigmoid/logistic activation function and $tan h$ (tan hyperbolic) functions are used as activation functions. (O) The primary role of the activation function in the ANN model is to transform the summed weighted input from the node into an output value to be fed to the next hidden layer or as output. The main purpose of an activation function is to add non-linearity to the neural network. The nonlinear activation functions are the most used activation functions. Both functions are differentiable and monotonic with ranges $[0,1]$ and $[-1,1]$, respectively. 7) (O) The $L2$-norm loss function, also known as least squares error (LSE), is used to define the total loss function for the selected models. (C) Neural networks are trained using an optimization process that requires a loss function to calculate the model error. A loss function measures the quality of the network's output. 8) (O) LeCun initialization, Xavier uniform initialization are the type of initializations used for numerical experiments. (C) The initialization on biases and weights is a step that can be critical to the model's ultimate performance, and it depends on the choice of activation function. Xavier initialization works with $\tan h$ activations. Sometimes it helps to understand the mathematical justification to grasp the concept. Both aim to express the variance of the weights according to their respective inputs and outputs. 9) (O) The validation and verification of the neural network model of the proposed method including selection of appropriate error metrics, relies on theoretical results derived and proved from the related and present works. (C) Neural network models are data driven and therefore resist analytical or theoretical validation, and therefore must be empirically validated. 10) (O) ANN model developed by the authors is able to approximate travelling wave solutions and corresponding speeds with good precision and fast convergence. (C) Nowadays there has been a growing number of researchers where using deep learning methods study PDEs, systems of PDEs, and coupled nonlinear PDEs. Main contributions: 1) Introduction of an additional loss function to handle infinite domains; 2) Simultaneous approximation of travelling wave solutions and the wave speed of given PDEs; 3) Theoretical evidence to guarantee the wave speed; 4) Providing a unique solution as a correct answer even in cases where uniqueness is not guaranteed; 5) Overcoming the curse of dimensionality. Concluding remarks: Using the method presented in this paper, the authors perform the experimental analysis validating theoretical results of three important partial differential equations. The method proposed in this paper achieves good experimental results due to the powerful function approximation ability of neural networks and the physical, chemical and biological information contained in the NCPDEs (nonlinear coupled partial differential equations). Although the method used in this paper has many advantages, such as not having to consider the discretization of PDEs and eliminating the need of interpolation to cover the entire domain of the problem. However, the method also faces many problems, such as the neural network for solving PDEs relies heavily on training data, which often requires more training time when the quality of the training data is poor. Therefore, it is also important to investigate how to construct high-quality training datasets to reduce the training time. In this paper, authors focus primarily on the study one-dimensional coupled partial differential equations nonlinear in nature, using deep learning ANN. To the best of author's knowledge and belief, the method can be extended to multi-dimensional problems with a few modifications to the proposed method.

0 references

reviewed by

Chandrasekhar Salimath

0 references

zbMATH Keywords

traveling wave solution

0 references

estimation of wave speed

0 references

neural networks

0 references

convergence

0 references

describes a project that uses

0 references

0 references

0 references

0 references

MaRDI publication profile

0 references

cites work

Exact minimum speed of traveling waves in a Keller-Segel model

0 references

The existence of minimum speed of traveling wave solutions to a non-KPP isothermal diffusion system

0 references

The sign of the wave speed for the Lotka-Volterra competition-diffusion system

0 references

Traveling Wave and Multiple Traveling Wave Solutions of Parabolic Equations

0 references

The Numerical Calculation of Traveling Wave Solutions of Nonlinear Parabolic Equations

0 references

Exp-function method for nonlinear wave equations

0 references

Q2885005

0 references

Trend to equilibrium for the kinetic Fokker-Planck equation via the neural network approach

0 references

Deep neural network approach to forward-inverse problems

0 references

Parameter Dependence of Propagation Speed of Travelling Waves for Competition-Diffusion Equations

0 references

Exact and numerical traveling wave solutions for nonlinear coupled equations using symbolic computation

0 references

Traveling bands of chemotactic bacteria: a theoretical analysis

0 references

Transient Bounds and Time-Asymptotic Behavior of Solutions to Nonlinear Equations of Fisher Type

0 references

Analytical and numerical investigation of traveling waves for the Allen–Cahn model with relaxation

0 references

Asymptotic nonlinear stability of traveling waves to conservation laws arising from chemotaxis

0 references

Steadily propagating waves of a chemotaxis model

0 references

Simultaneous approximations of multivariate functions and their derivatives by neural networks with one hidden layer

0 references

DeepXDE: A Deep Learning Library for Solving Differential Equations

0 references

Solitary wave solutions of nonlinear wave equations

0 references

Traveling wave solutions of a nonlinear reaction-diffusion-chemotaxis model for bacterial pattern formation

0 references

Rogue waves, bright-dark solitons and traveling wave solutions of the $(3+1)$-dimensional generalized Kadomtsev-Petviashvili equation

0 references

Spreading speeds and traveling waves of a parabolic-elliptic chemotaxis system with logistic source on $\mathbb{R}^N$

0 references

DGM: a deep learning algorithm for solving partial differential equations

0 references

New traveling wave exact and approximate solutions for the nonlinear Cahn-Allen equation: evolution of a nonconserved quantity

0 references

A new Riccati equation rational expansion method and its application to $(2+1)$-dimensional Burgers equation

0 references

Existence and stability of traveling wave fronts in reaction advection diffusion equations with nonlocal delay

0 references

The tanh-coth method for solitons and kink solutions for nonlinear parabolic equations

0 references

Uniqueness and exponential stability of traveling wave fronts for a multi-type SIS nonlocal epidemic model

0 references

Existence and stability of traveling waves in periodic media governed by a bistable nonlinearity

0 references

Sharp bounds for the ratio of two zeta functions

0 references

Identifiers

zbMATH Open document ID

1503.65273

0 references

DOI

10.1007/s10915-021-01621-w

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1983171

@@ Property / author @@
-Hyung-Ju Hwang
@@ Property / author: Hyung-Ju Hwang / rank @@
-Normal rank
@@ Property / author @@
+Hyung-Ju Hwang
@@ Property / author: Hyung-Ju Hwang / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+DGM
@@ Property / describes a project that uses: DGM / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+PyTorch
@@ Property / describes a project that uses: PyTorch / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+DeepXDE
@@ Property / describes a project that uses: DeepXDE / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+Adam
@@ Property / describes a project that uses: Adam / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W3197436794
@@ Property / OpenAlex ID: W3197436794 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.08520
@@ Property / arXiv ID: 2101.08520 / rank @@
+Normal rank
@@ Property / cites work @@
+Exact minimum speed of traveling waves in a Keller-Segel model
+Normal rank
@@ Property / cites work @@
+The existence of minimum speed of traveling wave solutions to a non-KPP isothermal diffusion system
+Normal rank
@@ Property / cites work @@
+The sign of the wave speed for the Lotka-Volterra competition-diffusion system
+Normal rank
@@ Property / cites work @@
+Traveling Wave and Multiple Traveling Wave Solutions of Parabolic Equations
+Normal rank
@@ Property / cites work @@
+The Numerical Calculation of Traveling Wave Solutions of Nonlinear Parabolic Equations
+Normal rank
@@ Property / cites work @@
+Exp-function method for nonlinear wave equations
@@ Property / cites work: Exp-function method for nonlinear wave equations / rank @@
+Normal rank
@@ Property / cites work @@
+Q2885005
@@ Property / cites work: Q2885005 / rank @@
+Normal rank
@@ Property / cites work @@
+Trend to equilibrium for the kinetic Fokker-Planck equation via the neural network approach
+Normal rank
@@ Property / cites work @@
+Deep neural network approach to forward-inverse problems
+Normal rank
@@ Property / cites work @@
+Parameter Dependence of Propagation Speed of Travelling Waves for Competition-Diffusion Equations
+Normal rank
@@ Property / cites work @@
+Exact and numerical traveling wave solutions for nonlinear coupled equations using symbolic computation
+Normal rank
@@ Property / cites work @@
+Traveling bands of chemotactic bacteria: a theoretical analysis
+Normal rank
@@ Property / cites work @@
+Transient Bounds and Time-Asymptotic Behavior of Solutions to Nonlinear Equations of Fisher Type
+Normal rank
@@ Property / cites work @@
+Analytical and numerical investigation of traveling waves for the Allen–Cahn model with relaxation
+Normal rank
@@ Property / cites work @@
+Asymptotic nonlinear stability of traveling waves to conservation laws arising from chemotaxis
+Normal rank
@@ Property / cites work @@
+Steadily propagating waves of a chemotaxis model
@@ Property / cites work: Steadily propagating waves of a chemotaxis model / rank @@
+Normal rank
@@ Property / cites work @@
+Simultaneous approximations of multivariate functions and their derivatives by neural networks with one hidden layer
+Normal rank
@@ Property / cites work @@
+DeepXDE: A Deep Learning Library for Solving Differential Equations
+Normal rank
@@ Property / cites work @@
+Solitary wave solutions of nonlinear wave equations
+Normal rank
@@ Property / cites work @@
+Traveling wave solutions of a nonlinear reaction-diffusion-chemotaxis model for bacterial pattern formation
+Normal rank
@@ Property / cites work @@
+Rogue waves, bright-dark solitons and traveling wave solutions of the \((3+1)\)-dimensional generalized Kadomtsev-Petviashvili equation
+Normal rank
@@ Property / cites work @@
+Spreading speeds and traveling waves of a parabolic-elliptic chemotaxis system with logistic source on \(\mathbb{R}^N\)
+Normal rank
@@ Property / cites work @@
+DGM: a deep learning algorithm for solving partial differential equations
+Normal rank
@@ Property / cites work @@
+New traveling wave exact and approximate solutions for the nonlinear Cahn-Allen equation: evolution of a nonconserved quantity
+Normal rank
@@ Property / cites work @@
+A new Riccati equation rational expansion method and its application to \((2+1)\)-dimensional Burgers equation
+Normal rank
@@ Property / cites work @@
+Existence and stability of traveling wave fronts in reaction advection diffusion equations with nonlocal delay
+Normal rank
@@ Property / cites work @@
+The tanh-coth method for solitons and kink solutions for nonlinear parabolic equations
+Normal rank
@@ Property / cites work @@
+Uniqueness and exponential stability of traveling wave fronts for a multi-type SIS nonlocal epidemic model
+Normal rank
@@ Property / cites work @@
+Existence and stability of traveling waves in periodic media governed by a bistable nonlinearity
+Normal rank
@@ Property / cites work @@
+Sharp bounds for the ratio of two zeta functions
@@ Property / cites work: Sharp bounds for the ratio of two zeta functions / rank @@
+Normal rank