Convergence analysis of machine learning algorithms for the numerical solution of mean field control and games. II: The finite horizon case

From MaRDI portal
Publication:2108885

DOI10.1214/21-AAP1715zbMATH Open1505.65243arXiv1908.01613MaRDI QIDQ2108885FDOQ2108885


Authors: Mathieu Laurière, René Carmona Edit this on Wikidata


Publication date: 20 December 2022

Published in: The Annals of Applied Probability (Search for Journal in Brave)

Abstract: We propose two numerical methods for the optimal control of McKean-Vlasov dynamics in finite time horizon. Both methods are based on the introduction of a suitable loss function defined over the parameters of a neural network. This allows the use of machine learning tools, and efficient implementations of stochastic gradient descent in order to perform the optimization. In the first method, the loss function stems directly from the optimal control problem. The second method tackles a generic forward-backward stochastic differential equation system (FBSDE) of McKean-Vlasov type, and relies on suitable reformulation as a mean field control problem. To provide a guarantee on how our numerical schemes approximate the solution of the original mean field control problem, we introduce a new optimization problem, directly amenable to numerical computation, and for which we rigorously provide an error rate. Several numerical examples are provided. Both methods can easily be applied to certain problems with common noise, which is not the case with the existing technology. Furthermore, although the first approach is designed for mean field control problems, the second is more general and can also be applied to the FBSDE arising in the theory of mean field games.


Full work available at URL: https://arxiv.org/abs/1908.01613




Recommendations




Cites Work


Cited In (24)





This page was built for publication: Convergence analysis of machine learning algorithms for the numerical solution of mean field control and games. II: The finite horizon case

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2108885)