A multilevel approach for stochastic nonlinear optimal control

From MaRDI portal
Publication:5863708

DOI10.1080/00207179.2020.1849805zbMATH Open1492.93201arXiv1901.05583OpenAlexW3106433142MaRDI QIDQ5863708FDOQ5863708


Authors: Ajay Jasra, Jeremy Heng, Yaxian Xu, Adrian N. Bishop Edit this on Wikidata


Publication date: 3 June 2022

Published in: International Journal of Control (Search for Journal in Brave)

Abstract: We consider a class of finite time horizon nonlinear stochastic optimal control problem, where the control acts additively on the dynamics and the control cost is quadratic. This framework is flexible and has found applications in many domains. Although the optimal control admits a path integral representation for this class of control problems, efficient computation of the associated path integrals remains a challenging Monte Carlo task. The focus of this article is to propose a new Monte Carlo approach that significantly improves upon existing methodology. Our proposed methodology first tackles the issue of exponential growth in variance with the time horizon by casting optimal control estimation as a smoothing problem for a state space model associated with the control problem, and applying smoothing algorithms based on particle Markov chain Monte Carlo. To further reduce computational cost, we then develop a multilevel Monte Carlo method which allows us to obtain an estimator of the optimal control with mathcalO(epsilon2) mean squared error with a computational cost of mathcalO(epsilon2log(epsilon)2). In contrast, a computational cost of mathcalO(epsilon3) is required for existing methodology to achieve the same mean squared error. Our approach is illustrated on two numerical examples, which validate our theory.


Full work available at URL: https://arxiv.org/abs/1901.05583




Recommendations




Cites Work


Cited In (9)





This page was built for publication: A multilevel approach for stochastic nonlinear optimal control

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5863708)