Controlled interacting particle algorithms for simulation-based reinforcement learning

DOI10.1016/J.SYSCONLE.2022.105392zbMATH Open1505.49027arXiv2107.01244OpenAlexW4309220322MaRDI QIDQ2107628FDOQ2107628

Authors: Anant A. Joshi, Amirhossein Taghvaei, Prashant G. Mehta, Sean P. Meyn

Publication date: 2 December 2022

Published in: Systems \& Control Letters (Search for Journal in Brave)

Abstract: This paper is concerned with optimal control problems for control systems in continuous time, and interacting particle system methods designed to construct approximate control solutions. Particular attention is given to the linear quadratic (LQ) control problem. There is a growing interest in re-visiting this classical problem, in part due to the successes of reinforcement learning (RL). The main question of this body of research (and also of our paper) is to approximate the optimal control law {em without} explicitly solving the Riccati equation. A novel simulation-based algorithm, namely a dual ensemble Kalman filter (EnKF), is introduced. The algorithm is used to obtain formulae for optimal control, expressed entirely in terms of the EnKF particles. An extension to the nonlinear case is also presented. The theoretical results and algorithms are illustrated with numerical experiments.

Full work available at URL: https://arxiv.org/abs/2107.01244

Recommendations

zbMATH Keywords

duality optimal control Riccati equation reinforcement learning linear quadratic (LQ)

Mathematics Subject Classification ID

Linear-quadratic optimal control problems (49N10) Duality theory (optimization) (49N15)

Cites Work

Cited In (2)

Uses Software

This page was built for publication: Controlled interacting particle algorithms for simulation-based reinforcement learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2107628)