Policy gradient methods for discrete time linear quadratic regulator with random parameters
From MaRDI portal
Publication:6491779
DOI10.1051/cocv/2024014MaRDI QIDQ6491779
Publication date: 24 April 2024
Published in: European Series in Applied and Industrial Mathematics (ESAIM): Control, Optimization and Calculus of Variations (Search for Journal in Brave)
reinforcement learningrandom parameterslinear quadratic optimal controlsub-Gaussianitymodel-free policy gradient method
Analysis of algorithms (68W40) Linear-quadratic optimal control problems (49N10) Stochastic learning and adaptive control (93E35)
This page was built for publication: Policy gradient methods for discrete time linear quadratic regulator with random parameters