Projected Policy Gradient Converges in a Finite Number of Iterations
From MaRDI portal
Publication:6514082
arXiv2311.01104MaRDI QIDQ6514082FDOQ6514082
Authors: Jia-Cai Liu, Wenye Li, Ke Wei
This page was built for publication: Projected Policy Gradient Converges in a Finite Number of Iterations
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6514082)