Projected Policy Gradient Converges in a Finite Number of Iterations

From MaRDI portal
Publication:6514082

arXiv2311.01104MaRDI QIDQ6514082FDOQ6514082


Authors: Jia-Cai Liu, Wenye Li, Ke Wei Edit this on Wikidata














This page was built for publication: Projected Policy Gradient Converges in a Finite Number of Iterations

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6514082)