An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions

From MaRDI portal
Publication:5380403

DOI10.1162/NECO_A_00808zbMATH Open1472.68149DBLPjournals/neco/MaZHS16OpenAlexW2225522132WikidataQ47600318 ScholiaQ47600318MaRDI QIDQ5380403FDOQ5380403

Kohei Hatano, Yao Ma, Tingting Zhao, Masashi Sugiyama

Publication date: 4 June 2019

Published in: Neural Computation (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1162/neco_a_00808




Recommendations



Cites Work


Cited In (2)





This page was built for publication: An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5380403)