Analysis and improvement of policy gradient estimation

From MaRDI portal
Publication:448295

DOI10.1016/J.NEUNET.2011.09.005zbMATH Open1245.68165DBLPjournals/nn/ZhaoHNS12OpenAlexW2148053762WikidataQ51513131 ScholiaQ51513131MaRDI QIDQ448295FDOQ448295


Authors: Tingting Zhao, Hirotaka Hachiya, Gang Niu, Masashi Sugiyama Edit this on Wikidata


Publication date: 30 August 2012

Published in: Neural Networks (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.neunet.2011.09.005




Recommendations




Cites Work


Cited In (15)





This page was built for publication: Analysis and improvement of policy gradient estimation

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q448295)