Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation

From MaRDI portal
Revision as of 17:02, 30 January 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:889297


DOI10.1016/j.neunet.2014.06.006zbMath1325.68200arXiv1307.5118WikidataQ39164602 ScholiaQ39164602MaRDI QIDQ889297

Syogo Mori, Voot Tangkaratt, Jun Morimoto, Masashi Sugiyama, Tingting Zhao

Publication date: 6 November 2015

Published in: Neural Networks (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1307.5118


62G07: Density estimation

68T05: Learning and adaptive systems in artificial intelligence


Related Items


Uses Software


Cites Work