Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation

From MaRDI portal
Publication:889297


DOI10.1016/j.neunet.2014.06.006zbMath1325.68200arXiv1307.5118WikidataQ39164602 ScholiaQ39164602MaRDI QIDQ889297

Masashi Sugiyama, Tingting Zhao, Voot Tangkaratt, Syogo Mori, Jun Morimoto

Publication date: 6 November 2015

Published in: Neural Networks (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1307.5118


62G07: Density estimation

68T05: Learning and adaptive systems in artificial intelligence



Uses Software


Cites Work