Publication:2810874
From MaRDI portal
zbMath1360.68674MaRDI QIDQ2810874
Yaakov Engel, Mohammad Ghavamzadeh, Michal Valko
Publication date: 6 June 2016
Full work available at URL: http://jmlr.csail.mit.edu/papers/v17/10-245.html
60G15: Gaussian processes
62F15: Bayesian inference
68T05: Learning and adaptive systems in artificial intelligence
Related Items
Unnamed Item, Hessian matrix distribution for Bayesian policy gradient reinforcement learning, Natural actor-critic algorithms, Bayesian policy reuse