Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model

From MaRDI portal
Publication:399890


DOI10.1007/s10994-013-5368-1zbMath1295.68180arXiv1206.6461OpenAlexW2120678009MaRDI QIDQ399890

Hilbert J. Kappen, Rémi Munos, Mohammad Gheshlaghi Azar

Publication date: 20 August 2014

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1206.6461



Related Items



Cites Work