Publication:3174040
From MaRDI portal
zbMath1222.90077MaRDI QIDQ3174040
Andrew G. Barto, Anders Jonsson
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v7/jonsson06a.html
68T05: Learning and adaptive systems in artificial intelligence
90C40: Markov and semi-Markov decision processes
Related Items
Offline reinforcement learning with task hierarchies, AUTOMATIC COMPLEXITY REDUCTION IN REINFORCEMENT LEARNING