Publication:3174169
From MaRDI portal
zbMath1222.68202MaRDI QIDQ3174169
Mohammad Ghavamzadeh, Sridhar Mahadevan
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v8/ghavamzadeh07a.html
semi-Markov decision processes; hierarchical reinforcement learning; average reward reinforcement learning; hierarchical and recursive optimality
68T05: Learning and adaptive systems in artificial intelligence
Related Items
Probabilistic inference for determining options in reinforcement learning, Exact decomposition approaches for Markov decision processes: a survey, Reinforcement learning algorithms with function approximation: recent advances and applications