scientific article
From MaRDI portal
Publication:3174169
zbMath1222.68202MaRDI QIDQ3174169
Mohammad Ghavamzadeh, Sridhar Mahadevan
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v8/ghavamzadeh07a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
semi-Markov decision processeshierarchical reinforcement learningaverage reward reinforcement learninghierarchical and recursive optimality
Related Items (3)
Probabilistic inference for determining options in reinforcement learning ⋮ Exact decomposition approaches for Markov decision processes: a survey ⋮ Reinforcement learning algorithms with function approximation: recent advances and applications
This page was built for publication: