Publication:3174169

From MaRDI portal

Jump to:navigation, search

zbMath1222.68202MaRDI QIDQ3174169

Mohammad Ghavamzadeh, Sridhar Mahadevan

Publication date: 12 October 2011

Full work available at URL: http://www.jmlr.org/papers/v8/ghavamzadeh07a.html

zbMATH Keywords

semi-Markov decision processes; hierarchical reinforcement learning; average reward reinforcement learning; hierarchical and recursive optimality

Mathematics Subject Classification ID

68T05: Learning and adaptive systems in artificial intelligence

Related Items

Probabilistic inference for determining options in reinforcement learning, Exact decomposition approaches for Markov decision processes: a survey, Reinforcement learning algorithms with function approximation: recent advances and applications

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3174169&oldid=16413465"