Transfer in variable-reward hierarchical reinforcement learning
From MaRDI portal
Publication:1009300
DOI: 10.1007/s10994-008-5061-y
zbMath: 1470.68147
OpenAlex: W2132057084
MaRDI QID: Q1009300
Prasad Tadepalli, Alan Fern, Sriraam Natarajan, Neville Mehta
Publication date: 31 March 2009
Published in: Machine Learning
Full work available at URL: https://doi.org/10.1007/s10994-008-5061-y
Learning and adaptive systems in artificial intelligence (68T05)
Markov and semi-Markov decision processes (90C40)
Related Items
Cites Work
- Planning and acting in partially observable stochastic domains
- Model-based average reward reinforcement learning
- Multi-objective infinite-horizon discounted Markov decision processes
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Constrained Markov Decision Models with Weighted Discounted Rewards
- Building Relational World Models for Reinforcement Learning