scientific article; zbMATH DE number 2086978
From MaRDI portal
Publication:4737595
Recommendations
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
- scientific article; zbMATH DE number 2070740
- Approximate Value Iteration with Temporally Extended Actions
- scientific article; zbMATH DE number 1931843
Cited in
(16)- Learning and control of exploration primitives
- Offline reinforcement learning with task hierarchies
- scientific article; zbMATH DE number 2079791 (Why is no real title available?)
- Induction and exploitation of subgoal automata for reinforcement learning
- GENERATING EFFECTIVE INITIATION SETS FOR SUBGOAL-DRIVEN OPTIONS
- scientific article; zbMATH DE number 2070740 (Why is no real title available?)
- Reward-respecting subtasks for model-based reinforcement learning
- Reinforcement learning in the brain
- Inverse reinforcement learning via nonparametric spatio-temporal subgoal modeling
- Improving reinforcement learning by using sequence trees
- Reward Maximization Through Discrete Active Inference
- Probabilistic inference for determining options in reinforcement learning
- Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
- scientific article; zbMATH DE number 1931843 (Why is no real title available?)
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4737595)