scientific article; zbMATH DE number 2086978
From MaRDI portal
Publication:4737595
zbMATH Open1077.68787MaRDI QIDQ4737595FDOQ4737595
Authors: Martin Stolle, Doina Precup
Publication date: 11 August 2004
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2371/23710212.htm
Title of this publication is not available (Why is that?)
Recommendations
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
- scientific article; zbMATH DE number 2070740
- Approximate Value Iteration with Temporally Extended Actions
- scientific article; zbMATH DE number 1931843
Cited In (15)
- Learning and control of exploration primitives
- Title not available (Why is that?)
- Induction and Exploitation of Subgoal Automata for Reinforcement Learning
- Title not available (Why is that?)
- Title not available (Why is that?)
- Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains
- Title not available (Why is that?)
- Probabilistic inference for determining options in reinforcement learning
- GENERATING EFFECTIVE INITIATION SETS FOR SUBGOAL-DRIVEN OPTIONS
- Reinforcement learning in the brain
- Offline reinforcement learning with task hierarchies
- Improving reinforcement learning by using sequence trees
- Reward Maximization Through Discrete Active Inference
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4737595)