Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
Publication:1606316
DOI: 10.1016/S0004-3702(99)00052-1 · zbMath: 0996.68151 · OpenAlex: W2109910161 · MaRDI QID: Q1606316
Doina Precup, Richard S. Sutton, Satinder Pal Singh
Publication date: 24 July 2002
Published in: Artificial Intelligence
Full work available at URL: https://doi.org/10.1016/s0004-3702(99)00052-1
Related Items (46)
A time aggregation approach to Markov decision processes ⋮ Clustering Based Approximation Procedure for Semi-Markov Decision Processes with Incomplete State Information ⋮ On Efficient Reinforcement Learning for Full-length Game of StarCraft II ⋮ Multiple Model-Based Reinforcement Learning ⋮ Probabilistic inference for determining options in reinforcement learning ⋮ Actor-critic algorithms for hierarchical Markov decision processes ⋮ A policy gradient method for semi-Markov decision processes with application to call admission control ⋮ Hybrid MDP based integrated hierarchical Q-learning ⋮ Detect, understand, act: a neuro-symbolic hierarchical reinforcement learning framework ⋮ Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning ⋮ Continual curiosity-driven skill acquisition from high-dimensional video inputs for humanoid robots ⋮ Exact decomposition approaches for Markov decision processes: a survey ⋮ Automated Reinforcement Learning (AutoRL): A Survey and Open Problems ⋮ Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning ⋮ AUTOMATIC COMPLEXITY REDUCTION IN REINFORCEMENT LEARNING ⋮ Reward-respecting subtasks for model-based reinforcement learning ⋮ Overlapping layered learning ⋮ Unnamed Item ⋮ Unnamed Item ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ Automated imbalanced classification via layered learning ⋮ Offline reinforcement learning with task hierarchies ⋮ Reinforcement learning algorithms with function approximation: recent advances and applications ⋮ GENERATING EFFECTIVE INITIATION SETS FOR SUBGOAL-DRIVEN OPTIONS ⋮ Hierarchical method for cooperative multiagent reinforcement learning in Markov decision processes ⋮ Unnamed Item ⋮ Improving reinforcement learning by using sequence trees ⋮ Optimal Curiosity-Driven Modular Incremental Slow Feature Analysis ⋮ Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains ⋮ Learning and control of exploration primitives ⋮ Deep Reinforcement Learning: A State-of-the-Art Walkthrough ⋮ Planning and navigation as active inference ⋮ Reinforcement learning endowed with safe veto policies to learn the control of linked-multicomponent robotic systems ⋮ Unifying temporal and organizational scales in multiscale decision-making ⋮ A BIOLOGICALLY INSPIRED HIERARCHICAL REINFORCEMENT LEARNING SYSTEM ⋮ Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices ⋮ Transfer in variable-reward hierarchical reinforcement learning ⋮ Challenges of real-world reinforcement learning: definitions, benchmarks and analysis ⋮ Unnamed Item ⋮ Reinforcement learning in the brain ⋮ Reinforcement Learning in Sparse-Reward Environments With Hindsight Policy Gradients ⋮ Hierarchical clustering optimizes the tradeoff between compositionality and expressivity of task structures for flexible reinforcement learning ⋮ A Sufficient Statistic for Influence in Structured Multiagent Environments ⋮ Induction and Exploitation of Subgoal Automata for Reinforcement Learning ⋮ Model-based Reinforcement Learning: A Survey ⋮ Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning