scientific article; zbMATH DE number 2086977
From MaRDI portal
Publication:4737593
Recommendations
- Equivalence notions and model minimization in Markov decision processes
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- scientific article; zbMATH DE number 1560499
- Recent advances in hierarchical reinforcement learning
- Abstraction and approximate decision-theoretic planning.
Cited in
(6)- Equivalence notions and model minimization in Markov decision processes
- A sufficient statistic for influence in structured multiagent environments
- Regret bounds for restless Markov bandits
- Bottom-up learning of hierarchical models in a class of deterministic pomdp environments
- A taxonomy for similarity metrics between Markov decision processes
- Before we can find a model, we must forget about perfection
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4737593)