scientific article; zbMATH DE number 2086977
From MaRDI portal
Publication:4737593
zbMATH Open1077.68781MaRDI QIDQ4737593FDOQ4737593
Authors: B. Ravindran, Andrew G. Barto
Publication date: 11 August 2004
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2371/23710196.htm
Title of this publication is not available (Why is that?)
Recommendations
- Equivalence notions and model minimization in Markov decision processes
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- scientific article; zbMATH DE number 1560499
- Recent advances in hierarchical reinforcement learning
- Abstraction and approximate decision-theoretic planning.
Cited In (6)
- Equivalence notions and model minimization in Markov decision processes
- A sufficient statistic for influence in structured multiagent environments
- Regret bounds for restless Markov bandits
- Bottom-up learning of hierarchical models in a class of deterministic pomdp environments
- A taxonomy for similarity metrics between Markov decision processes
- Before we can find a model, we must forget about perfection
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4737593)