Actor-critic algorithms for hierarchical Markov decision processes
DOI10.1016/J.AUTOMATICA.2005.12.010zbMATH Open1102.93043OpenAlexW2027220971MaRDI QIDQ856510FDOQ856510
Authors: Shalabh Bhatnagar, J. Ranjan Panigrahi
Publication date: 7 December 2006
Published in: Automatica (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.automatica.2005.12.010
Recommendations
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- OnActor-Critic Algorithms
- An actor-critic algorithm for constrained Markov decision processes
- Reinforcement learning based algorithms for average cost Markov decision processes
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
stochastic approximationhierarchical decision makingoptimal controllearning algorithmsMarkov decision processes
Stochastic systems in control theory (general) (93E03) Optimal stochastic control (93E20) Stochastic learning and adaptive control (93E35)
Cites Work
- Title not available (Why is that?)
- Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
- OnActor-Critic Algorithms
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- Asynchronous stochastic approximation and Q-learning
- Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
- A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes
- A one-measurement form of simultaneous perturbation stochastic approximation
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Title not available (Why is that?)
- Multitime scale markov decision processes
- Multilayer control of large Markov chains
Cited In (7)
- Recent advances in hierarchical reinforcement learning
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- An Actor-Critic Algorithm With Second-Order Actor and Critic
- Multi-actor mechanism for actor-critic reinforcement learning
- An actor-critic algorithm for constrained Markov decision processes
- Reinforcement learning based algorithms for average cost Markov decision processes
- Title not available (Why is that?)
This page was built for publication: Actor-critic algorithms for hierarchical Markov decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q856510)