Actor-critic algorithms for hierarchical Markov decision processes
From MaRDI portal
(Redirected from Publication:856510)
Recommendations
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- OnActor-Critic Algorithms
- An actor-critic algorithm for constrained Markov decision processes
- Reinforcement learning based algorithms for average cost Markov decision processes
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
Cites work
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 2154249 (Why is no real title available?)
- A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes
- A one-measurement form of simultaneous perturbation stochastic approximation
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- Asynchronous stochastic approximation and Q-learning
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Multilayer control of large Markov chains
- Multitime scale markov decision processes
- Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
- OnActor-Critic Algorithms
- Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
Cited in
(7)- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- An Actor-Critic Algorithm With Second-Order Actor and Critic
- Recent advances in hierarchical reinforcement learning
- scientific article; zbMATH DE number 1560499 (Why is no real title available?)
- Reinforcement learning based algorithms for average cost Markov decision processes
- Multi-actor mechanism for actor-critic reinforcement learning
- An actor-critic algorithm for constrained Markov decision processes
This page was built for publication: Actor-critic algorithms for hierarchical Markov decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q856510)