Multi-policy iteration with a distributed voting.
From MaRDI portal
(Redirected from Publication:703154)
Recommendations
- A survey of some simulation-based algorithms for Markov decision processes
- CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
- Simulation-based algorithms for Markov decision processes.
- Policy set iteration for Markov decision processes
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
This page was built for publication: Multi-policy iteration with a distributed voting.
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q703154)