Multi-policy iteration with a distributed voting.
From MaRDI portal
Publication:703154
DOI10.1007/S001860400362zbMATH Open1076.90065OpenAlexW1986055140MaRDI QIDQ703154FDOQ703154
Authors: Hyeong Soo Chang
Publication date: 11 January 2005
Published in: Mathematical Methods of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s001860400362
Recommendations
- A survey of some simulation-based algorithms for Markov decision processes
- CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
- Simulation-based algorithms for Markov decision processes.
- Policy set iteration for Markov decision processes
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
Cited In (1)
This page was built for publication: Multi-policy iteration with a distributed voting.
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q703154)