Multi-policy iteration with a distributed voting.

From MaRDI portal
Publication:703154

DOI: 10.1007/S001860400362
zbMATH Open: 1076.90065
OpenAlex: W1986055140
MaRDI QID: Q703154
FDO: Q703154


Authors: Hyeong Soo Chang


Publication date: 11 January 2005

Published in: Mathematical Methods of Operations Research

Full work available at URL: https://doi.org/10.1007/s001860400362




Recommendations

  • A survey of some simulation-based algorithms for Markov decision processes
  • CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
  • Simulation-based algorithms for Markov decision processes.
  • Policy set iteration for Markov decision processes
  • Actor-Critic--Type Learning Algorithms for Markov Decision Processes


Mathematics Subject Classification ID

  • Markov and semi-Markov decision processes (90C40)
  • Voting theory (91B12)



Cited In (1)

  • A distributed voting scheme to maximize preferences






Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:703154&oldid=12616323"
This page was last edited on 30 January 2024, at 09:57. Warning: Page may not contain recent updates.