
Undiscounted reinforcement learning algorithm based on performance potentials


zbMATH Open: 1123.68359 · MaRDI QID: Q5754334 · FDO: Q5754334


Authors: Ruyi Zhou, Yang Gao


Publication date: 22 August 2007





Recommendations

  • From perturbation analysis to Markov decision processes and reinforcement learning
  • Unified NDP method based on TD(0) learning for both average and discounted Markov decision processes
  • scientific article; zbMATH DE number 2159039
  • Algorithms for reinforcement learning.
  • Performance optimization algorithms based on potentials for semi-Markov control processes


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)








This page was last edited on 7 March 2024, at 05:01. Warning: Page may not contain recent updates.