
scientific article; zbMATH DE number 1501821

From MaRDI portal
Publication:4502995

zbMATH Open: 0963.68167; MaRDI QID: Q4502995; FDO: Q4502995

Cangpu Wu, Guanghua Hu

Publication date: 4 September 2000



Title of this publication is not available.



Recommendations

  • Model-free average reward multi-step reinforcement learning
  • Average reward reinforcement learning: foundations, algorithms, and empirical results
  • Model-based average reward reinforcement learning
  • Epoch-incremental reinforcement learning algorithms
  • 10.1162/153244303765208377


zbMATH Keywords

learning algorithm; R-learning


Mathematics Subject Classification ID

  • Learning and adaptive systems in artificial intelligence (68T05)
  • Stochastic learning and adaptive control (93E35)



Cited In (2)

  • \(R(\lambda)\) imitation learning for automatic generation control of interconnected power grids
  • Model-free average reward multi-step reinforcement learning






Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4502995&oldid=18594235"
This page was last edited on 7 February 2024, at 07:53. Warning: Page may not contain recent updates.