
The analysis of experimental results of reinforcement learning systems

From MaRDI portal
Publication:698951

zbMATH Open: 1021.68043; MaRDI QID: Q698951; FDO: Q698951


Authors: Jaroslav E. Poliscuk


Publication date: 30 September 2002

Published in: Computer Science Journal of Moldova





Recommendations

  • AN ANALYSIS OF EXPERIENCE REPLAY IN TEMPORAL DIFFERENCE LEARNING
  • Experimental analysis on Sarsa(λ) and Q(λ) with different eligibility traces strategies
  • TD(λ) learning without eligibility traces: a theoretical analysis
  • Reinforcement learning theory, algorithms and its application
  • scientific article; zbMATH DE number 1950579


zbMATH Keywords

Markov decision making process


Mathematics Subject Classification ID

Computational learning theory (68Q32)



Cited In (4)

  • Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function
  • AN ANALYSIS OF EXPERIENCE REPLAY IN TEMPORAL DIFFERENCE LEARNING
  • Improved SARSA and DQN algorithms for reinforcement learning
  • Experimental analysis on Sarsa(λ) and Q(λ) with different eligibility traces strategies







Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:698951&oldid=12616003"
This page was last edited on 30 January 2024, at 09:57. Warning: Page may not contain recent updates.