The analysis of experimental results of reinforcement learning systems
Publication:698951
zbMATH Open: 1021.68043
MaRDI QID: Q698951
FDO: Q698951
Authors: Jaroslav E. Poliscuk
Publication date: 30 September 2002
Published in: Computer Science Journal of Moldova
Recommendations
- AN ANALYSIS OF EXPERIENCE REPLAY IN TEMPORAL DIFFERENCE LEARNING
- Experimental analysis on Sarsa(λ) and Q(λ) with different eligibility traces strategies
- TD(λ) learning without eligibility traces: a theoretical analysis
- Reinforcement learning theory, algorithms and its application
- scientific article; zbMATH DE number 1950579
Cited In (4)
- Reliability of internal prediction/estimation and its application. I: Adaptive action selection reflecting reliability of value function
- AN ANALYSIS OF EXPERIENCE REPLAY IN TEMPORAL DIFFERENCE LEARNING
- Improved SARSA and DQN algorithms for reinforcement learning
- Experimental analysis on Sarsa(λ) and Q(λ) with different eligibility traces strategies