Adaptive playouts for online learning of policies during Monte Carlo tree search
From MaRDI portal
(Redirected from Publication:307776)
Recommendations
Cites work
- Algorithms for reinforcement learning.
- Efficiency of static knowledge bias in Monte-Carlo tree search
- Investigating the limits of Monte-Carlo tree search methods in computer Go
- Monte-Carlo simulation balancing in practice
- PROGRESSIVE STRATEGIES FOR MONTE-CARLO TREE SEARCH
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Using deep convolutional neural networks in Monte Carlo tree search
- Variance reduction techniques for gradient estimates in reinforcement learning
Cited in
(3)
This page was built for publication: Adaptive playouts for online learning of policies during Monte Carlo tree search
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q307776)