Temporal-difference search in Computer Go
From MaRDI portal
Recommendations
Cites work
- scientific article; zbMATH DE number 3954793 (Why is no real title available?)
- scientific article; zbMATH DE number 50447 (Why is no real title available?)
- 10.1162/153244303768966102
- Amazons Discover Monte-Carlo
- An Analysis of UCT in Multi-player Games
- Analytical mean squared error curves for temporal difference learning
- Computer Go
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Finite-time analysis of the multiarmed bandit problem
- Learning to play chess using temporal differences
- Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength
Cited in
(13)- Artificial Intelligence and Soft Computing - ICAISC 2004
- Temporal difference learning applied to game playing and the results of application to Shogi
- MONTE CARLO GO CAPTURING TACTIC SEARCH
- On Monte Carlo tree search and reinforcement learning
- Goal threats, temperature, and Monte-Carlo Go
- Spatial state-action features for general games
- Simulation-based search
- The \(PN^{*}\)-search algorithm: Application to tsume-shogi
- scientific article; zbMATH DE number 1759682 (Why is no real title available?)
- Learning to play chess using temporal differences
- Default policies for global optimisation of noisy functions with severe noise
- Using deep convolutional neural networks in Monte Carlo tree search
- Monte-Carlo approximiation of temperature
This page was built for publication: Temporal-difference search in Computer Go
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q420936)