Temporal-difference search in Computer Go
DOI: 10.1007/s10994-012-5280-0; zbMATH Open: 1238.91044; OpenAlex: W2153039919; MaRDI QID: Q420936; FDO: Q420936
Authors: David Silver, Richard S. Sutton, Martin Müller
Publication date: 23 May 2012
Published in: Machine Learning
Full work available at URL: https://doi.org/10.1007/s10994-012-5280-0
reinforcement learning; Computer Go; Monte Carlo search; simulation based search; temporal-difference learning
Monte Carlo methods (65C05); Markov and semi-Markov decision processes (90C40); Combinatorial games (91A46); Computational methods for problems pertaining to game theory, economics, and finance (91-08)
Cites Work
- Title not available
- Title not available
- Finite-time analysis of the multiarmed bandit problem
- Amazons Discover Monte-Carlo
- Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength
- Analytical mean squared error curves for temporal difference learning
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Learning to play chess using temporal differences
- 10.1162/153244303768966102
- An Analysis of UCT in Multi-player Games
- Computer Go
Cited In (13)
- On Monte Carlo tree search and reinforcement learning
- The \(PN^{*}\)-search algorithm: Application to tsume-shogi
- Monte-Carlo approximation of temperature
- Title not available
- Goal threats, temperature, and Monte-Carlo Go
- Learning to play chess using temporal differences
- Artificial Intelligence and Soft Computing - ICAISC 2004
- Simulation-based search
- Using deep convolutional neural networks in Monte Carlo tree search
- Spatial state-action features for general games
- Monte Carlo Go capturing tactic search
- Temporal difference learning applied to game playing and the results of application to Shogi
- Default policies for global optimisation of noisy functions with severe noise