Temporal-difference search in Computer Go
From MaRDI portal
Publication:420936
DOI10.1007/s10994-012-5280-0zbMath1238.91044MaRDI QIDQ420936
Publication date: 23 May 2012
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10994-012-5280-0
reinforcement learning; Computer Go; Monte Carlo search; simulation based search; temporal-difference learning
65C05: Monte Carlo methods
90C40: Markov and semi-Markov decision processes
91-08: Computational methods for problems pertaining to game theory, economics, and finance
91A46: Combinatorial games