Temporal-difference search in Computer Go (Q420936): Difference between revisions
From MaRDI portal
Created a new Item |
ReferenceBot (talk | contribs) Changed an Item |
||
(4 intermediate revisions by 4 users not shown) | |||
Property / author | |||
Property / author: H. S. Yoon / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 91A46 / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 91-08 / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 90C40 / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 65C05 / rank | |||
Normal rank | |||
Property / zbMATH DE Number | |||
Property / zbMATH DE Number: 6037853 / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
reinforcement learning | |||
Property / zbMATH Keywords: reinforcement learning / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
temporal-difference learning | |||
Property / zbMATH Keywords: temporal-difference learning / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
Monte Carlo search | |||
Property / zbMATH Keywords: Monte Carlo search / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
simulation based search | |||
Property / zbMATH Keywords: simulation based search / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
Computer Go | |||
Property / zbMATH Keywords: Computer Go / rank | |||
Normal rank | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/s10994-012-5280-0 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2153039919 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Learning to play chess using temporal differences / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4000104 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Analytical mean squared error curves for temporal difference learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Amazons Discover Monte-Carlo / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Computer Go / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: An Analysis of UCT in Multi-player Games / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: 10.1162/153244303768966102 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3724211 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 05:59, 5 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Temporal-difference search in Computer Go |
scientific article |
Statements
Temporal-difference search in Computer Go (English)
0 references
23 May 2012
0 references
reinforcement learning
0 references
temporal-difference learning
0 references
Monte Carlo search
0 references
simulation based search
0 references
Computer Go
0 references