Temporal-difference search in Computer Go (Q420936): Difference between revisions

From MaRDI portal
Importer: Created a new Item
ReferenceBot: Changed an Item
(4 intermediate revisions by 4 users not shown)
Property / author
 
Property / author: H. S. Yoon / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 91A46 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 91-08 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 90C40 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 65C05 / rank
 
Normal rank
Property / zbMATH DE Number
 
Property / zbMATH DE Number: 6037853 / rank
 
Normal rank
Property / zbMATH Keywords
 
reinforcement learning
Property / zbMATH Keywords: reinforcement learning / rank
 
Normal rank
Property / zbMATH Keywords
 
temporal-difference learning
Property / zbMATH Keywords: temporal-difference learning / rank
 
Normal rank
Property / zbMATH Keywords
 
Monte Carlo search
Property / zbMATH Keywords: Monte Carlo search / rank
 
Normal rank
Property / zbMATH Keywords
 
simulation based search
Property / zbMATH Keywords: simulation based search / rank
 
Normal rank
Property / zbMATH Keywords
 
Computer Go
Property / zbMATH Keywords: Computer Go / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s10994-012-5280-0 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2153039919 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning to play chess using temporal differences / rank
 
Normal rank
Property / cites work
 
Property / cites work: Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4000104 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analytical mean squared error curves for temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Amazons Discover Monte-Carlo / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computer Go / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Analysis of UCT in Multi-player Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303768966102 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3724211 / rank
 
Normal rank
 

Latest revision as of 05:59, 5 July 2024

Language: English
Label: Temporal-difference search in Computer Go
Description: scientific article

    Statements

    Temporal-difference search in Computer Go (English)
    23 May 2012
    reinforcement learning
    temporal-difference learning
    Monte Carlo search
    simulation based search
    Computer Go
