Temporal-difference search in Computer Go (Q420936): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot
Changed an Item
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank: Normal rank
Property / cites work: Learning to play chess using temporal differences / rank: Normal rank
Property / cites work: Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength / rank: Normal rank
Property / cites work: Q4000104 / rank: Normal rank
Property / cites work: Analytical mean squared error curves for temporal difference learning / rank: Normal rank
Property / cites work: Amazons Discover Monte-Carlo / rank: Normal rank
Property / cites work: Computer Go / rank: Normal rank
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank: Normal rank
Property / cites work: An Analysis of UCT in Multi-player Games / rank: Normal rank
Property / cites work: 10.1162/153244303768966102 / rank: Normal rank
Property / cites work: Q3724211 / rank: Normal rank

Latest revision as of 05:59, 5 July 2024

Language: English
Label: Temporal-difference search in Computer Go
Description: scientific article

    Statements

    Temporal-difference search in Computer Go (English)
    23 May 2012
    reinforcement learning
    temporal-difference learning
    Monte Carlo search
    simulation based search
    Computer Go
