Temporal-difference search in Computer Go (Q420936): Difference between revisions

From MaRDI portal
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Property / full work available at URL: https://doi.org/10.1007/s10994-012-5280-0 / rank: Normal rank
Property / OpenAlex ID: W2153039919 / rank: Normal rank
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank: Normal rank
Property / cites work: Learning to play chess using temporal differences / rank: Normal rank
Property / cites work: Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength / rank: Normal rank
Property / cites work: Q4000104 / rank: Normal rank
Property / cites work: Analytical mean squared error curves for temporal difference learning / rank: Normal rank
Property / cites work: Amazons Discover Monte-Carlo / rank: Normal rank
Property / cites work: Computer Go / rank: Normal rank
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank: Normal rank
Property / cites work: An Analysis of UCT in Multi-player Games / rank: Normal rank
Property / cites work: 10.1162/153244303768966102 / rank: Normal rank
Property / cites work: Q3724211 / rank: Normal rank
 

Latest revision as of 06:59, 5 July 2024

Language: English
Label: Temporal-difference search in Computer Go
Description: scientific article

    Statements

    Temporal-difference search in Computer Go (English)
    23 May 2012
    reinforcement learning
    temporal-difference learning
    Monte Carlo search
    simulation based search
    Computer Go