Adaptive playouts for online learning of policies during Monte Carlo tree search (Q307776)

scientific article; zbMATH DE number 6623300

Statements

instance of

scholarly article

0 references

title

Adaptive playouts for online learning of policies during Monte Carlo tree search (English)

0 references

published in

Theoretical Computer Science

0 references

publication date

5 September 2016

0 references

zbMATH Keywords

Monte Carlo tree search

0 references

adaptive playouts

0 references

computer Go

0 references

reinforcement learning

0 references

describes a project that uses

PACHI

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/j.tcs.2016.06.029

0 references

cites work

Progressive strategies for Monte-Carlo tree search

0 references

Using deep convolutional neural networks in Monte Carlo tree search

0 references

Variance reduction techniques for gradient estimates in reinforcement learning

0 references

Monte-Carlo simulation balancing in practice

0 references

Investigating the limits of Monte-Carlo tree search methods in computer Go

0 references

Efficiency of static knowledge bias in Monte-Carlo tree search

0 references

Algorithms for reinforcement learning

0 references

Simple statistical gradient-following algorithms for connectionist reinforcement learning

0 references

Identifiers

zbMATH Open document ID

1370.68260

0 references

Mathematics Subject Classification ID

0 references

DOI

10.1016/J.TCS.2016.06.029

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:307776