Adaptive playouts for online learning of policies during Monte Carlo tree search

From MaRDI portal

(Redirected from Publication:307776)

Jump to:navigation, search

DOI10.1016/J.TCS.2016.06.029MaRDI QIDQ307776zbMATH OpenOpenAlexFDO

Authors Tobias Graf, Marco Platzner

Publication date 5 September 2016

Published in Theoretical Computer Science (Search for Journal in Brave)

Full work available at URL https://doi.org/10.1016/j.tcs.2016.06.029

zbMATH Keywords

reinforcement learning Monte Carlo tree search computer Go adaptive playouts

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20) Combinatorial games (91A46)

Recommendations

Cites work

Cited in

(3)

Describes a project that uses

Uses Software

PACHI

This page was built for publication: Adaptive playouts for online learning of policies during Monte Carlo tree search

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q307776)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Adaptive_playouts_for_online_learning_of_policies_during_Monte_Carlo_tree_search&oldid=60871629"