LinUCB applied to Monte Carlo tree search
From MaRDI portal
Publication:307792
DOI10.1016/J.TCS.2016.06.035zbMATH Open1370.68266OpenAlexW2473144994MaRDI QIDQ307792FDOQ307792
Authors: Yusaku Mandai, Tomoyuki Kaneko
Publication date: 5 September 2016
Published in: Theoretical Computer Science (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.tcs.2016.06.035
Recommendations
Learning and adaptive systems in artificial intelligence (68T05) Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20) Combinatorial games (91A46)
Cites Work
- Finite-time analysis of the multiarmed bandit problem
- Deep Blue
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- An analysis of alpha-beta pruning
- Best-first minimax search
- Large-Scale Optimization for Evaluation Functions with Minimax Search
- Computer Go: An AI oriented survey
- Multi-armed bandits with episode context
Cited In (3)
This page was built for publication: LinUCB applied to Monte Carlo tree search
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q307792)