Simulation-based search
From MaRDI portal
Publication:6198646
DOI10.4171/icm2022/180OpenAlexW4389775000MaRDI QIDQ6198646
David Silver, André M. S. Barreto
Publication date: 20 March 2024
Published in: International Congress of Mathematicians (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.4171/icm2022/180
Markov decision processesreinforcement learningplanningMonte-Carlo tree searchMonte-Carlo simulation
Learning and adaptive systems in artificial intelligence (68T05) Proceedings, conferences, collections, etc. pertaining to computer science (68-06) Reasoning under uncertainty in the context of artificial intelligence (68T37) Markov and semi-Markov decision processes (90C40)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Temporal-difference search in Computer Go
- Model predictive control: Theory and practice - a survey
- Convergence results for single-step on-policy reinforcement-learning algorithms
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- A comparison of minimax tree search algorithms
- Approximate Dynamic Programming
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- Deep Blue
- World-championship-caliber Scrabble*
- Finite-time analysis of the multiarmed bandit problem
This page was built for publication: Simulation-based search