Introduction to Multi-Armed Bandits
Publication:5213200
DOI10.1561/2200000068zbMath1478.68006arXiv1904.07272OpenAlexW4206275166WikidataQ126833114 ScholiaQ126833114MaRDI QIDQ5213200
Publication date: 31 January 2020
Published in: Foundations and Trends® in Machine Learning (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1904.07272
Introductory exposition (textbooks, tutorial papers, etc.) pertaining to computer science (68-01) Learning and adaptive systems in artificial intelligence (68T05) Introductory exposition (textbooks, tutorial papers, etc.) pertaining to probability theory (60-01) Stopping times; optimal stopping problems; gambling theory (60G40) Rationality and learning in game theory (91A26) Multistage and repeated games (91A20) Randomized algorithms (68W20) Probabilistic games; gambling (91A60) Online algorithms; streaming algorithms (68W27)
Related Items (26)
This page was built for publication: Introduction to Multi-Armed Bandits