Multi-armed bandits with episode context
From MaRDI portal
Publication:766259
DOI10.1007/S10472-011-9258-6zbMATH Open1234.68171OpenAlexW1977989560WikidataQ104573352 ScholiaQ104573352MaRDI QIDQ766259FDOQ766259
Authors: Christopher D. Rosin
Publication date: 23 March 2012
Published in: Annals of Mathematics and Artificial Intelligence (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10472-011-9258-6
Recommendations
- Introduction to multi-armed bandits
- Multi-armed bandit problem revisited
- Multi-armed bandits in discrete and continuous time
- Multi-objective Contextual Multi-armed Bandit With a Dominant Objective
- The multi-armed bandit, with constraints
- Multi-armed bandit with sub-exponential rewards
- The Multi-Armed Bandit With Stochastic Plays
Learning and adaptive systems in artificial intelligence (68T05) Computational learning theory (68Q32)
Cites Work
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Pure exploration in multi-armed bandits problems
- The Nonstochastic Multiarmed Bandit Problem
- Finite-time analysis of the multiarmed bandit problem
- The sample complexity of exploration in the multi-armed bandit problem
- PROGRESSIVE STRATEGIES FOR MONTE-CARLO TREE SEARCH
- Computer Go: An AI oriented survey
- Multi-armed bandits with episode context
- A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem
Cited In (5)
This page was built for publication: Multi-armed bandits with episode context
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q766259)