Multi-armed bandits with episode context
From MaRDI portal
Publication:766259
Recommendations
- Introduction to multi-armed bandits
- Multi-armed bandit problem revisited
- Multi-armed bandits in discrete and continuous time
- Multi-objective Contextual Multi-armed Bandit With a Dominant Objective
- The multi-armed bandit, with constraints
- Multi-armed bandit with sub-exponential rewards
- The Multi-Armed Bandit With Stochastic Plays
Cites work
- A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Computer Go: An AI oriented survey
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Finite-time analysis of the multiarmed bandit problem
- Multi-armed bandits with episode context
- Progressive strategies for Monte-Carlo tree search
- Pure exploration in multi-armed bandits problems
- The Nonstochastic Multiarmed Bandit Problem
- The sample complexity of exploration in the multi-armed bandit problem
Cited in (5)