Better algorithms for benign bandits
From MaRDI portal
Publication:4633809
zbMATH Open: 1425.91084; MaRDI QID: Q4633809; FDO: Q4633809
Authors: Elad Hazan, Satyen Kale
Publication date: 6 May 2019
Full work available at URL: https://dl.acm.org/citation.cfm?id=1496775
Mathematics Subject Classification: Learning and adaptive systems in artificial intelligence (68T05); Decision theory (91B06); Markov and semi-Markov decision processes (90C40); Probabilistic games; gambling (91A60)
Cited In (7)
- Title not available
- Extracting certainty from uncertainty: regret bounded by variation in costs
- Algorithms for adversarial bandit problems with multiple plays
- Regret bounded by gradual variation for online convex optimization
- Greedy Algorithm Almost Dominates in Smoothed Contextual Bandits
- Weighted last-step min-max algorithm with improved sub-logarithmic regret
- Learning Theory