A note on the price of bandit feedback for mistake-bounded online learning
DOI: 10.1016/J.TCS.2021.05.009
zbMATH Open: 1504.68085
arXiv: 2101.06891
OpenAlex: W3160023654
MaRDI QID: Q2034409
FDO: Q2034409
Authors: Jesse Geneson
Publication date: 22 June 2021
Published in: Theoretical Computer Science
Full work available at URL: https://arxiv.org/abs/2101.06891
Recommendations
- New bounds on the price of bandit feedback for mistake-bounded online multiclass learning
- Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning
- Mistake bounds on the noise-free multi-armed bandit game
- On-line learning with linear loss constraints.
- Classification and discrimination; cluster analysis (statistical aspects) (62H30)
- Learning and adaptive systems in artificial intelligence (68T05)
- Online algorithms; streaming algorithms (68W27)
- Computational learning theory (68Q32)
- Optimal stopping in statistics (62L15)
Cites Work
- Title not available
- Pairwise independence and derandomization.
- Structural results about on-line learning models with and without queries
- Title not available
- On the complexity of function learning
- New bounds on the price of bandit feedback for mistake-bounded online multiclass learning
Cited In (3)