Small-Loss Bounds for Online Learning with Partial Information
From MaRDI portal
Publication:5868953
DOI10.1287/moor.2021.1204MaRDI QIDQ5868953
Éva Tardos, Karthik Sridharan, Thodoris Lykouris
Publication date: 26 September 2022
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1711.03639
online learning; partial information; contextual bandits; regret bounds; bandit algorithms; high probability; feedback graphs; first-order bounds; semi-bandits; small-loss bounds
68Q32: Computational learning theory
Uses Software