Small-Loss Bounds for Online Learning with Partial Information

Publication:5868953

DOI: 10.1287/moor.2021.1204

MaRDI QID: Q5868953

Éva Tardos, Karthik Sridharan, Thodoris Lykouris

Publication date: 26 September 2022

Published in: Mathematics of Operations Research

Full work available at URL: https://arxiv.org/abs/1711.03639

zbMATH Keywords

online learning; partial information; contextual bandits; regret bounds; bandit algorithms; high probability; feedback graphs; first-order bounds; semi-bandits; small-loss bounds

Mathematics Subject Classification ID

68Q32: Computational learning theory

Uses Software

AdaBoost.MH
