Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback

From MaRDI portal

Publication:4596721

DOI: 10.1137/140989455, zbMath: 1375.68097, arXiv: 1409.8428, OpenAlex: W2962927562, MaRDI QID: Q4596721

Yishay Mansour, Nicolò Cesa-Bianchi, Ohad Shamir, Noga Alon, Claudio Gentile, Shie Mannor

Publication date: 8 December 2017

Published in: SIAM Journal on Computing

Full work available at URL: https://arxiv.org/abs/1409.8428

zbMATH Keywords

graph theory, online learning, multi-armed bandits, learning from experts, learning with partial feedback

Mathematics Subject Classification ID

Analysis of algorithms (68W40)
Learning and adaptive systems in artificial intelligence (68T05)
Applications of game theory (91A80)
Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)
Online algorithms; streaming algorithms (68W27)

Related Items

Best Arm Identification for Contaminated Bandits, Improved algorithms for bandit with graph feedback via regret decomposition, Unnamed Item, Unnamed Item, Small-Loss Bounds for Online Learning with Partial Information

Uses Software

AdaBoost.MH
