Regret bounds for sleeping experts and bandits

From MaRDI portal

Publication:1959599

Jump to:navigation, search

DOI10.1007/s10994-010-5178-7zbMath1370.68254OpenAlexW2008098735MaRDI QIDQ1959599

Alexandru Niculescu-Mizil, Yogeshwer Sharma, Robert D. Kleinberg

Publication date: 7 October 2010

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10994-010-5178-7

zbMATH Keywords

computational learning theory online algorithms regret

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Online algorithms; streaming algorithms (68W27)

Related Items (10)

Sleeping experts and bandits approach to constrained Markov decision processes ⋮ The \(K\)-armed dueling bandits problem ⋮ Finite-Time Analysis for the Knowledge-Gradient Policy ⋮ Ballooning multi-armed bandits ⋮ Near-Optimal Algorithms for Online Matrix Prediction ⋮ Truthful Mechanisms with Implicit Payment Computation ⋮ Learning Hurdles for Sleeping Experts ⋮ Online Collaborative Filtering on Graphs ⋮ Unnamed Item ⋮ A unified framework for online trip destination prediction

Cites Work

This page was built for publication: Regret bounds for sleeping experts and bandits

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1959599&oldid=14400689"