Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning

From MaRDI portal

Publication:6162075

DOI: 10.1016/j.tcs.2023.113980
zbMath: 1517.68150
arXiv: 2209.01366
OpenAlex: W4378675206
MaRDI QID: Q6162075

Publication date: 15 June 2023

Published in: Theoretical Computer Science

Full work available at URL: https://arxiv.org/abs/2209.01366

zbMATH Keywords

learning theory; online learning; mistake-bound model; bandit feedback

Mathematics Subject Classification ID

Computational learning theory (68Q32)
Learning and adaptive systems in artificial intelligence (68T05)
Online algorithms; streaming algorithms (68W27)
