Tsallis-INF: an optimal algorithm for stochastic and adversarial bandits
From MaRDI portal
Publication:4998901
Authors: Julian Zimmert, Yevgeny Seldin
Publication date: 9 July 2021
Full work available at URL: https://arxiv.org/abs/1807.07623
Keywords
- stochastic
- online learning
- Tsallis entropy
- multi-armed bandits
- bandits
- adversarial
- best of both worlds
- i.i.d.
- online mirror descent
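As context for the keywords above: the algorithm named in the title combines online mirror descent over the probability simplex with a Tsallis-entropy regularizer. The following is a schematic sketch only, with notation assumed rather than quoted from this record ($K$ arms, cumulative importance-weighted loss estimates $\hat L_{t-1}$, learning rate $\eta_t$, Tsallis power $\alpha \in (0,1)$):

```latex
% Sketch of an online-mirror-descent step with Tsallis-entropy
% regularization (assumed notation, not quoted from the paper):
w_{t} = \arg\min_{w \in \Delta_{K-1}}
        \left\{ \langle w, \hat{L}_{t-1} \rangle + \Psi_t(w) \right\},
\qquad
\Psi_t(w) = -\frac{1}{\eta_t} \sum_{i=1}^{K}
            \frac{w_i^{\alpha}}{\alpha\,(1-\alpha)} .
```

For $\alpha = 1/2$ the regularizer is proportional to $-\sum_i \sqrt{w_i}$, the choice the "best of both worlds" keyword refers to: a single algorithm with near-optimal regret in both the stochastic (i.i.d.) and adversarial regimes.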
Cites Work
- Elements of Information Theory
- Prediction, Learning, and Games
- Asymptotically efficient adaptive allocation rules
- Possible generalization of Boltzmann-Gibbs statistics
- The Nonstochastic Multiarmed Bandit Problem
- Some aspects of the sequential design of experiments
- Finite-time analysis of the multiarmed bandit problem
- Regret bounds and minimax policies under partial monitoring
- Online learning and online convex optimization
- Kullback-Leibler upper confidence bounds for optimal sequential allocation
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- Thompson sampling: an asymptotically optimal finite-time analysis
- Stochastic bandits robust to adversarial corruptions
- A generalized online mirror descent with applications to classification and regression
- Perturbation techniques in online learning and optimization
- Multi-player bandits revisited
Cited In (6)
- Interior-Point Methods for Full-Information and Bandit Online Learning
- Title not available
- Relaxing the i.i.d. assumption: adaptively minimax optimal regret via root-entropic regularization
- Improved regret for zeroth-order adversarial bandit convex optimisation
- Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits
- Online team formation under different synergies