Exploration-exploitation tradeoff using variance estimates in multi-armed bandits

Publication:1017665

DOI: 10.1016/J.TCS.2009.01.016
zbMATH Open: 1167.68059
OpenAlex: W2142971854
MaRDI QID: Q1017665
FDO: Q1017665


Authors: Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári


Publication date: 12 May 2009

Published in: Theoretical Computer Science

Full work available at URL: https://doi.org/10.1016/j.tcs.2009.01.016










Cited In (45)




