A dynamic programming strategy to balance exploration and exploitation in the bandit problem

Publication:647433

DOI: 10.1007/S10472-010-9190-1
zbMATH Open: 1226.68079
OpenAlex: W2052471706
MaRDI QID: Q647433
FDO: Q647433


Authors: Olivier Caelen, Gianluca Bontempi


Publication date: 23 November 2011

Published in: Annals of Mathematics and Artificial Intelligence

Full work available at URL: https://doi.org/10.1007/s10472-010-9190-1









