Sample mean based index policies by O(log n) regret for the multi-armed bandit problem

From MaRDI portal
Publication:4862097







Cited in (48)






