A better resource allocation algorithm with semi-bandit feedback
From MaRDI portal
Publication:4617606
Recommendations
- On Bayesian index policies for sequential resource allocation
- An Efficient Algorithm for Learning with Semi-bandit Feedback
- Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part I: I.I.D. rewards
- Kullback-Leibler upper confidence bounds for optimal sequential allocation
- On the improvement of allocation rules for multi-armed bandit problem
Cited in
(3)
This page was built for publication: A better resource allocation algorithm with semi-bandit feedback
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4617606)