Some reward–penalty rules for the multi-armed bandit problem which are asymptotically optimal
Publication:4743532