Some reward–penalty rules for the multi-armed bandit problem which are asymptotically optimal

From MaRDI portal
Publication:4743532














This page was built for publication: Some reward–penalty rules for the multi-armed bandit problem which are asymptotically optimal

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4743532)