Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion

From MaRDI portal
Publication:5485352












This page was built for publication: Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5485352)