Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion

From MaRDI portal

Publication:5485352

Jump to:navigation, search

DOI10.1017/S0269964800004149MaRDI QIDQ5485352zbMATH OpenFDO

Authors Stephen J. Herschkorn, Erol A. Peköz, Sheldon M. Ross

Publication date 30 August 2006

Published in Probability in the Engineering and Informational Sciences (Search for Journal in Brave)

Mathematics Subject Classification ID

Sequential statistical methods (62L99) Special processes (60K99) Stochastic games, stochastic differential games (91A15)

Recommendations

Cites work

Cited in

(7)

This page was built for publication: Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5485352)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5485352&oldid=30040326"