A note on structural properties of the Bernoulli two-armed bandit problem
From MaRDI portal
Publication:4742584
DOI10.1080/02331938208842808zbMath0505.90080OpenAlexW2014864288MaRDI QIDQ4742584
Radu Theodorescu, Dieter Kalin
Publication date: 1982
Published in: Mathematische Operationsforschung und Statistik. Series Optimization (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1080/02331938208842808
finite horizonmonotonicity propertyBernoulli two-armed bandit problemoptimal expected cumulative discounted reward
Related Items (3)
On the Bernoulli three-armed bandit problem ⋮ A Two-Armed Bandit Problem with possibility of no Information ⋮ On monotone optimal decision rules and the stay-on-a-winner rule for the two-armed bandit
This page was built for publication: A note on structural properties of the Bernoulli two-armed bandit problem