Exponential two-armed bandit problem
From MaRDI portal
Publication:6323698
arXiv1908.05531MaRDI QIDQ6323698FDOQ6323698
Authors: Alex V. Kolnogorov, Denis Grunev
Publication date: 15 August 2019
Abstract: We consider exponential two-armed bandit problem in which incomes are described by exponential distribution densities. We develop Bayesian approach and present recursive equation for determination of Bayesian strategy and Bayesian risk. In the limiting case as the control horizon goes to infinity, we obtain the second order partial differential equation in the domain of "close distributions". Results are compared with Gaussian two-armed bandit. It turned out that exponential and Gaussian two-armed bandits have the same description in the limiting case. Since Gaussian two-armed bandit describes the batch processing, this means that in case of exponential two-armed bandit batch processing does not enlarge Bayesian risk in comparison with one-by-one optimal processing as the total number of processed data items goes to infinity.
Sequential statistical design (62L05) Bayesian problems; characterization of Bayes procedures (62C10) Minimax procedures in statistical decision theory (62C20) Robustness and adaptive procedures (parametric inference) (62F35) Optimal stochastic control (93E20)
This page was built for publication: Exponential two-armed bandit problem
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6323698)