Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)
From MaRDI portal
(Redirected from Publication:664264)
Recommendations
- Finding minimax strategy and minimax risk for Bernoulli multi-armed bandit
- Publication:3479957
- Robust normal two-armed bandit and parallel data processing
- Parallel design of robust control in the stochastic environment (the two-armed bandit problem)
- Minimax and bates strategies for the discounted infinite horizon one-armed-bandit.—explicit formulae and structural properties
Cites work
- scientific article; zbMATH DE number 3831758 (Why is no real title available?)
- scientific article; zbMATH DE number 3845417 (Why is no real title available?)
- scientific article; zbMATH DE number 3359804 (Why is no real title available?)
- scientific article; zbMATH DE number 3372755 (Why is no real title available?)
- An Asymptotic Minimax Theorem for the Two Armed Bandit Problem
- Some Remarks on the Two-Armed Bandit
- Some aspects of the sequential design of experiments
Cited in
(10)- Improving the RHC-strategy
- Robust normal two-armed bandit and parallel data processing
- Finding minimax strategy and minimax risk for Bernoulli multi-armed bandit
- Gaussian two-armed bandit and optimization of batch data processing
- Poissonian two-armed bandit: a new approach
- scientific article; zbMATH DE number 3846699 (Why is no real title available?)
- scientific article; zbMATH DE number 4150069 (Why is no real title available?)
- Minimax and bates strategies for the discounted infinite horizon one-armed-bandit.—explicit formulae and structural properties
- One-armed bandit problem for parallel data processing systems
- Parallel design of robust control in the stochastic environment (the two-armed bandit problem)
This page was built for publication: Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q664264)