On the Optimal Reward Function of the Continuous Time Multiarmed Bandit Problem
From MaRDI portal
Publication:3200906
Recommendations
Cited in
(13)- A general theory of multiarmed bandit processes with constrained arm switches
- Regret and Convergence Bounds for a Class of Continuum-Armed Bandit Problems
- Minimax Off-Policy Evaluation for Multi-Armed Bandits
- Multi-armed bandit processes with optimal selection of the operating times
- Finite-time analysis of the multiarmed bandit problem
- scientific article; zbMATH DE number 736275 (Why is no real title available?)
- Randomization in the two-armed bandit problem
- The system of quasi-variational inequalities attached to the two-armed bandit problem
- Optimal activation of halting multi‐armed bandit models
- scientific article; zbMATH DE number 4064879 (Why is no real title available?)
- Bandit problems with Lévy processes
- Applicable stochastic control: From theory to practice
- On monotone optimal decision rules and the stay-on-a-winner rule for the two-armed bandit
This page was built for publication: On the Optimal Reward Function of the Continuous Time Multiarmed Bandit Problem
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3200906)