Bandit problems with Lévy processes
From MaRDI portal
Publication:5169656
Abstract: Bandit problems model the trade-off between exploration and exploitation in various decision problems. We study two-armed bandit problems in continuous time, where the risky arm can have two types: High or Low; both types yield stochastic payoffs generated by a Levy process. We show that the optimal strategy is a cut-off strategy and we provide an explicit expression for the cut-off and for the optimal payoff.
Recommendations
- Lévy bandits: Multi-armed bandits driven by Lévy processes
- Optimal stopping problems in Lévy models with random observations
- The two-armed bandit problem with continuous time in the presence of gradual and impulsive controls
- scientific article; zbMATH DE number 503440
- On the Optimal Reward Function of the Continuous Time Multiarmed Bandit Problem
Cited in
(17)- A bandit process with delayed responses
- scientific article; zbMATH DE number 5243902 (Why is no real title available?)
- scientific article; zbMATH DE number 4174703 (Why is no real title available?)
- Undiscounted bandit games
- Parameter estimation: the proper way to use Bayesian posterior processes with Brownian noise
- Batched bandit problems
- Bandits and Experts in Metric Spaces
- Optimal stopping problems in Lévy models with random observations
- A stochastic differential equation driven by Poisson random measure and its application in a duopoly market
- Common value experimentation
- Optimal learning with non-Gaussian rewards
- A common value experimentation with multiarmed bandits
- scientific article; zbMATH DE number 4059270 (Why is no real title available?)
- An exact bandit model for the risk-volatility tradeoff
- Two-armed restless bandits with imperfect information: stochastic control and indexability
- Lévy bandits under Poissonian decision times
- Lévy bandits: Multi-armed bandits driven by Lévy processes
This page was built for publication: Bandit problems with Lévy processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5169656)