Extensions of the multiarmed bandit problem: The discounted case
From MaRDI portal
Publication:3682272
Cited in (70)
- A bisection/successive approximation method for computing Gittins indices
- Index policy for multiarmed bandit problem with dynamic risk measures
- On the Worth of Perfect Information in Bandits with Random Discounting
- Optimistic Gittins Indices
- A discounted uniform one-armed bandit problem
- Open bandit processes with uncountable states and time-backward effects
- scientific article; zbMATH DE number 548896
- Optimal strategies for families of alternative bandit processes
- Competing Markov decision processes
- On the problem of the two-armed bandit with impulse controls and discounting
- Stationary multi-choice bandit problems.
- A general theory of multiarmed bandit processes with constrained arm switches
- Dynamic priority allocation via restless bandit marginal productivity indices
- New results for generalized bandit problems
- scientific article; zbMATH DE number 4078557
- On an Optimal Stopping Problem for Multi-Parameter Diffusion Processes
- Derman's book as inspiration: some results on LP for MDPs
- The multi-armed bandit, with constraints
- Asymptotic properties of bandit processes with geometric responses.
- Sequencing an N-Stage Process with Feedback
- Optimal stopping problems for multiarmed bandit processes with arms' independence
- Dynamic stochastic dominance in bandit decision problems
- A generalized Gittins index for a Markov chain and its recursive calculation
- Four proofs of Gittins' multiarmed bandit theorem
- On the Gittins index in the M/G/1 queue
- Optimal control of single-server queueing networks
- scientific article; zbMATH DE number 3891095
- Sample path methods in the control of queues
- Reading policies for joins: an asymptotic analysis
- A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems
- A survey of Markov decision models for control of networks of queues
- Performance evaluation of scheduling control of queueing networks: Fluid model heuristics
- Multi-armed bandits under general depreciation and commitment
- Multi-armed bandits in discrete and continuous time
- scientific article; zbMATH DE number 4056829
- The Multi-Armed Bandit Problem: Decomposition and Computation
- Optimal stopping for Brownian motion with applications to sequential analysis and option pricing
- The one-armed \(\mathrm{Erlang}(k)\) bandit reward process
- Simultaneous optimization of flow control and scheduling in a single server queue with two job classes
- Simultaneous optimization of flow-control and scheduling in a single server queue with two job classes: Numerical results and approximation
- Discrete multiarmed bandits and multiparameter processes
- Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications
- Tax problems in the undiscounted case
- Dynamic allocation policies for the finite horizon one armed bandit problem
- Denumerable-Armed Bandits
- Branching Bandit Processes
- Discounted Multiarmed Bandit Problems on a Collection of Machines with Varying Speeds
- Bandit and covariate processes, with finite or non-denumerable set of arms
- A perpetual search for talents across overlapping generations: a learning process
- Stochastic scheduling of parallel queues with set-up costs
- Evaluating strategies for generalized bandit problems
- Multi-armed bandit problem revisited
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
- scientific article; zbMATH DE number 3854141
- On Gittins' index theorem in continuous time
- A new algorithm for the multi-item exponentially discounted optimal selection problem.
- Stochastic scheduling and forwards induction
- On the evaluation of strategies for branching bandit processes
- Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems
- Optimal, recursive procedures of identification
- Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation
- The achievable region method in the optimal control of queueing systems; formulations, bounds and policies
- Generalized Bandit Problems
- Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic
- Flow time distributions in a \(K\) class \(M/G/1\) priority feedback queue
- Independently Expiring Multiarmed Bandits
- A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning
- Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
- Optimal intensity control of a multi-class queue
- scientific article; zbMATH DE number 47588