Pages that link to "Item:Q3682272"
From MaRDI portal
The following pages link to Extensions of the multiarmed bandit problem: The discounted case (Q3682272):
Displaying 49 items.
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Q254442) (← links)
- Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic (Q333075) (← links)
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- The multi-armed bandit, with constraints (Q378726) (← links)
- Derman's book as inspiration: some results on LP for MDPs (Q378728) (← links)
- Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352) (← links)
- A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning (Q859737) (← links)
- A perpetual search for talents across overlapping generations: a learning process (Q898767) (← links)
- Dynamic priority allocation via restless bandit marginal productivity indices (Q926578) (← links)
- A generalized Gittins index for a Markov chain and its recursive calculation (Q945795) (← links)
- A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems (Q949395) (← links)
- Flow time distributions in a \(K\) class \(M/G/1\) priority feedback queue (Q1175219) (← links)
- On the evaluation of strategies for branching bandit processes (Q1178448) (← links)
- Multi-armed bandits in discrete and continuous time (Q1296724) (← links)
- Discrete multiarmed bandits and multiparameter processes (Q1317211) (← links)
- Optimal stopping problems for multiarmed bandit processes with arms' independence (Q1324371) (← links)
- Multi-armed bandit problem revisited (Q1337211) (← links)
- Stochastic scheduling and forwards induction (Q1346693) (← links)
- Optimal stopping for Brownian motion with applications to sequential analysis and option pricing (Q1763432) (← links)
- A survey of Markov decision models for control of networks of queues (Q1801813) (← links)
- Optimal intensity control of a multi-class queue (Q1825531) (← links)
- Stochastic scheduling of parallel queues with set-up costs (Q1905067) (← links)
- Sample path methods in the control of queues (Q1923637) (← links)
- The archievable region method in the optimal control of queueing systems; formulations, bounds and policies (Q1923638) (← links)
- Performance evaluation of scheduling control of queueing networks: Fluid model heuristics (Q1923639) (← links)
- Bandit and covariate processes, with finite or non-denumerable set of arms (Q2145828) (← links)
- On the Gittins index in the M/G/1 queue (Q2269488) (← links)
- Reading policies for joins: an asymptotic analysis (Q2467117) (← links)
- Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems (Q2479159) (← links)
- Simultaneous optimization of flow control and scheduling in a single server queue with two job classes (Q2638918) (← links)
- Simultaneous optimization of flow-control and scheduling in a single server queue with two job classes: Numerical results and approximation (Q2638919) (← links)
- Competing Markov decision processes (Q2638972) (← links)
- On Gittins' index theorem in continuous time (Q2642040) (← links)
- On an Optimal Stopping Problem for Multi-Parameter Diffusion Processes (Q3209937) (← links)
- Tax problems in the undiscounted case (Q3367746) (← links)
- Branching Bandit Processes (Q3415889) (← links)
- A bisection/successive approximation method for computing Gittins indices (Q3970270) (← links)
- Dynamic allocation policies for the finite horizon one armed bandit problem (Q4215901) (← links)
- Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications (Q4698111) (← links)
- Independently Expiring Multiarmed Bandits (Q4950727) (← links)
- A General Theory of MultiArmed Bandit Processes with Constrained Arm Switches (Q5020738) (← links)
- Optimistic Gittins Indices (Q5060515) (← links)
- New results for generalized bandit problems (Q5202859) (← links)
- Optimal control of single-server queueing networks (Q5286756) (← links)
- Open Bandit Processes with Uncountable States and Time-Backward Effects (Q5299564) (← links)
- MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT (Q5358026) (← links)
- Stationary multi-choice bandit problems. (Q5958100) (← links)
- Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163) (← links)
- Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036) (← links)