Pages that link to "Item:Q3083924"
From MaRDI portal
The following pages link to Multi‐Armed Bandit Allocation Indices (Q3083924):
Displaying 50 items.
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Q254442) (← links)
- Infomax strategies for an optimal balance between exploration and exploitation (Q310029) (← links)
- Control problems in online advertising and benefits of randomized bidding strategies (Q328167) (← links)
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- Perspectives of approximate dynamic programming (Q333093) (← links)
- Whittle index approach to size-aware scheduling for time-varying channels with multiple states (Q335893) (← links)
- Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995) (← links)
- The multi-armed bandit, with constraints (Q378726) (← links)
- Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352) (← links)
- Optimal switching between cash-flow streams (Q684138) (← links)
- Asymptotically optimal index policies for an abandonment queue with convex holding cost (Q747712) (← links)
- An asymptotically optimal strategy for constrained multi-armed bandit problems (Q784789) (← links)
- Optimal dynamic resource allocation to prevent defaults (Q1694773) (← links)
- A unified framework for stochastic optimization (Q1719609) (← links)
- Optimal learning before choice (Q1729685) (← links)
- On Bayesian index policies for sequential resource allocation (Q1750289) (← links)
- A linear-quadratic Gaussian approach to dynamic information acquisition (Q1754750) (← links)
- An online algorithm for the risk-aware restless bandit (Q2029383) (← links)
- Open problems in queueing theory inspired by datacenter computing (Q2052428) (← links)
- From reinforcement learning to optimal control: a unified framework for sequential decisions (Q2094027) (← links)
- Reinforcement learning: an industrial perspective (Q2094053) (← links)
- On the Gittins index for multistage jobs (Q2095039) (← links)
- The pure exploration problem with general reward functions depending on full distributions (Q2102381) (← links)
- Whittle index based Q-learning for restless bandits with average reward (Q2116660) (← links)
- Minimizing the mean slowdown in a single-server queue (Q2146415) (← links)
- Multi-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approach (Q2177821) (← links)
- Multi-round cooperative search games with multiple players (Q2186824) (← links)
- An adversarial model for scheduling with testing (Q2211361) (← links)
- Adaptive policies for perimeter surveillance problems (Q2286935) (← links)
- On index policies for stochastic minsum scheduling (Q2294301) (← links)
- On the dynamic allocation of assets subject to failure (Q2301960) (← links)
- Approximately optimal scheduling of an \(\mathrm{M}/\mathrm{G}/1\) queue with heavy tails (Q2351799) (← links)
- Optimal discrete search with technological choice (Q2354015) (← links)
- Optimal stopping problems with restricted stopping times (Q2358495) (← links)
- Ameso optimization: a relaxation of discrete midpoint convexity (Q2659175) (← links)
- On the computation of Whittle's index for Markovian restless bandits (Q2661759) (← links)
- Optimal schedule of elective surgery operations subject to disruptions by emergencies (Q2698603) (← links)
- Optimal learning with non-Gaussian rewards (Q2806349) (← links)
- <i>r</i>-extreme signalling for congestion control (Q2979529) (← links)
- A forwards induction approach to candidate drug selection (Q3172999) (← links)
- Bayesian Incentive-Compatible Bandit Exploration (Q3387959) (← links)
- Incentivizing Exploration with Heterogeneous Value of Money (Q3460803) (← links)
- (Q4558161) (← links)
- (Q4558474) (← links)
- Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models (Q4586173) (← links)
- A reinforcement learning approach to personalized learning recommendation systems (Q4627519) (← links)
- MYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUE (Q4628406) (← links)
- BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES (Q4629422) (← links)
- (Q4633046) (← links)
- Learning to Optimize via Information-Directed Sampling (Q4969321) (← links)