Pages that link to "Item:Q1203758"
From MaRDI portal
The following pages link to On the Gittins index for multiarmed bandits (Q1203758):
Displayed 30 items.
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Q254442) (← links)
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995) (← links)
- The multi-armed bandit, with constraints (Q378726) (← links)
- Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352) (← links)
- Dynamic priority allocation via restless bandit marginal productivity indices (Q926578) (← links)
- Multi-armed bandits in discrete and continuous time (Q1296724) (← links)
- Multi-armed bandit problem revisited (Q1337211) (← links)
- Information-gain computation in the \textsc{Fifth} system (Q1726365) (← links)
- The archievable region method in the optimal control of queueing systems; formulations, bounds and policies (Q1923638) (← links)
- Efficiency in lung transplant allocation strategies (Q2044231) (← links)
- Gittins' theorem under uncertainty (Q2076662) (← links)
- On the Gittins index in the M/G/1 queue (Q2269488) (← links)
- Multi-armed bandit processes with optimal selection of the operating times (Q2387146) (← links)
- Reading policies for joins: an asymptotic analysis (Q2467117) (← links)
- On Gittins' index theorem in continuous time (Q2642040) (← links)
- Stopped decision processes in conjunction with general utility (Q2766113) (← links)
- (Q4558474) (← links)
- Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications (Q4698111) (← links)
- Independently Expiring Multiarmed Bandits (Q4950727) (← links)
- Technical Note—A Note on the Equivalence of Upper Confidence Bounds and Gittins Indices for Patient Agents (Q4994155) (← links)
- Optimistic Gittins Indices (Q5060515) (← links)
- Open Bandit Processes with Uncountable States and Time-Backward Effects (Q5299564) (← links)
- MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT (Q5358026) (← links)
- ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (Q5358114) (← links)
- Gambling Under Unknown Probabilities as Proxy for Real World Decisions Under Uncertainty (Q5885234) (← links)
- Optimal activation of halting multi‐armed bandit models (Q6057028) (← links)
- Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163) (← links)
- Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036) (← links)
- Optimal Dynamic Information Acquisition (Q6181690) (← links)