Pages that link to "Item:Q3083924"

From MaRDI portal

The following pages link to Multi‐Armed Bandit Allocation Indices (Q3083924):

Displaying 50 items.

Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Q254442)
Infomax strategies for an optimal balance between exploration and exploitation (Q310029)
Control problems in online advertising and benefits of randomized bidding strategies (Q328167)
Four proofs of Gittins' multiarmed bandit theorem (Q333080)
Perspectives of approximate dynamic programming (Q333093)
Whittle index approach to size-aware scheduling for time-varying channels with multiple states (Q335893)
Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995)
The multi-armed bandit, with constraints (Q378726)
Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352)
Optimal switching between cash-flow streams (Q684138)
Asymptotically optimal index policies for an abandonment queue with convex holding cost (Q747712)
An asymptotically optimal strategy for constrained multi-armed bandit problems (Q784789)
Optimal dynamic resource allocation to prevent defaults (Q1694773)
A unified framework for stochastic optimization (Q1719609)
Optimal learning before choice (Q1729685)
On Bayesian index policies for sequential resource allocation (Q1750289)
A linear-quadratic Gaussian approach to dynamic information acquisition (Q1754750)
An online algorithm for the risk-aware restless bandit (Q2029383)
Open problems in queueing theory inspired by datacenter computing (Q2052428)
From reinforcement learning to optimal control: a unified framework for sequential decisions (Q2094027)
Reinforcement learning: an industrial perspective (Q2094053)
On the Gittins index for multistage jobs (Q2095039)
The pure exploration problem with general reward functions depending on full distributions (Q2102381)
Whittle index based Q-learning for restless bandits with average reward (Q2116660)
Minimizing the mean slowdown in a single-server queue (Q2146415)
Multi-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approach (Q2177821)
Multi-round cooperative search games with multiple players (Q2186824)
An adversarial model for scheduling with testing (Q2211361)
Adaptive policies for perimeter surveillance problems (Q2286935)
On index policies for stochastic minsum scheduling (Q2294301)
On the dynamic allocation of assets subject to failure (Q2301960)
Approximately optimal scheduling of an \(\mathrm{M}/\mathrm{G}/1\) queue with heavy tails (Q2351799)
Optimal discrete search with technological choice (Q2354015)
Optimal stopping problems with restricted stopping times (Q2358495)
Ameso optimization: a relaxation of discrete midpoint convexity (Q2659175)
On the computation of Whittle's index for Markovian restless bandits (Q2661759)
Optimal schedule of elective surgery operations subject to disruptions by emergencies (Q2698603)
Optimal learning with non-Gaussian rewards (Q2806349)
r-extreme signalling for congestion control (Q2979529)
A forwards induction approach to candidate drug selection (Q3172999)
Bayesian Incentive-Compatible Bandit Exploration (Q3387959)
Incentivizing Exploration with Heterogeneous Value of Money (Q3460803)
(Q4558161)
(Q4558474)
Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models (Q4586173)
A reinforcement learning approach to personalized learning recommendation systems (Q4627519)
MYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUE (Q4628406)
BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES (Q4629422)
(Q4633046)
Learning to Optimize via Information-Directed Sampling (Q4969321)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q3083924"