The following pages link to (Q4692329):
Displaying 50 items.
- Response-adaptive designs for clinical trials: simultaneous learning from multiple patients (Q320737) (← links)
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- The multi-armed bandit, with constraints (Q378726) (← links)
- One-armed bandit process with a covariate (Q380008) (← links)
- The expected asymptotical ratio for preemptive stochastic online problem (Q391147) (← links)
- An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624) (← links)
- A Bayesian approach to the triage problem with imperfect classification (Q421634) (← links)
- General notions of indexability for queueing control and asset management (Q549860) (← links)
- Optimal Bayesian strategies for the infinite-armed Bernoulli bandit (Q643377) (← links)
- A dynamic programming strategy to balance exploration and exploitation in the bandit problem (Q647433) (← links)
- Ambiguity aversion in multi-armed bandit problems (Q656883) (← links)
- A behavioral learning process in games (Q700080) (← links)
- On the resolution of misspecified convex optimization and monotone variational inequality problems (Q782913) (← links)
- Woodroofe's one-armed bandit problem revisited (Q835072) (← links)
- Two-parameter optimal stopping problem with switching costs (Q917158) (← links)
- Dynamic priority allocation via restless bandit marginal productivity indices (Q926578) (← links)
- A second order SDE for the Langevin process reflected at a completely inelastic boundary (Q936151) (← links)
- A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems (Q949395) (← links)
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits (Q1017665) (← links)
- Risk aversion in expected intertemporal discounted utilities bandit problems (Q1036105) (← links)
- A Bayesian analysis of human decision-making on bandit problems (Q1042313) (← links)
- Mathematical problems in the theory of processor-sharing queueing systems (Q1179867) (← links)
- Optimal allocation of simulation experiments in discrete stochastic optimization and approximative algorithms (Q1278957) (← links)
- Single machine scheduling when processing times are correlated normal random variables (Q1291584) (← links)
- Multi-armed bandits in discrete and continuous time (Q1296724) (← links)
- Small-sample performance of Bernoulli two-armed bandit Bayesian strategies (Q1298932) (← links)
- On scheduling influential stochastic tasks on a single machine (Q1310026) (← links)
- Applicable stochastic control: From theory to practice (Q1330528) (← links)
- Multi-armed bandit problem revisited (Q1337211) (← links)
- Stochastic scheduling and forwards induction (Q1346693) (← links)
- A statistical approach to adaptive problem solving (Q1391900) (← links)
- Optimal hysteresis for a class of deterministic deteriorating two-armed bandit problem with switching costs. (Q1421425) (← links)
- Branching bandits: A sequential search process with correlated pay-offs. (Q1421893) (← links)
- Herbert Robbins and sequential analysis (Q1429307) (← links)
- Optimal learning and experimentation in bandit problems. (Q1614793) (← links)
- The optimal sequential information acquisition structure: a rational utility-maximizing perspective (Q1629762) (← links)
- A unified framework for stochastic optimization (Q1719609) (← links)
- Randomized prediction of individual sequences (Q1733293) (← links)
- A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions (Q1767307) (← links)
- Index policies for the maintenance of a collection of machines by a set of repairmen (Q1776971) (← links)
- Asymptotically efficient strategies for a stochastic scheduling problem with order constraints. (Q1848847) (← links)
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates (Q1848931) (← links)
- Stochastic scheduling of parallel queues with set-up costs (Q1905067) (← links)
- The archievable region method in the optimal control of queueing systems; formulations, bounds and policies (Q1923638) (← links)
- A program for sequential allocation of three Bernoulli populations (Q1978413) (← links)
- Generative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991) (Q1982402) (← links)
- Efficiency in lung transplant allocation strategies (Q2044231) (← links)
- On the Gittins index for multistage jobs (Q2095039) (← links)
- On the Whittle index of Markov modulated restless bandits (Q2095040) (← links)
- Minimizing the mean slowdown in a single-server queue (Q2146415) (← links)