Pages that link to "Item:Q1102059"
From MaRDI portal
The following pages link to Adaptive treatment allocation and the multi-armed bandit problem (Q1102059):
Displayed 43 items.
- Infomax strategies for an optimal balance between exploration and exploitation (Q310029) (← links)
- Optimal Bayesian strategies for the infinite-armed Bernoulli bandit (Q643377) (← links)
- Boundary crossing probabilities for general exponential families (Q722599) (← links)
- Nonparametric bandit methods (Q806690) (← links)
- A non-parametric solution to the multi-armed bandit problem with covariates (Q826996) (← links)
- An analysis of model-based interval estimation for Markov decision processes (Q959899) (← links)
- Small-sample performance of Bernoulli two-armed bandit Bayesian strategies (Q1298932) (← links)
- Optimal learning and experimentation in bandit problems. (Q1614793) (← links)
- On Bayesian index policies for sequential resource allocation (Q1750289) (← links)
- Optimal stopping for Brownian motion with applications to sequential analysis and option pricing (Q1763432) (← links)
- An online algorithm for the risk-aware restless bandit (Q2029383) (← links)
- Stochastic approximation: from statistical origin to big-data, multidisciplinary applications (Q2038304) (← links)
- Matrices -- compensating the loss of anschauung (Q2101899) (← links)
- Bandit and covariate processes, with finite or non-denumerable set of arms (Q2145828) (← links)
- The multi-armed bandit problem: an efficient nonparametric solution (Q2176624) (← links)
- Undiscounted bandit games (Q2212738) (← links)
- On the optimal amount of experimentation in sequential decision problems (Q2267618) (← links)
- Asymptotically optimal algorithms for budgeted multiple play bandits (Q2331676) (← links)
- Optimal strategies for a class of sequential control problems with precedence relations (Q2456018) (← links)
- Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit (Q2689638) (← links)
- (Q3121140) (← links)
- (Q4558161) (← links)
- Efficient Adaptive Randomization and Stopping Rules in Multi-arm Clinical Trials for Testing a New Treatment (Q4650223) (← links)
- Learning to Optimize via Information-Directed Sampling (Q4969321) (← links)
- (Q4986381) (← links)
- The Valuator’s Curse: Decision Analysis of Overvaluation and Disappointment in Acquisition (Q4991768) (← links)
- (Q5043718) (← links)
- Optimistic Gittins Indices (Q5060515) (← links)
- MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS (Q5072149) (← links)
- (Q5072154) (← links)
- Infinite Arms Bandit: Optimality via Confidence Bounds (Q5089465) (← links)
- Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors (Q5144779) (← links)
- An Approximation Approach for Response-Adaptive Clinical Trial Design (Q5148171) (← links)
- A linear response bandit problem (Q5168867) (← links)
- Learning to Optimize via Posterior Sampling (Q5247618) (← links)
- Sequential Generalized Likelihood Ratios and Adaptive Treatment Allocation for Optimal Sequential Selection (Q5478885) (← links)
- Encounters with Martingales in Statistics and Stochastic Optimization (Q6096242) (← links)
- Reinforcement Learning, Bit by Bit (Q6139546) (← links)
- (Q6153269) (← links)
- Asymptotic optimality theory for active quickest detection with unknown postchange parameters (Q6166463) (← links)
- Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036) (← links)
- Poissonian two-armed bandit: a new approach (Q6173456) (← links)
- A confirmation of a conjecture on Feldman’s two-armed bandit problem (Q6198964) (← links)