Pages that link to "Item:Q1060517"

From MaRDI portal

← Asymptotically efficient adaptive allocation rules (Q1060517)

Jump to:navigation, search

What links here

⧼whatlinkshere-whatlinkshere-target⧽

Page:

⧼whatlinkshere-whatlinkshere-ns⧽

Namespace:

Invert selection

⧼whatlinkshere-whatlinkshere-filter⧽

Hide transclusions

Hide links

Hide redirects

The following pages link to Asymptotically efficient adaptive allocation rules (Q1060517):

Displaying 50 items.

Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Q254442) (← links)
Algorithm portfolios for noisy optimization (Q276539) (← links)
Batched bandit problems (Q282463) (← links)
Modification of improved upper confidence bounds for regulating exploration in Monte-Carlo tree search (Q307787) (← links)
Infomax strategies for an optimal balance between exploration and exploitation (Q310029) (← links)
Response-adaptive designs for clinical trials: simultaneous learning from multiple patients (Q320737) (← links)
Control problems in online advertising and benefits of randomized bidding strategies (Q328167) (← links)
The multi-armed bandit problem with covariates (Q355096) (← links)
Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995) (← links)
Exploration and exploitation of scratch games (Q374139) (← links)
Two-armed bandit problem for parallel data processing systems (Q376114) (← links)
Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753) (← links)
Robustness of stochastic bandit policies (Q391739) (← links)
General time consistent discounting (Q391749) (← links)
An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624) (← links)
The \(K\)-armed dueling bandits problem (Q440003) (← links)
Regret bounds for restless Markov bandits (Q465253) (← links)
Near-optimal PAC bounds for discounted MDPs (Q465258) (← links)
Adaptive sensing performance lower bounds for sparse signal detection and support estimation (Q470072) (← links)
MSO: a framework for bound-constrained black-box global optimization algorithms (Q524912) (← links)
Analyzing bandit-based adaptive operator selection mechanisms (Q647443) (← links)
Modeling item-item similarities for personalized recommendations on Yahoo! front page (Q652346) (← links)
UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem (Q653803) (← links)
Boundary crossing probabilities for general exponential families (Q722599) (← links)
An asymptotically optimal strategy for constrained multi-armed bandit problems (Q784789) (← links)
Nonparametric bandit methods (Q806690) (← links)
A non-parametric solution to the multi-armed bandit problem with covariates (Q826996) (← links)
Optimal control with learning on the fly: a toy problem (Q832436) (← links)
Woodroofe's one-armed bandit problem revisited (Q835072) (← links)
A perpetual search for talents across overlapping generations: a learning process (Q898767) (← links)
Online regret bounds for Markov decision processes with deterministic transitions (Q982638) (← links)
Active learning in heteroscedastic noise (Q982644) (← links)
Response adaptive designs that incorporate switching costs and constraints (Q997275) (← links)
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits (Q1017665) (← links)
Certainty equivalence control with forcing: Revisited (Q1264127) (← links)
Optimal allocation of simulation experiments in discrete stochastic optimization and approximative algorithms (Q1278957) (← links)
The time until the final zero crossing of random sums with application to nonparametric bandit theory (Q1335239) (← links)
Optimal learning and experimentation in bandit problems. (Q1614793) (← links)
Improving multi-armed bandit algorithms in online pricing settings (Q1644914) (← links)
Bayesian policy reuse (Q1689554) (← links)
A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing (Q1690964) (← links)
An optimal bidimensional multi-armed bandit auction for multi-unit procurement (Q1714944) (← links)
A unified framework for stochastic optimization (Q1719609) (← links)
Randomized prediction of individual sequences (Q1733293) (← links)
On Bayesian index policies for sequential resource allocation (Q1750289) (← links)
Asymptotically efficient strategies for a stochastic scheduling problem with order constraints. (Q1848847) (← links)
Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates (Q1848931) (← links)
Regret bounds for sleeping experts and bandits (Q1959599) (← links)
Clustering in block Markov chains (Q1996780) (← links)
Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (Q2006767) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q1060517"