Pages that link to "Item:Q3755256"

The following pages link to The Multi-Armed Bandit Problem: Decomposition and Computation (Q3755256):

Displaying 49 items.

Response-adaptive designs for clinical trials: simultaneous learning from multiple patients (Q320737) (← links)
Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic (Q333075) (← links)
Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
Continue, quit, restart probability model (Q333092) (← links)
Perspectives of approximate dynamic programming (Q333093) (← links)
The multi-armed bandit, with constraints (Q378726) (← links)
Derman's book as inspiration: some results on LP for MDPs (Q378728) (← links)
An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624) (← links)
Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352) (← links)
City streets parking enforcement inspection decisions: the Chinese postman's perspective (Q726240) (← links)
On the resolution of misspecified convex optimization and monotone variational inequality problems (Q782913) (← links)
Adaptive approaches to stochastic programming (Q806717) (← links)
A perpetual search for talents across overlapping generations: a learning process (Q898767) (← links)
A generalized Gittins index for a Markov chain and its recursive calculation (Q945795) (← links)
Stochastic scheduling and forwards induction (Q1346693) (← links)
A common value experimentation with multiarmed bandits (Q1720971) (← links)
A tutorial on Gaussian process regression: modelling, exploring, and exploiting functions (Q1735988) (← links)
Enhancing gene expression programming based on space partition and jump for symbolic regression (Q2056306) (← links)
An optimal stopping policy for car rental businesses with purchasing customers (Q2095189) (← links)
Robust control of the multi-armed bandit problem (Q2095215) (← links)
The performance of forwards induction policies (Q2368171) (← links)
Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality (Q2564701) (← links)
Competing Markov decision processes (Q2638972) (← links)
Ameso optimization: a relaxation of discrete midpoint convexity (Q2659175) (← links)
Optimal learning with non-Gaussian rewards (Q2806349) (← links)
On the Solution of Stochastic Optimization and Variational Problems in Imperfect Information Regimes (Q2832894) (← links)
Optimal stopping of Markov chains and three abstract optimization problems (Q3108369) (← links)
Branching Bandit Processes (Q3415889) (← links)
Incentivizing Exploration with Heterogeneous Value of Money (Q3460803) (← links)
Index policies for discounted bandit problems with availability constraints (Q3516395) (← links)
A bisection/successive approximation method for computing Gittins indices (Q3970270) (← links)
Dynamic allocation policies for the finite horizon one armed bandit problem (Q4215901) (← links)
Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications (Q4698111) (← links)
On the optimal allocation of service to impatient tasks (Q4819435) (← links)
Optimistic Gittins Indices (Q5060515) (← links)
A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits (Q5119843) (← links)
The efficacy of league formats in ranking teams (Q5142216) (← links)
Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors (Q5144779) (← links)
An Approximation Approach for Response-Adaptive Clinical Trial Design (Q5148171) (← links)
Optimal control of single-server queueing networks (Q5286756) (← links)
MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT (Q5358026) (← links)
DES AND RES PROCESSES AND THEIR EXPLICIT SOLUTIONS (Q5358035) (← links)
ASYMPTOTICALLY OPTIMAL MULTI-ARMED BANDIT POLICIES UNDER A COST CONSTRAINT (Q5358116) (← links)
Optimal activation of halting multi‐armed bandit models (Q6057028) (← links)
Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163) (← links)
Testing indexability and computing Whittle and Gittins index in subcubic time (Q6107877) (← links)
Optimal and Efficient Auctions for the Gradual Procurement of Strategic Service Provider Agents (Q6135966) (← links)
Consumer strategy, vendor strategy and equilibrium in duopoly markets with production costs (Q6177268) (← links)
A stochastic differential equation driven by Poisson random measure and its application in a duopoly market (Q6534435) (← links)