Pages that link to "Item:Q3755256"
From MaRDI portal
The following pages link to The Multi-Armed Bandit Problem: Decomposition and Computation (Q3755256):
- Response-adaptive designs for clinical trials: simultaneous learning from multiple patients (Q320737)
- Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic (Q333075)
- Four proofs of Gittins' multiarmed bandit theorem (Q333080)
- Continue, quit, restart probability model (Q333092)
- Perspectives of approximate dynamic programming (Q333093)
- The multi-armed bandit, with constraints (Q378726)
- Derman's book as inspiration: some results on LP for MDPs (Q378728)
- An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624)
- Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation (Q490352)
- City streets parking enforcement inspection decisions: the Chinese postman's perspective (Q726240)
- On the resolution of misspecified convex optimization and monotone variational inequality problems (Q782913)
- Adaptive approaches to stochastic programming (Q806717)
- A perpetual search for talents across overlapping generations: a learning process (Q898767)
- A generalized Gittins index for a Markov chain and its recursive calculation (Q945795)
- Stochastic scheduling and forwards induction (Q1346693)
- A common value experimentation with multiarmed bandits (Q1720971)
- A tutorial on Gaussian process regression: modelling, exploring, and exploiting functions (Q1735988)
- Enhancing gene expression programming based on space partition and jump for symbolic regression (Q2056306)
- An optimal stopping policy for car rental businesses with purchasing customers (Q2095189)
- Robust control of the multi-armed bandit problem (Q2095215)
- The performance of forwards induction policies (Q2368171)
- Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality (Q2564701)
- Competing Markov decision processes (Q2638972)
- Ameso optimization: a relaxation of discrete midpoint convexity (Q2659175)
- Optimal learning with non-Gaussian rewards (Q2806349)
- On the Solution of Stochastic Optimization and Variational Problems in Imperfect Information Regimes (Q2832894)
- Optimal stopping of Markov chains and three abstract optimization problems (Q3108369)
- Branching Bandit Processes (Q3415889)
- Incentivizing Exploration with Heterogeneous Value of Money (Q3460803)
- Index policies for discounted bandit problems with availability constraints (Q3516395)
- A bisection/successive approximation method for computing Gittins indices (Q3970270)
- Dynamic allocation policies for the finite horizon one armed bandit problem (Q4215901)
- Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications (Q4698111)
- On the optimal allocation of service to impatient tasks (Q4819435)
- Optimistic Gittins Indices (Q5060515)
- A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits (Q5119843)
- The efficacy of league formats in ranking teams (Q5142216)
- Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors (Q5144779)
- An Approximation Approach for Response-Adaptive Clinical Trial Design (Q5148171)
- Optimal control of single-server queueing networks (Q5286756)
- MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT (Q5358026)
- DES AND RES PROCESSES AND THEIR EXPLICIT SOLUTIONS (Q5358035)
- ASYMPTOTICALLY OPTIMAL MULTI-ARMED BANDIT POLICIES UNDER A COST CONSTRAINT (Q5358116)
- Optimal activation of halting multi‐armed bandit models (Q6057028)
- Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163)
- Testing indexability and computing Whittle and Gittins index in subcubic time (Q6107877)
- Optimal and Efficient Auctions for the Gradual Procurement of Strategic Service Provider Agents (Q6135966)
- Consumer strategy, vendor strategy and equilibrium in duopoly markets with production costs (Q6177268)