scientific article

From MaRDI portal
Publication:3882215

zbMath0439.90096MaRDI QIDQ3882215

Peter Whittle

Publication date: 1980


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (67)

Optimal learning and experimentation in bandit problems.Applicable stochastic control: From theory to practiceA bisection/successive approximation method for computing Gittins indicesOptimal control of single-server queueing networksMulti-armed bandit problem revisitedThe performance of forwards induction policiesParallel search for the best alternativeStrategic conversations under imperfect information: epistemic message exchange gamesComputational aspects in applied stochastic controlStochastic scheduling and forwards inductionMulti-armed bandits with simple armsBandit and covariate processes, with finite or non-denumerable set of armsOptimistic Gittins IndicesCompeting Markov decision processesIncentivizing Exploration with Heterogeneous Value of MoneyMulti-armed bandit processes with optimal selection of the operating timesOn Gittins' index theorem in continuous timeFour proofs of Gittins' multiarmed bandit theoremContinue, quit, restart probability modelMULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONSBandit Theory: Applications to Learning Healthcare Systems and Clinical TrialsKullback-Leibler upper confidence bounds for optimal sequential allocationStochastic scheduling on a single machine subject to multiple breakdowns according to different probabilitiesThe multi-armed bandit, with constraintsMulti-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approachThe archievable region method in the optimal control of queueing systems; formulations, bounds and policiesOptimal activation of halting multi‐armed bandit modelsCommon value experimentationIndex policy for multiarmed bandit problem with dynamic risk measuresEncounters with Martingales in Statistics and Stochastic OptimizationAn application of Edgeworth expansion in Bayesian inferences: Optimal sample sizes in clinical trialsA perpetual search for talents across overlapping generations: a learning processON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITSEmpirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problemsIndex policies for discounted bandit problems with availability constraintsA Verification Theorem for Threshold-Indexability of Real-State Discounted Restless BanditsDecomposing risk in an exploitation-exploration problem with endogenous termination timeBranching bandits: A sequential search process with correlated pay-offs.Sequential process control under capacity constraints.Dynamic priority allocation via restless bandit marginal productivity indicesA Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief ModelOptimal learning before choiceSome indexable families of restless bandit problemsStochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocationA generalized Gittins index for a Markov chain and its recursive calculationMax-plus decomposition of supermartingales and convex order. Application to American options and portfolio insuranceOptimal stopping for Brownian motion with applications to sequential analysis and option pricingExplicit Gittins Indices for a Class of Superdiffusive ProcessesAn asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action banditsTax problems in the undiscounted caseSurvey of linear programming for standard and nonstandard Markovian control problems. Part II: ApplicationsA survey of Markov decision models for control of networks of queuesEfficiency in lung transplant allocation strategiesRisk-Averse Approximate Dynamic Programming with Quantile-Based Risk MeasuresUn ordonnancement dynamique de tâches stochastiques sur un seul processeurUnnamed ItemGittins' theorem under uncertaintyFinite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimalityDismemberment and design for controlling the replication variance of regret for the multi-armed banditMulti-armed bandits in discrete and continuous timeUncertainty in learning, choice, and visual fixationK competing queues with geometric service requirements and linear costs: The \(\mu\) c-rule is always optimalOn scheduling influential stochastic tasks on a single machineMatrices -- compensating the loss of anschauungA General Theory of MultiArmed Bandit Processes with Constrained Arm SwitchesMulti-armed bandit models for the optimal design of clinical trials: benefits and challengesOptimal stopping problems for multiarmed bandit processes with arms' independence






This page was built for publication: