scientific article; zbMATH DE number 194374

From MaRDI portal
Publication:4692329

zbMath: 0699.90068; MaRDI QID: Q4692329

J. C. Gittins

Publication date: 5 June 1993


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (first 100 items shown)

Optimal control of single-server queueing networks
A linear response bandit problem
Minimizing the mean slowdown in the M/G/1 queue
Topp-Leone distribution with an application to binomial sampling
MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT
Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems
Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards
Unnamed Item
Some indexable families of restless bandit problems
A survey of computational complexity results in systems and control
Learning in network contexts: experimental results from simulations
Dynamic Pricing with a Poisson Bandit Model
Learning while searching for the best alternative
Stationary multi-choice bandit problems.
Randomized allocation with arm elimination in a bandit problem with covariates
Simulation optimization: a review of algorithms and applications
Spinning plates and squad systems: policies for bi-directional restless bandits
Unnamed Item
Generalized Bandit Problems
Scheduling Jobs That Are Subject to Deterministic Due Dates and Have Deteriorating Expected Rewards
Optimal learning and experimentation in bandit problems.
A Note on Optimal Strategies of a Generalized Two-Stage Bandit Problem
Two-Armed Bandit Strategies that Discount Past and Future
Applicable stochastic control: From theory to practice
Multi-armed bandit problem revisited
The performance of forwards induction policies
Woodroofe's one-armed bandit problem revisited
The optimal sequential information acquisition structure: a rational utility-maximizing perspective
Generalized two-stage bandit problem
Stochastic scheduling and forwards induction
Optimal selection of obsolescence mitigation strategies using a restless bandit model
Minimizing the mean slowdown in a single-server queue
Response-adaptive designs for clinical trials: simultaneous learning from multiple patients
Competing Markov decision processes
Incentivizing Exploration with Heterogeneous Value of Money
Multi-armed bandit processes with optimal selection of the operating times
Four proofs of Gittins' multiarmed bandit theorem
Stochastic scheduling of parallel queues with set-up costs
On the optimal allocation of service to impatient tasks
Technology diffusion by learning from neighbours
Self-confirming equilibrium and the Lucas critique
Scheduling of multi-class multi-server queueing systems with abandonments
Optimal myopic policies and index policies for stochastic scheduling problems
Sequential allocation in clinical trials
The multi-armed bandit, with constraints
The system of quasi-variational inequalities attached to the two-armed bandit problem
One-armed bandit process with a covariate
The archievable region method in the optimal control of queueing systems; formulations, bounds and policies
Zero-sum Games for Discrete-time Multi-armed Bandit Processes with a Generalized Discount
The expected asymptotical ratio for preemptive stochastic online problem
A statistical approach to adaptive problem solving
Bayesian bandits in clinical trials
An asymptotically optimal policy for finite support models in the multiarmed bandit problem
A Bayesian approach to the triage problem with imperfect classification
Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit
Recent sojourn time results for multilevel processor‐sharing scheduling disciplines
Optimal Bayesian strategies for the infinite-armed Bernoulli bandit
Two-parameter optimal stopping problem with switching costs
Optimal hysteresis for a class of deterministic deteriorating two-armed bandit problem with switching costs.
Branching bandits: A sequential search process with correlated pay-offs.
A dynamic programming strategy to balance exploration and exploitation in the bandit problem
A unified framework for stochastic optimization
Dynamic priority allocation via restless bandit marginal productivity indices
Herbert Robbins and sequential analysis
Ambiguity aversion in multi-armed bandit problems
Optimal strategies for a class of sequential control problems with precedence relations
Mathematical problems in the theory of processor-sharing queueing systems
Randomized prediction of individual sequences
A second order SDE for the Langevin process reflected at a completely inelastic boundary
A program for sequential allocation of three Bernoulli populations
Unnamed Item
Generative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991)
Reading policies for joins: an asymptotic analysis
A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems
Monotone Policies and Indexability for Bidirectional Restless Bandits
Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
A behavioral learning process in games
A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions
General notions of indexability for queueing control and asset management
Dynamic price competition
On the Gittins index in the M/G/1 queue
The prediction distribution for the heteroscedastic multivariate lineary models
Using adaptive learning in credit scoring to estimate take-up probability distribution
Index policies for the maintenance of a collection of machines by a set of repairmen
Sensitivity of the gittins index in the contiuous time two-armed bandit problem
Adaptive Incentive-Compatible Sponsored Search Auction
Infinite Arms Bandit: Optimality via Confidence Bounds
Independently Expiring Multiarmed Bandits
Tax problems in the undiscounted case
Customer Scheduling with Incomplete Information
Scheduling policies for an antiterrorist surveillance system
On index policies for stochastic minsum scheduling
Efficiency in lung transplant allocation strategies
A Bayesian Decision Approach for Sample Size Determination in Phase II Trials
Decision-Theoretic Designs for Phase II Clinical Trials Allowing for Competing Studies
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
Un ordonnancement dynamique de tâches stochastiques sur un seul processeur
Optimal allocation of simulation experiments in discrete stochastic optimization and approximative algorithms
PROPERTIES OF THE GITTINS INDEX WITH APPLICATION TO OPTIMAL SCHEDULING
On the Solution of Stochastic Optimization and Variational Problems in Imperfect Information Regimes



