scientific article; zbMATH DE number 3638998

From MaRDI portal
Revision as of 13:16, 6 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:4197923

zbMath0411.62055MaRDI QIDQ4197923

J. C. Gittins

Publication date: 1979


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items - show all)

Functional Sequential Treatment AllocationA forwards induction approach to candidate drug selectionConditions for indexability of restless bandits and an algorithm to compute Whittle indexOptimistic Gittins IndicesOn the optimal allocation of service to impatient tasksMulti-Actor Markov Decision ProcessesMULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONSBayesian Exploration: Incentivizing Exploration in Bayesian GamesWhittle’s Index Policy for Multi-Target Tracking with Jamming and NondetectionsGeneralized Restless Bandits and the Knapsack Problem for Perishable InventoriesDynamic Learning and Decision Making via Basis Weight VectorsOptimal activation of halting multi‐armed bandit modelsA novel statistical test for treatment differences in clinical trials using a response‐adaptive forward‐looking Gittins Index RuleEncounters with Martingales in Statistics and Stochastic OptimizationMulti-armed bandit problem with online clustering as side informationTesting indexability and computing Whittle and Gittins index in subcubic timeMULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENTINDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITSTreatment recommendation with distributional targetsExponential asymptotic optimality of Whittle index policyEmpirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problemsA general approximation method for optimal stopping and random delayStochastic Probing with Increasing PrecisionOptimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary RewardsIndex policies for discounted bandit problems with availability constraintsLearning the distribution with largest mean: two bandit frameworksA Verification Theorem for Threshold-Indexability of Real-State Discounted Restless BanditsFinite-Time Analysis for the Knowledge-Gradient PolicyBANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASESCoping with Incomplete Information in Scheduling — Stochastic and Online ModelsNonstationary Bandits with Habituation and Recovery DynamicsLearning in Combinatorial Optimization: What and How to ExploreSimple Bayesian Algorithms for Best-Arm IdentificationSome indexable families of restless bandit problemsAn Approximation Approach for Response-Adaptive Clinical Trial DesignApproximate Dynamic Programming based on High Dimensional Model RepresentationMonotone Policies and Indexability for Bidirectional Restless BanditsExplicit Gittins Indices for a Class of Superdiffusive ProcessesTax problems in the undiscounted caseSurvey of linear programming for standard and nonstandard Markovian control problems. Part II: ApplicationsTwo-Armed Restless Bandits with Imperfect Information: Stochastic Control and IndexabilityWhen to Abandon a Research Project and Search for a New OneBayesian Incentive-Compatible Bandit ExplorationSpinning plates and squad systems: policies for bi-directional restless banditsUn ordonnancement dynamique de tâches stochastiques sur un seul processeurStochastic graph explorationUnnamed ItemOnline Collaborative Filtering on GraphsA Continuous-Time Markov Decision Process for Infrastructure SurveillanceGeneralized Bandit ProblemsLearning Unknown Service Rates in Queues: A Multiarmed Bandit ApproachA Tight 2-Approximation for Preemptive Stochastic SchedulingA simulation-based approach to stochastic dynamic programmingTime-Constrained Restless Bandits and the Knapsack Problem for Perishable Items (Extended Abstract)A General Theory of MultiArmed Bandit Processes with Constrained Arm SwitchesA Bandit-Learning Approach to Multifidelity ApproximationA Restless Bandit Model for Resource Allocation, Competition, and ReservationOptimal learning and experimentation in bandit problems.An optimal stopping time problem with time average cost in a bounded intervalFast and slow enigmas and parental guidanceSEH: size estimate hedging for single-server queuesThe performance of forwards induction policiesParallel search for the best alternativeApproximation algorithms for stochastic combinatorial optimization problemsSequencing unreliable jobs on parallel machinesSelecting among scheduled projectsStochastic scheduling and forwards inductionLearning theorem proving componentsInfomax strategies for an optimal balance between exploration and exploitationMulti-armed bandits with simple armsSearch and active learning with correlated information: empirical evidence from mid-Atlantic clam fishermenBandit and covariate processes, with finite or non-denumerable set of armsResponse-adaptive designs for clinical trials: simultaneous learning from multiple patientsCompeting Markov decision processesRobust experimentation in the continuous time bandit problemMulti-armed bandit processes with optimal selection of the operating timesOn Gittins' index theorem in continuous timeMarginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queueResource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristicFour proofs of Gittins' multiarmed bandit theoremOptimal decision indices for R\&D project evaluation in the pharmaceutical industry: Pearson index versus Gittins indexControl, cost, and confidence: perseverance and procrastination in the face of failureA Bayesian adaptive design for clinical trials in rare diseasesAssessing the effects of machine breakdowns in stochastic schedulingBounds on optimal values in stochastic schedulingSelecting jobs for scheduling on a machine subject to failureKullback-Leibler upper confidence bounds for optimal sequential allocationA fluid approach to large volume job shop schedulingStochastic scheduling on a single machine subject to multiple breakdowns according to different probabilitiesThe multi-armed bandit problem: an efficient nonparametric solutionThe multi-armed bandit, with constraintsDerman's book as inspiration: some results on LP for MDPsMulti-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approachThe archievable region method in the optimal control of queueing systems; formulations, bounds and policiesThe expected asymptotical ratio for preemptive stochastic online problemGeneral time consistent discountingRational status quoOptimal experimental design for a class of bandit problemsAn application of Edgeworth expansion in Bayesian inferences: Optimal sample sizes in clinical trialsTruthful learning mechanisms for multi-slot sponsored search auctions with externalities







This page was built for publication: