scientific article

From MaRDI portal
Publication:3815845

zbMath0664.90043MaRDI QIDQ3815845

Peter Whittle

Publication date: 1988


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (95)

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpointsDynamic control in multi-item production/inventory systemsResource allocation and routing in parallel multi-server queues with abandonments for cloud profit maximizationStochastic scheduling and forwards inductionConditions for indexability of restless bandits and an algorithm to compute Whittle indexOptimal selection of obsolescence mitigation strategies using a restless bandit modelDynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic OptimalityOptimistic Gittins IndicesScalable Reinforcement Learning for Multiagent Networked SystemsDynamic routing to heterogeneous collections of unreliable serversMarginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queueResource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristicFour proofs of Gittins' multiarmed bandit theoremWhittle index approach to size-aware scheduling for time-varying channels with multiple statesOn the optimal allocation of service to impatient tasksAdmission and routing of soft real-time jobs to multiclusters: design and comparison of index policiesDynamic routing in distinguishable parallel queues: an application of product returns for remanufacturingA Bayesian adaptive design for clinical trials in rare diseasesScheduling of multi-class multi-server queueing systems with abandonmentsIntegrated Online Learning and Adaptive Control in Queueing Systems with Uncertain PayoffsWhittle’s Index Policy for Multi-Target Tracking with Jamming and NondetectionsSensor Scheduling for Space Object Tracking and Collision AlertGeneralized Restless Bandits and the Knapsack Problem for Perishable InventoriesOn the computation of Whittle's index for Markovian restless banditsMulti-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approachThe archievable region method in the optimal control of queueing systems; formulations, bounds and policiesTowards minimum loss job routing to parallel heterogeneous multiserver queues via index policiesIndex policy for multiarmed bandit problem with dynamic risk measuresMulti-armed bandit problem with online clustering as side informationTesting indexability and computing Whittle and Gittins index in subcubic timeWhittle's index based sensor scheduling for multiprocess systems under DoS attacksA perpetual search for talents across overlapping generations: a learning processOptimal dynamic resource allocation to prevent defaultsINDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITSON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITSAlgorithms and mechanisms for procuring services with uncertain durations using redundancyExponential asymptotic optimality of Whittle index policyEmpirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problemsOptimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary RewardsA confirmation of a conjecture on Feldman’s two-armed bandit problemIndex policies for discounted bandit problems with availability constraintsA Verification Theorem for Threshold-Indexability of Real-State Discounted Restless BanditsPrioritizing Hepatitis C Treatment in U.S. PrisonsBranching bandits: A sequential search process with correlated pay-offs.A conservative index heuristic for routing problems with multiple heterogeneous service facilitiesThe role of information in system stability with partially observable serversRegret bounds for restless Markov banditsDynamic priority allocation via restless bandit marginal productivity indicesA novel scheduling index rule proposal for QoE maximization in wireless networksMYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUEOptimal sequential replenishment of ships during combatGroup Maintenance: A Restless Bandits ApproachBANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASESNonstationary Bandits with Habituation and Recovery DynamicsSome indexable families of restless bandit problemsStochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocationAn index heuristic for transshipment decisions in multi-location inventory systems based on a pairwise decompositionAttack allocation on remote state estimation in multi-systems: structural results and asymptotic solutionMonotone Policies and Indexability for Bidirectional Restless BanditsA survey of computational complexity results in systems and controlParameter Dependent Optimal Thresholds, Indifference Levels and Inverse Optimal Stopping ProblemsScheduling deteriorating jobs on a single machine subject to breakdownsDynamic resource allocation in a multi-product make-to-stock production systemGeneral notions of indexability for queueing control and asset managementOn the Gittins index in the M/G/1 queueUsing adaptive learning in credit scoring to estimate take-up probability distributionIndex policies for the maintenance of a collection of machines by a set of repairmenCoupled bisection for root orderingAn online algorithm for the risk-aware restless banditAn asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action banditsMarginal Productivity Index Policies for Admission Control and Routing to Parallel Multi-server Loss Queues with RenegingLinear programming relaxations and marginal productivity index policies for the buffer sharing problemEfficiency in lung transplant allocation strategiesAsymptotically optimal index policies for an abandonment queue with convex holding costOn the dynamic allocation of assets subject to failureTwo-Armed Restless Bandits with Imperfect Information: Stochastic Control and IndexabilitySpinning plates and squad systems: policies for bi-directional restless banditsA Continuous-Time Markov Decision Process for Infrastructure SurveillanceGittins' theorem under uncertaintyGittins Index for Simple Family of Markov Bandit Processes with Switching Cost and No DiscountingResource competition in virtual network embeddingLearning Unknown Service Rates in Queues: A Multiarmed Bandit ApproachResource-constrained management of heterogeneous assets with stochastic deteriorationTheoretical tools for understanding and aiding dynamic decision makingLearning, risk attitude and hot stoves in restless bandit problemsOn the Whittle index of Markov modulated restless banditsWhittle indexability in egalitarian processor sharing systemsRobust control of the multi-armed bandit problemUnnamed ItemTime-Constrained Restless Bandits and the Knapsack Problem for Perishable Items (Extended Abstract)A General Theory of MultiArmed Bandit Processes with Constrained Arm SwitchesMulti-armed bandit models for the optimal design of clinical trials: benefits and challengesWhittle index based Q-learning for restless bandits with average rewardAlgorithmic aspects of mean-variance optimization in Markov decision processesA Restless Bandit Model for Resource Allocation, Competition, and Reservation






This page was built for publication: