scientific article
From MaRDI portal
Publication:3815845
zbMath: 0664.90043; MaRDI QID: Q3815845
Publication date: 1988
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Lagrange multiplier, stochastic scheduling, multi-armed bandit, Gittins' index, total expected reward, sequential scheduling, Ehrenfest project
Related Items (95)
Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints ⋮ Dynamic control in multi-item production/inventory systems ⋮ Resource allocation and routing in parallel multi-server queues with abandonments for cloud profit maximization ⋮ Stochastic scheduling and forwards induction ⋮ Conditions for indexability of restless bandits and an algorithm to compute Whittle index ⋮ Optimal selection of obsolescence mitigation strategies using a restless bandit model ⋮ Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality ⋮ Optimistic Gittins Indices ⋮ Scalable Reinforcement Learning for Multiagent Networked Systems ⋮ Dynamic routing to heterogeneous collections of unreliable servers ⋮ Marginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queue ⋮ Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic ⋮ Four proofs of Gittins' multiarmed bandit theorem ⋮ Whittle index approach to size-aware scheduling for time-varying channels with multiple states ⋮ On the optimal allocation of service to impatient tasks ⋮ Admission and routing of soft real-time jobs to multiclusters: design and comparison of index policies ⋮ Dynamic routing in distinguishable parallel queues: an application of product returns for remanufacturing ⋮ A Bayesian adaptive design for clinical trials in rare diseases ⋮ Scheduling of multi-class multi-server queueing systems with abandonments ⋮ Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs ⋮ Whittle’s Index Policy for Multi-Target Tracking with Jamming and Nondetections ⋮ Sensor Scheduling for Space Object Tracking and Collision Alert ⋮ Generalized Restless Bandits and the Knapsack Problem for Perishable Inventories ⋮ On the computation of Whittle's index for Markovian restless bandits ⋮ Multi-machine 
preventive maintenance scheduling with imperfect interventions: a restless bandit approach ⋮ The archievable region method in the optimal control of queueing systems; formulations, bounds and policies ⋮ Towards minimum loss job routing to parallel heterogeneous multiserver queues via index policies ⋮ Index policy for multiarmed bandit problem with dynamic risk measures ⋮ Multi-armed bandit problem with online clustering as side information ⋮ Testing indexability and computing Whittle and Gittins index in subcubic time ⋮ Whittle's index based sensor scheduling for multiprocess systems under DoS attacks ⋮ A perpetual search for talents across overlapping generations: a learning process ⋮ Optimal dynamic resource allocation to prevent defaults ⋮ INDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITS ⋮ ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS ⋮ Algorithms and mechanisms for procuring services with uncertain durations using redundancy ⋮ Exponential asymptotic optimality of Whittle index policy ⋮ Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems ⋮ Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards ⋮ A confirmation of a conjecture on Feldman’s two-armed bandit problem ⋮ Index policies for discounted bandit problems with availability constraints ⋮ A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits ⋮ Prioritizing Hepatitis C Treatment in U.S. Prisons ⋮ Branching bandits: A sequential search process with correlated pay-offs. 
⋮ A conservative index heuristic for routing problems with multiple heterogeneous service facilities ⋮ The role of information in system stability with partially observable servers ⋮ Regret bounds for restless Markov bandits ⋮ Dynamic priority allocation via restless bandit marginal productivity indices ⋮ A novel scheduling index rule proposal for QoE maximization in wireless networks ⋮ MYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUE ⋮ Optimal sequential replenishment of ships during combat ⋮ Group Maintenance: A Restless Bandits Approach ⋮ BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES ⋮ Nonstationary Bandits with Habituation and Recovery Dynamics ⋮ Some indexable families of restless bandit problems ⋮ Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation ⋮ An index heuristic for transshipment decisions in multi-location inventory systems based on a pairwise decomposition ⋮ Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution ⋮ Monotone Policies and Indexability for Bidirectional Restless Bandits ⋮ A survey of computational complexity results in systems and control ⋮ Parameter Dependent Optimal Thresholds, Indifference Levels and Inverse Optimal Stopping Problems ⋮ Scheduling deteriorating jobs on a single machine subject to breakdowns ⋮ Dynamic resource allocation in a multi-product make-to-stock production system ⋮ General notions of indexability for queueing control and asset management ⋮ On the Gittins index in the M/G/1 queue ⋮ Using adaptive learning in credit scoring to estimate take-up probability distribution ⋮ Index policies for the maintenance of a collection of machines by a set of repairmen ⋮ Coupled bisection for root ordering ⋮ An online algorithm for the risk-aware restless bandit ⋮ An asymptotically optimal heuristic for general nonstationary 
finite-horizon restless multi-armed, multi-action bandits ⋮ Marginal Productivity Index Policies for Admission Control and Routing to Parallel Multi-server Loss Queues with Reneging ⋮ Linear programming relaxations and marginal productivity index policies for the buffer sharing problem ⋮ Efficiency in lung transplant allocation strategies ⋮ Asymptotically optimal index policies for an abandonment queue with convex holding cost ⋮ On the dynamic allocation of assets subject to failure ⋮ Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability ⋮ Spinning plates and squad systems: policies for bi-directional restless bandits ⋮ A Continuous-Time Markov Decision Process for Infrastructure Surveillance ⋮ Gittins' theorem under uncertainty ⋮ Gittins Index for Simple Family of Markov Bandit Processes with Switching Cost and No Discounting ⋮ Resource competition in virtual network embedding ⋮ Learning Unknown Service Rates in Queues: A Multiarmed Bandit Approach ⋮ Resource-constrained management of heterogeneous assets with stochastic deterioration ⋮ Theoretical tools for understanding and aiding dynamic decision making ⋮ Learning, risk attitude and hot stoves in restless bandit problems ⋮ On the Whittle index of Markov modulated restless bandits ⋮ Whittle indexability in egalitarian processor sharing systems ⋮ Robust control of the multi-armed bandit problem ⋮ Unnamed Item ⋮ Time-Constrained Restless Bandits and the Knapsack Problem for Perishable Items (Extended Abstract) ⋮ A General Theory of MultiArmed Bandit Processes with Constrained Arm Switches ⋮ Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges ⋮ Whittle index based Q-learning for restless bandits with average reward ⋮ Algorithmic aspects of mean-variance optimization in Markov decision processes ⋮ A Restless Bandit Model for Resource Allocation, Competition, and Reservation