Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints
Publication:3830833
DOI: 10.1287/opre.37.3.474 ⋮ zbMath: 0675.90092 ⋮ OpenAlex: W2001139238 ⋮ MaRDI QID: Q3830833
Publication date: 1989
Published in: Operations Research
Full work available at URL: https://doi.org/10.1287/opre.37.3.474
Keywords: optimal stationary policy ⋮ finite action space ⋮ finite state space ⋮ long-run average reward ⋮ limited randomization ⋮ round-robin type policy
Related Items
Selecting malaria interventions: a top-down approach ⋮ Constrained Semi-Markov decision processes with average rewards ⋮ Optimal strategies for managing complex authentication systems ⋮ The LP approach in average reward MDPs with multiple cost constraints: The countable state case ⋮ Controlled diffusions with constraints ⋮ Adaptive control of constrained Markov chains: Criteria and policies ⋮ Sensitivity of constrained Markov decision processes ⋮ Control policies for two-stage, pull-type production/inventory systems with constrained average cost criterion ⋮ Extreme point characterization of constrained nonstationary infinite-horizon Markov decision processes with finite state space ⋮ Controlled Markov chains with constraints. ⋮ Optimal Call Admission Control for an IEEE 802.16 Wireless Metropolitan Area Network ⋮ Average Reward Markov Decision Processes with Multiple Cost Constraints ⋮ Bicriterion Optimization of an M/G/1 Queue with A Removable Server ⋮ OPTIMAL MIXING OF MARKOV DECISION RULES FOR MDP CONTROL ⋮ Multiple objective nonatomic Markov decision processes with total reward criteria ⋮ Resource-constrained management of heterogeneous assets with stochastic deterioration