Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints
Publication:3830833
DOI: 10.1287/opre.37.3.474, zbMath: 0675.90092, OpenAlex: W2001139238, MaRDI QID: Q3830833
Publication date: 1989
Published in: Operations Research
Full work available at URL: https://doi.org/10.1287/opre.37.3.474
optimal stationary policy, finite action space, finite state space, long-run average reward, limited randomization, round-robin type policy
Related Items
Selecting malaria interventions: a top-down approach, Constrained Semi-Markov decision processes with average rewards, Optimal strategies for managing complex authentication systems, The LP approach in average reward MDPs with multiple cost constraints: The countable state case, Controlled diffusions with constraints, Adaptive control of constrained Markov chains: Criteria and policies, Sensitivity of constrained Markov decision processes, Control policies for two-stage, pull-type production/inventory systems with constrained average cost criterion, Extreme point characterization of constrained nonstationary infinite-horizon Markov decision processes with finite state space, Controlled Markov chains with constraints., Optimal Call Admission Control for an IEEE 802.16 Wireless Metropolitan Area Network, Average Reward Markov Decision Processes with Multiple Cost Constraints, Bicriterion Optimization of an M/G/1 Queue with A Removable Server, OPTIMAL MIXING OF MARKOV DECISION RULES FOR MDP CONTROL, Multiple objective nonatomic Markov decision processes with total reward criteria, Resource-constrained management of heterogeneous assets with stochastic deterioration