Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints
From MaRDI portal
Publication:3830833
Recommendations
Cited in
(21)- Optimal Call Admission Control for an IEEE 802.16 Wireless Metropolitan Area Network
- Optimal strategies for managing complex authentication systems
- Splitting randomized stationary policies in total-reward Markov decision processes
- Constrained Semi-Markov decision processes with average rewards
- Optimal mixing of Markov decision rules for MDP control
- Controlled Markov chains with constraints.
- The LP approach in average reward MDPs with multiple cost constraints: The countable state case
- Adaptive control of constrained Markov chains: Criteria and policies
- Concurrent MDPs with Finite Markovian Policies
- Steering policies for controlled Markov chains under a recurrence condition
- Extreme point characterization of constrained nonstationary infinite-horizon Markov decision processes with finite state space
- Bicriterion Optimization of an M/G/1 Queue with A Removable Server
- Controlled diffusions with constraints
- Random search for constrained Markov decision processes with multi-policy improvement
- Selecting malaria interventions: a top-down approach
- Average Reward Markov Decision Processes with Multiple Cost Constraints
- Resource-constrained management of heterogeneous assets with stochastic deterioration
- Control policies for two-stage, pull-type production/inventory systems with constrained average cost criterion
- Sensitivity of constrained Markov decision processes
- Multiple objective nonatomic Markov decision processes with total reward criteria
- Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach
This page was built for publication: Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3830833)