On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
From MaRDI portal
Publication:5220188
Abstract: We consider average-cost Markov decision processes (MDPs) with Borel state spaces, countable, discrete action spaces, and strictly unbounded one-stage costs. For the minimum pair approach, we introduce a new majorization condition on the state transition stochastic kernel, in place of the commonly required continuity conditions on the MDP model. We combine this majorization condition with Lusin's theorem to prove the existence of a stationary minimum pair, i.e., a stationary policy paired with an invariant probability measure induced on the state space, with the property that the pair attains the minimum long-run average cost over all policies and initial distributions. We also establish other optimality properties of a stationary minimum pair, and for the stationary policy in such a pair, under additional recurrence or regularity conditions, we prove its pathwise optimality and strong optimality. Our results can be applied to a class of countable action space MDPs in which the dynamics and one-stage costs are discontinuous with respect to the state variable.
Recommendations
- On linear programming for constrained and unconstrained average-cost Markov decision processes with countable action spaces and strictly unbounded costs
- Exact finite approximations of average-cost countable Markov decision processes
- On strong average optimality of Markov decision processes with unbounded costs
- Denumerable continuous-time Markov decision processes with multiconstraints on average costs
- Approximation of average cost optimal policies for general Markov decision processes with unbounded costs
- Value iteration in countable state average cost Markov decision processes with unbounded costs
- Countable state Markov decision processes with unbounded jump rates and discounted cost: optimality equation and approximations
- Average cost Markov decision processes with weakly continuous transition probabilities
- Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes
Cites work
- scientific article; zbMATH DE number 3723610
- scientific article; zbMATH DE number 1325008
- scientific article; zbMATH DE number 513084
- scientific article; zbMATH DE number 700091
- scientific article; zbMATH DE number 1786124
- scientific article; zbMATH DE number 837313
- scientific article; zbMATH DE number 3245885
- scientific article; zbMATH DE number 3274494
- A mixed value and policy iteration method for stochastic control with universally measurable policies
- Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
- Average Optimality in Dynamic Programming with General State Space
- Average cost Markov decision processes with weakly continuous transition probabilities
- Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls
- General Irreducible Markov Chains and Non-Negative Operators
- Markov Chains and Stochastic Stability
- On Linear Programming in a Markov Decision Problem
- On Minimum Cost Per Unit Time Control of Markov Chains
- Real Analysis and Probability
- Sample path average optimality of Markov control processes with strictly unbounded cost
- Sample-path average optimality for Markov control processes
- Stochastic optimal control. The discrete time case
- The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin
- The policy iteration algorithm for average reward Markov decision processes with general state space
- Universally Measurable Policies in Dynamic Programming
- Weak conditions for average optimality in Markov control processes
Cited in (5)
- A survey of average cost problems in deterministic discrete-time control systems
- Convex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic control
- Average cost optimality inequality for Markov decision processes with Borel spaces and universally measurable policies
- On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies
- On linear programming for constrained and unconstrained average-cost Markov decision processes with countable action spaces and strictly unbounded costs