On linear programming for constrained and unconstrained average-cost Markov decision processes with countable action spaces and strictly unbounded costs

DOI10.1287/MOOR.2021.1177zbMATH Open1489.90211arXiv1905.12095OpenAlexW2947183726MaRDI QIDQ5085149FDOQ5085149

Publication date: 27 June 2022

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Abstract: We consider the linear programming approach for constrained and unconstrained Markov decision processes (MDPs) under the long-run average cost criterion, where the class of MDPs in our study have Borel state spaces and discrete countable action spaces. Under a strict unboundedness condition on the one-stage costs and a recently introduced majorization condition on the state transition stochastic kernel, we study infinite-dimensional linear programs for the average-cost MDPs and prove the absence of a duality gap and other optimality results. Our results do not require a lower-semicontinuous MDP model. Thus, they can be applied to countable action space MDPs where the dynamics and one-stage costs are discontinuous in the state variable. Our proofs make use of the continuity property of Borel measurable functions asserted by Lusin's theorem.

Full work available at URL: https://arxiv.org/abs/1905.12095

Recommendations

zbMATH Keywords

constraints duality Markov decision processes average cost Borel state space infinite-dimensional linear programs countable action space majorization condition minimum pair

Mathematics Subject Classification ID

Linear programming (90C05) Optimality conditions and duality in mathematical programming (90C46) Markov and semi-Markov decision processes (90C40) Programming in abstract spaces (90C48) Optimal stochastic control (93E20)

Cites Work

Cited In (3)

This page was built for publication: On linear programming for constrained and unconstrained average-cost Markov decision processes with countable action spaces and strictly unbounded costs

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5085149)