The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin

From MaRDI portal
Publication:3833894

DOI10.1137/0327016zbMath0677.90085OpenAlexW2087896522MaRDI QIDQ3833894

Masami Kurano

Publication date: 1989

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1137/0327016




Related Items

Existence of optimal stationary policies in discounted Markov decision processes: Approaches by occupation measuresRecurrence conditions for Markov decision processes with Borel state space: A surveyOn the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processesOptimal service control against worst case admission policies: A multichained stochastic gameAsymptotic behavior of continuous stochastic gamesOn Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded CostsLinear programming formulation of MDPs in countable state space: The multichain caseAverage cost Markov decision processes under the hypothesis of DoeblinAverage cost Markov decision processes: Optimality conditionsFunctional characterization for average cost Markov decision processes with Doeblin's conditionsOn the comparison of the stability and control problem of differential systemsUnnamed ItemAverage cost optimal policies for Markov control processes with Borel state space and unbounded costsOn the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded CostsRemarks on the existence of solutions to the average cost optimality equation in Markov decision processesAverage Reward Markov Decision Processes with Multiple Cost ConstraintsOn structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policiesConvex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic controlOptimal control problem for the Lyapunov exponents of random matrix productsConstrained markov decision processes with compact state and action spaces: the average case