The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin

DOI10.1137/0327016MaRDI QIDQ3833894zbMATH OpenOpenAlexFDO

Authors Masami Kurano

Publication date 1989

Published in SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL https://doi.org/10.1137/0327016

zbMATH Keywords

Doeblin condition average-cost Markov decision processes stationary, uniformly optimal policy

Mathematics Subject Classification ID

Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)

Recommendations

Average cost Markov decision processes under the hypothesis of Doeblin
Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
Functional characterization for average cost Markov decision processes with Doeblin's conditions
scientific article; zbMATH DE number 3854831
Average cost Markov decision processes with weakly continuous transition probabilities

Cited in

(21)

This page was built for publication: The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3833894)