Average optimality for continuous-time Markov decision processes with a policy iteration approach

From MaRDI portal
Publication:2465179

DOI10.1016/j.jmaa.2007.06.071zbMath1156.90023OpenAlexW2073471284MaRDI QIDQ2465179

Quanxin Zhu

Publication date: 8 January 2008

Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.jmaa.2007.06.071




Related Items (18)

Optimal Control of Piecewise Deterministic Markov ProcessesHamilton-Jacobi-Bellman inequality for the average control of piecewise deterministic Markov processesExact decomposition approaches for Markov decision processes: a surveyAnother set of verifiable conditions for average Markov decision processes with Borel spacesStrong \(n\)-discount and finite-horizon optimality for continuous-time Markov decision processesAverage sample-path optimality for continuous-time Markov decision processes in Polish spacesBias and Overtaking Optimality for Continuous-Time Jump Markov Decision Processes in Polish SpacesPolicy iteration algorithms for zero-sum stochastic differential games with long-run average payoff criteriaDenumerable continuous-time Markov decision processes with multiconstraints on average costsPolicy iteration for continuous-time average reward Markov decision processes in Polish spacesVariance minimization for continuous-time Markov decision processes: two approachesStationary analysis of the infinite-server queue modulated by a multi-phase Markovian environmentNew sufficient conditions for average optimality in continuous-time Markov decision processesNew discount and average optimality conditions for continuous-time Markov decision processesThe Vanishing Discount Approach for the Average Continuous Control of Piecewise Deterministic Markov ProcessesAbsorbing Continuous-Time Markov Decision Processes with Total Cost CriteriaAverage optimality for continuous-time Markov decision processes under weak continuity conditionsUnnamed Item



Cites Work


This page was built for publication: Average optimality for continuous-time Markov decision processes with a policy iteration approach