Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

From MaRDI portal
Publication:2925348


DOI10.1287/moor.1120.0555zbMath1297.90173arXiv1202.4122MaRDI QIDQ2925348

Eugene A. Feinberg, Nina V. Zadoianchuk (Zadoyanchuk), Pavlo O. Kasyanov

Publication date: 21 October 2014

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1202.4122


90C39: Dynamic programming

90C40: Markov and semi-Markov decision processes


Related Items

Unnamed Item, STOCHASTIC SETUP-COST INVENTORY MODEL WITH BACKORDERS AND QUASICONVEX COST FUNCTIONS, Markov Decision Processes with Incomplete Information and Semiuniform Feller Transition Probabilities, Fatou's Lemma in Its Classical Form and Lebesgue's Convergence Theorems for Varying Measures with Applications to Markov Decision Processes, Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies, Average Cost Markov Decision Processes with Semi-Uniform Feller Transition Probabilities, Fatou's Lemma for Weakly Converging Measures under the Uniform Integrability Condition, On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs, Average optimality for continuous-time Markov decision processes under weak continuity conditions, On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes, Formalization of methods for the development of autonomous artificial intelligence systems, Continuity of discounted values and the structure of optimal policies for <scp>periodic‐review</scp> inventory systems with setup costs, A note on the existence of optimal stationary policies for average Markov decision processes with countable states, Examples concerning Abel and Cesàro limits, Convergence of probability measures and Markov decision models with incomplete information, Continuity of minima: local results, Constrained Markov decision processes in Borel spaces: from discounted to average optimality, Uniform Fatou's lemma, Berge's theorem for noncompact image sets, LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems, Near optimality of quantized policies in stochastic control under weak continuity conditions, Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisited, Planning for the long run: programming with patient, Pareto responsive preferences, MDPs with setwise continuous transition probabilities, On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies, Convex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic control, Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors, Continuity of equilibria for two-person zero-sum games with noncompact action sets and unbounded payoffs, On the optimality equation for average cost Markov decision processes and its validity for inventory control, Unbounded dynamic programming via the Q-transform, On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities, Berge's maximum theorem for noncompact image sets, Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted mdps, A useful technique for piecewise deterministic Markov decision processes, A survey of average cost problems in deterministic discrete-time control systems, Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities, Optimality Conditions for Partially Observable Markov Decision Processes, On the reduction of total‐cost and average‐cost MDPs to discounted MDPs, A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies



Cites Work