Average cost Markov decision processes with weakly continuous transition probabilities
From MaRDI portal
Publication:2925348
Abstract: This paper presents sufficient conditions for the existence of stationary optimal policies for average-cost Markov Decision Processes with Borel state and action sets and with weakly continuous transition probabilities. The one-step cost functions may be unbounded, and action sets may be noncompact. The main contributions of this paper are: (i) general sufficient conditions for the existence of stationary discount-optimal and average-cost optimal policies and descriptions of properties of value functions and sets of optimal actions, (ii) a sufficient condition for the average-cost optimality of a stationary policy in the form of optimality inequalities, and (iii) approximations of average-cost optimal actions by discount-optimal actions.
Recommendations
- Average cost Markov decision processes with semi-uniform Feller transition probabilities
- Average cost Markov decision processes: Optimality conditions
- Average optimality for continuous-time Markov decision processes under weak continuity conditions
- Average cost Markov decision processes under the hypothesis of Doeblin
- Partially observable total-cost Markov decision processes with weakly continuous transition probabilities
- Continuous-time Markov decision processes under the risk-sensitive average cost criterion
- Denumerable continuous-time Markov decision processes with multiconstraints on average costs
- Average Reward Markov Decision Processes with Multiple Cost Constraints
- Markov Decision Processes with a Borel Measurable Cost Function—The Average Case
Cites work
- scientific article; zbMATH DE number 3906790 (Why is no real title available?)
- A Counterexample on the Semicontinuity of Minima
- A counterexample on the optimality equation in Markov decision chains with the average cost criterion
- Arbitrary State Markovian Decision Processes
- Average Optimality in Dynamic Programming with General State Space
- Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls
- Compactness of the space of non-randomized policies in countable-state sequential decision processes
- Discrete Dynamic Programming
- Fatou's lemma and Lebesgue's convergence theorem for measures
- Fatou's lemma for weakly converging probabilities
- Markovian Sequential Replacement Processes
- Non-Discounted Denumerable Markovian Decision Models
- OPTIMALITY OF FOUR-THRESHOLD POLICIES IN INVENTORY SYSTEMS WITH CUSTOMER RETURNS AND BORROWING/STORAGE OPTIONS
- On sequential decisions and Markov chains
- On the Nonexistence of $|varepsilon$-Optimal Randomized Stationary Policies in Average Cost Markov Decision Models
- Optimal decision procedures for finite markov chains. Part I: Examples
- Optimality Inequalities for Average Cost Markov Decision Processes and the Stochastic Cash Balance Problem
Cited in
(51)- Uniform Fatou's lemma
- Markov decision processes with incomplete information and semiuniform Feller transition probabilities
- The average cost of Markov chains subject to total variation distance uncertainty
- Fatou's lemma for weakly converging measures under the uniform integrability condition
- A mixed value and policy iteration method for stochastic control with universally measurable policies
- Berge's maximum theorem for noncompact image sets
- LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems
- The Existence of a Minimum Pair of State and Policy for Markov Decision Processes under the Hypothesis of Doeblin
- Average optimality for continuous-time Markov decision processes under weak continuity conditions
- Average Optimality in Dynamic Programming with General State Space
- scientific article; zbMATH DE number 94713 (Why is no real title available?)
- Examples concerning Abel and Cesàro limits
- On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities
- Partially observable total-cost Markov decision processes with weakly continuous transition probabilities
- On the reduction of total-cost and average-cost MDPs to discounted mdps
- Unbounded dynamic programming via the Q-transform
- Planning for the long run: programming with patient, Pareto responsive preferences
- Stochastic setup-cost inventory model with backorders and quasiconvex cost functions
- On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
- Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisited
- Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities
- A note on the existence of optimal stationary policies for average Markov decision processes with countable states
- Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted mdps
- A survey of average cost problems in deterministic discrete-time control systems
- On convergence of value iteration for a class of total cost Markov decision processes
- Near optimality of quantized policies in stochastic control under weak continuity conditions
- Constrained Markov decision processes in Borel spaces: from discounted to average optimality
- A useful technique for piecewise deterministic Markov decision processes
- Average cost Markov decision processes: Optimality conditions
- Functional characterization for average cost Markov decision processes with Doeblin's conditions
- Continuity of discounted values and the structure of optimal policies for <scp>periodic‐review</scp> inventory systems with setup costs
- Fatou's lemma in its classical form and Lebesgue's convergence theorems for varying measures with applications to Markov decision processes
- Another set of conditions for Markov decision processes with average sample-path costs
- MDPs with setwise continuous transition probabilities
- Convergence of probability measures and Markov decision models with incomplete information
- Continuity of minima: local results
- On strong average optimality of Markov decision processes with unbounded costs
- Average cost Markov decision processes with semi-uniform Feller transition probabilities
- Optimality conditions for partially observable Markov decision processes
- Berge's theorem for noncompact image sets
- Formalization of methods for the development of autonomous artificial intelligence systems
- scientific article; zbMATH DE number 7625164 (Why is no real title available?)
- New discount and average optimality conditions for continuous-time Markov decision processes
- Convex analytic method revisited: further optimality results and performance of deterministic policies in average cost stochastic control
- Another look at partially observed optimal stochastic control: existence, ergodicity, and approximations without belief-reduction
- scientific article; zbMATH DE number 4102842 (Why is no real title available?)
- Average cost optimality inequality for Markov decision processes with Borel spaces and universally measurable policies
- On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies
- Continuity of equilibria for two-person zero-sum games with noncompact action sets and unbounded payoffs
- On the optimality equation for average cost Markov decision processes and its validity for inventory control
- Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors
This page was built for publication: Average cost Markov decision processes with weakly continuous transition probabilities
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2925348)