Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains

countable state space optimal stationary policies average reward optimality equation discrete time Markov decision process bounded measurable reward function simultaneous Doeblin condition

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Recommendations

Necessary conditions for the optimality equation in average-reward Markov decision processes
Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes
scientific article; zbMATH DE number 934466
scientific article; zbMATH DE number 3900546
Discounted and average Markov decision processes with unbounded rewards: New conditions

Cites work

scientific article; zbMATH DE number 3560401 (Why is no real title available?)
scientific article; zbMATH DE number 6846220 (Why is no real title available?)
scientific article; zbMATH DE number 3445938 (Why is no real title available?)
scientific article; zbMATH DE number 3320878 (Why is no real title available?)
scientific article; zbMATH DE number 3338194 (Why is no real title available?)
A new condition for the existence of optimal stationary policies in average cost Markov decision processes
A note on simultaneous recurrence conditions on a set of denumerable stochastic matrices
Existence of optimal stationary policies in average reward Markov decision processes with a recurrent state
Necessary conditions for the optimality equation in average-reward Markov decision processes
Two competing queues with linear costs and geometric service requirements: the μc-rule is often optimal

Cited in

(18)

Optimality equations and sensitive optimality in bounded Markov decision processes¹
Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion
Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisited
On an extremal property of Markov chains and sufficiency of Markov strategies in Markov decision processes with the Dubins-Savage criterion
Recurrence conditions for Markov decision processes with Borel state space: A survey
A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs
Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains
A note on the convergence rate of the value iteration scheme in controlled Markov chains
Necessary conditions for the optimality equation in average-reward Markov decision processes
Strong 1-optimal stationary policies in denumerable Markov decision processes
Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes
scientific article; zbMATH DE number 1908240 (Why is no real title available?)
A counterexample on the optimality equation in Markov decision chains with the average cost criterion
Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion
Existence of optimal stationary policies in average reward Markov decision processes with a recurrent state
scientific article; zbMATH DE number 4102842 (Why is no real title available?)
Recent results on conditions for the existence of average optimal stationary policies
On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes

This page was built for publication: Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1103532)