Average optimality for continuous-time Markov decision processes with a policy iteration approach (Q2465179)

From MaRDI portal
Property / full work available at URL: https://doi.org/10.1016/j.jmaa.2007.06.071
Property / OpenAlex ID: W2073471284
Property / cites work: Equivalence of exponential ergodicity and \(L^2\)-exponential convergence for Markov chains.
Property / cites work: Continuous time control of Markov processes on an arbitrary state space: Discounted rewards
Property / cites work: Exponential and uniform ergodicity of Markov processes
Property / cites work: Drift and monotonicity conditions for continuous-time controlled Markov chains with an average criterion
Property / cites work: A note on optimality conditions for continuous-time Markov decision processes with average cost criterion
Property / cites work: Q5463021
Property / cites work: Q4534402
Property / cites work: Average optimality for continuous-time Markov decision processes in Polish spaces
Property / cites work: Bias Optimality in Controlled Queueing Systems
Property / cites work: Q4891056
Property / cites work: Q4255598
Property / cites work: Q3266141
Property / cites work: Nondiscounted Continuous Time Markovian Decision Process with Countable State Space
Property / cites work: On maximal rewards and \(\varepsilon\)-optimal policies in continuous time Markov decision chains
Property / cites work: A note on bias optimality in controlled queueing systems
Property / cites work: Computable exponential convergence rates for stochastically ordered Markov processes
Property / cites work: Finite state continuous time Markov decision processes with an infinite planning horizon
Property / cites work: Q4223191
Property / cites work: Criteria for ergodicity, exponential ergodicity and strong ergodicity of Markov processes
Property / cites work: On Homogeneous Markov Models with Continuous Time and Finite or Countable State Space
Property / cites work: Continuous time Markov decision programming with average reward criterion and unbounded reward rate
Property / cites work: Bias optimality and strong \(n\) \((n = -1, 0)\) discount optimality for Markov decision processes
Property / cites work: Average optimality inequality for continuous-time Markov decision processes in Polish spaces
Property / cites work: Markov Decision Processes with Variance Minimization: A New Condition and Approach
Property / cites work: Unbounded cost Markov decision processes with limsup and liminf average criteria: new conditions
Property / cites work: Another set of conditions for strong \(n\) \((n = -1, 0)\) discount optimality in Markov decision processes
Property / cites work: Another set of conditions for Markov decision processes with average sample-path costs


Language: English
Label: Average optimality for continuous-time Markov decision processes with a policy iteration approach
Description: scientific article

    Statements

    Average optimality for continuous-time Markov decision processes with a policy iteration approach (English)
    Publication date: 8 January 2008
    The paper deals with the average expected reward criterion for continuous-time Markov decision processes with general state and action spaces. The transition rates of the underlying continuous-time jump Markov processes are allowed to be unbounded, and the reward rates may have neither upper nor lower bounds. The author gives conditions on the system's primitive data under which the existence of a solution to the average reward optimality equation and of an average optimal stationary policy is proved. Under the same conditions, the existence of \(\epsilon\)-average optimal stationary policies is also established. Moreover, the author studies some properties of average optimal stationary policies: he not only establishes another average optimality equation satisfied by an average optimal stationary policy, but also presents an interesting ``martingale characterization'' of such a policy. The approach of the paper is based on the policy iteration algorithm and differs from the ``vanishing discount factor approach'' and the ``optimality inequality approach'' usually used in the literature. The list of references contains 31 items.
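    For orientation, the average reward optimality equation mentioned above can be sketched in the notation standard for CTMDPs (a simplified rendering; the paper's setting allows Polish state and action spaces with possibly unbounded transition and reward rates). A constant \(g^*\) (the optimal gain) and a bias function \(h\) satisfy
\[
g^* \;=\; \sup_{a \in A(x)} \Bigl\{\, r(x,a) + \int_S h(y)\, q(dy \mid x, a) \Bigr\}, \qquad x \in S,
\]
    where \(q(\cdot \mid x,a)\) denotes the transition rates; a stationary policy attaining the supremum state by state is average optimal.

    Below is a minimal numerical sketch of a policy iteration scheme of the kind the paper builds on, restricted to a finite unichain CTMDP so that the evaluation step reduces to a linear system. The function name, the normalization \(h(0)=0\), and the toy data are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def ctmdp_policy_iteration(Q, r, tol=1e-9, max_iter=100):
    """Average-reward policy iteration for a finite unichain CTMDP (toy sketch).

    Q[a] : (n, n) conservative transition-rate matrix for action a
           (off-diagonal rates >= 0, each row sums to 0).
    r[a] : (n,) reward-rate vector for action a.
    Returns the optimal gain g, a bias function h, and an optimal policy.
    """
    n = Q[0].shape[0]
    policy = np.zeros(n, dtype=int)                 # arbitrary initial policy

    for _ in range(max_iter):
        # Policy evaluation: solve the Poisson equation
        #   r_f(x) + sum_y q_f(y|x) h(y) = g  for all x,  with h(0) = 0.
        Qf = np.array([Q[policy[x]][x] for x in range(n)])
        rf = np.array([r[policy[x]][x] for x in range(n)])
        A = np.zeros((n + 1, n + 1))
        A[:n, :n] = Qf
        A[:n, -1] = -1.0                            # column for the unknown gain g
        A[-1, 0] = 1.0                              # normalization h(0) = 0
        b = np.append(-rf, 0.0)
        sol = np.linalg.solve(A, b)
        h, g = sol[:n], sol[-1]

        # Policy improvement: maximize r(x,a) + sum_y q(y|x,a) h(y) per state.
        scores = np.array([[r[a][x] + Q[a][x] @ h for a in range(len(Q))]
                           for x in range(n)])
        improved = scores.argmax(axis=1)
        if np.all(scores[np.arange(n), policy] >= scores.max(axis=1) - tol):
            return g, h, policy                     # no strict improvement: optimal
        policy = improved

    return g, h, policy

# Hypothetical 2-state, 2-action example; rates and rewards are illustrative.
Q = [np.array([[-1.0, 1.0], [2.0, -2.0]]),
     np.array([[-3.0, 3.0], [0.5, -0.5]])]
r = [np.array([1.0, 0.0]), np.array([2.0, -0.5])]
g, h, f = ctmdp_policy_iteration(Q, r)
print(g, f)   # gain 0.8 under the policy (action 1 in state 0, action 0 in state 1)
```

    In the paper's general setting the evaluation step is not a finite linear solve; the conditions on the primitive data are what guarantee that each iterate is well defined and that the scheme converges to a solution of the optimality equation.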
    Keywords: continuous-time Markov decision processes; policy iteration algorithm; average criterion; optimality equation; optimal stationary policy
