Continuous time Markov decision programming with average reward criterion and unbounded reward rate (Q1179405): Difference between revisions

Continuous-time Markov decision problems with unbounded reward rates are studied for countable state sets and compact metric action sets. The transition law is described by a controlled conservative transition rate matrix. For these problems the average expected reward is to be maximized over a class of (time-dependent) deterministic Markov strategies for which the resulting transition probabilities are continuous in time. Additional assumptions are given under which stationary optimal policies exist. The essential arguments are based on an embedded finite-state Markov decision chain with bounded rewards.
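
As a rough illustration of two notions used in the review (a conservative transition rate matrix and the average expected reward), the following minimal sketch evaluates one fixed stationary policy on a toy two-state chain. It is not the paper's method: the paper treats countable state sets and unbounded reward rates, whereas this sketch assumes a finite, irreducible chain with at least one positive rate. The names `is_conservative` and `average_reward`, the rate matrix `Q`, and the reward rates `r` are all hypothetical and chosen only for illustration. Uniformization is used here simply as a convenient way to obtain the stationary distribution.

```python
def is_conservative(Q, tol=1e-12):
    """Return True if off-diagonal rates are >= 0 and every row sums to zero."""
    for i, row in enumerate(Q):
        if any(q < 0 for j, q in enumerate(row) if j != i):
            return False
        if abs(sum(row)) > tol:
            return False
    return True


def average_reward(Q, r, iters=500):
    """Long-run average reward of a finite, irreducible chain with generator Q.

    Assumption: at least one state has a positive exit rate.
    Uniformization turns Q into a stochastic matrix P = I + Q / lam, whose
    stationary distribution coincides with that of the continuous-time chain.
    """
    n = len(Q)
    # Uniformization constant: strictly larger than every exit rate -Q[i][i].
    lam = 2.0 * max(-Q[i][i] for i in range(n))
    P = [[(1.0 if i == j else 0.0) + Q[i][j] / lam for j in range(n)]
         for i in range(n)]
    # Power iteration for the stationary distribution pi (pi = pi P).
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    # Average reward = sum over states of pi(i) * r(i).
    return sum(p * x for p, x in zip(pi, r))


# Hypothetical toy data: rate 1 from state 0 to 1, rate 3 from state 1 to 0.
Q = [[-1.0, 1.0],
     [3.0, -3.0]]
r = [2.0, 10.0]  # reward rate earned per unit time in each state

assert is_conservative(Q)
gain = average_reward(Q, r)
print(gain)
```

For this toy chain the stationary distribution is (0.75, 0.25), so the printed value is approximately 0.75 * 2 + 0.25 * 10 = 4.0. Optimizing over policies, as the paper does, would additionally require comparing such values across actions.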

zbMATH Keywords

continuous time

unbounded reward

countable state sets

compact metric action sets

average expected reward

MaRDI profile type

MaRDI publication profile

cites work

Continuous time control of Markov processes on an arbitrary state space: average return criterion

Q4712090

Q3908791

Q5524074

Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

full work available at URL

https://doi.org/10.1007/bf02080199

Identifiers

zbMATH Open document ID

0752.90083

DOI

10.1007/BF02080199

Mathematics Subject Classification ID

Sitelinks

Mathematics (1 entry)

mardi Publication:1179405

@@ Property / full work available at URL @@
+https://doi.org/10.1007/bf02080199
+Normal rank
@@ Property / OpenAlex ID @@
+W1990651907
@@ Property / OpenAlex ID: W1990651907 / rank @@
+Normal rank