Continuous time Markov decision programming with average reward criterion and unbounded reward rate (Q1179405): Difference between revisions

Markov decision problems with continuous time and unbounded reward rates are studied for countable state sets and compact metric action sets. The transition law is described by a controlled conservative transition rate matrix. For these problems the average expected reward is to be maximized over those (time-dependent) deterministic Markov strategies for which the resulting transition probabilities are continuous in time. Additional assumptions are given under which stationary optimal policies exist. The essential arguments are based on an embedded finite-state Markov decision chain with bounded rewards.
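As a rough illustration of the terms used in the review, the following sketch builds a toy model with a controlled conservative transition rate matrix and compares the average reward of its stationary deterministic policies. It is not the method of the reviewed paper: the two-state, two-action model, the action names `slow` and `fast`, all numeric rates and rewards, and the helper names are assumptions made up for this example, and the setting is finite with bounded rewards rather than countable with unbounded reward rates.

```python
from itertools import product

# Hypothetical two-state, two-action model (all numbers are illustrative).
# RATES[s][a] is row s of the transition rate matrix under action a.
RATES = {
    0: {"slow": [-1.0, 1.0], "fast": [-3.0, 3.0]},
    1: {"slow": [2.0, -2.0], "fast": [4.0, -4.0]},
}
# REWARD[s][a] is the reward rate earned while in state s under action a.
REWARD = {
    0: {"slow": 5.0, "fast": 4.0},
    1: {"slow": 1.0, "fast": 0.0},
}


def is_conservative(row, state, tol=1e-12):
    """Check that off-diagonal rates are nonnegative and the row sums to zero."""
    off_diag_ok = all(q >= 0 for j, q in enumerate(row) if j != state)
    return off_diag_ok and abs(sum(row)) <= tol


def average_reward(policy):
    """Long-run average reward of a stationary policy on the two states.

    policy is a pair (action in state 0, action in state 1).
    """
    a = RATES[0][policy[0]][1]  # jump rate from state 0 to state 1
    b = RATES[1][policy[1]][0]  # jump rate from state 1 to state 0
    # Stationary distribution of a two-state chain with rates a and b.
    pi0, pi1 = b / (a + b), a / (a + b)
    return pi0 * REWARD[0][policy[0]] + pi1 * REWARD[1][policy[1]]


def best_stationary_policy():
    """Enumerate all stationary deterministic policies and keep the best."""
    policies = product(RATES[0], RATES[1])
    return max(policies, key=average_reward)
```

Calling `best_stationary_policy()` returns the action pair with the largest long-run average reward in this toy model. Brute-force enumeration only works because the example is tiny; the review itself concerns existence results in a much more general setting.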

zbMATH Keywords

continuous time

unbounded reward

countable state sets

compact metric action sets

average expected reward

MaRDI profile type

MaRDI publication profile

Identifiers

zbMATH Open document ID

0752.90083

DOI

10.1007/BF02080199

Mathematics Subject Classification ID

90C40

zbMATH DE Number

24568

Sitelinks

Mathematics (1 entry)

mardi Publication:1179405

Revision as of 23:53, 29 January 2024: Import240129110155 (399,160 edits). Added link to MaRDI item.
Revision as of 23:34, 4 March 2024: Import240304020342 (4,416,906 edits). Set profile property.
	Property / MaRDI profile type
		MaRDI publication profile
	Property / MaRDI profile type: MaRDI publication profile / rank
		Normal rank