Nonstationary value-iteration and adaptive control of discounted semi- Markov processes (Q1068732): Difference between revisions

We consider in this paper discounted-reward, denumerable state space, semi-Markov decision processes which depend on unknown parameters. The problems we are interested in are: Given that the true parameter value is unknown, (I) give an iterative scheme to determine the total maximal discounted reward, and (II) find an asymptotically discount optimal (adaptive) policy. Our solutions are inspired by the nonstationary value iteration (NVI) scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [J. Optimization Theory Appl. 34, 207-241 (1981; Zbl 0426.90091)] combined with the ideas of \textit{M. Schäl} [in: Optimization, theory and algorithms, Conf. Confolant/France 1981, Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)] concerning the ''principle of estimation and control'' for the adaptive control of semi-Markov processes.

0 references

zbMATH Keywords

discounted-reward

0 references

denumerable state space

0 references

semi-Markov decision processes

0 references

unknown parameters

0 references

nonstationary value iteration

0 references

principle of estimation and control

0 references

adaptive control of semi-Markov processes

0 references

MaRDI profile type

MaRDI publication profile

0 references

Identifiers

zbMATH Open document ID

0581.90096

0 references

DOI

10.1016/0022-247X(85)90253-7

0 references

Mathematics Subject Classification ID

90C40

0 references

zbMATH DE Number

3930759

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1068732

Revision as of 23:54, 30 January 2024 Import240129110113 (talk \| contribs) Bots 7,163,963 edits Added link to MaRDI item. ← Older edit	Revision as of 02:04, 5 March 2024 Import240304020342 (talk \| contribs) 4,416,906 edits Set profile property. Newer edit →
	Property / MaRDI profile type
		MaRDI publication profile
	Property / MaRDI profile type: MaRDI publication profile / rank
		Normal rank