Optimal policies for controlled Markov chains with a constraint (Q1068783): Difference between revisions

The authors deal with the dynamic optimization of discrete-time Markovian systems. It is well known that the behaviour of many systems in practice can be described, from a mathematical point of view, by Markov systems. For example, we can mention computer-communication networks, production operations, computer operating systems and macroeconomic systems. However, the aim of the paper is to discuss basic questions of the former optimization problem with constraints. Assumptions under which an optimal policy exists are given in the paper. Further, it is shown that this policy always stationary and either non-randomized stationary, (i.e. simple) or consists of a mix of two non-randomized policies, equivalent to choosing independently one of two simple policies at each time by the toss of a (biased) coin. Lagrangian multiplier techniques are used to derive the mentioned results.

0 references

zbMATH Keywords

dynamic optimization

0 references

discrete-time Markovian systems

0 references

optimal policy

0 references

Lagrangian multiplier techniques

0 references

0 references

0 references

Identifiers

zbMATH Open document ID

0581.93067

0 references

DOI

10.1016/0022-247X(85)90288-4

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1068783

Revision as of 11:42, 13 February 2024 RedirectionBot (talk \| contribs) Bots 2,880,369 edits ‎Changed an Item ← Older edit	Revision as of 03:04, 5 March 2024 Import240304020342 (talk \| contribs) 4,416,906 edits Set profile property. Newer edit →
	Property / MaRDI profile type
		Publication
	Property / MaRDI profile type: Publication / rank
		Normal rank