Functional equations in the theory of dynamic programming. XI: Limit theorems (Q773497): Difference between revisions

Let \(p\in P\) be the state vector of a discrete process and \(q\in Q\) a decision variable that transforms \(p\) into \(T(p,q)\in P\). Each transformation yields a ``return'' \(b(p, q)\ge 0\). Given \(p_1\), it is required to choose \(q_1, \ldots, q_N\) such that \(R_N = \sum_{i=1}^N b(p_i, q_i)\) is maximized, where \(p_{i+1} = T(p_i, q_i)\). The author proves that if for any \(p_a, p_b\in P\) there is a \(q\in Q\) such that \(T(p_a, q) = p_b\), then for all \(p_1\in P\) we have \(\max_q R_N \sim Na\) as \(N\to\infty\), where \(a\) is independent of \(p_1\). It is mentioned that the existence of an asymptotic policy (i.e. choices of \(q\)) has not been proved. (For part X, written together with \textit{S. Lehman}, see [Duke Math. J. 27, 55--69 (1960; Zbl 0096.14502)].)
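The limit statement can be illustrated numerically on a small finite example. The sketch below is not taken from the paper: it assumes a three-state set \(P\), takes the decision \(q\) to be the index of the next state so that \(T(p,q) = q\) (which satisfies the reachability hypothesis trivially), and uses an arbitrary non-negative return matrix `B`. The function names `max_total_return` and `average_returns` are hypothetical. It evaluates the usual recurrence \(f_N(p) = \max_q [b(p,q) + f_{N-1}(T(p,q))]\) with \(f_0 = 0\) and reports \(f_N(p)/N\) for every start state.

```python
def max_total_return(b, n_stages):
    """Return the list f_N(p) over all start states p.

    Assumption (illustrative only): the decision q is the index of the
    next state, i.e. T(p, q) = q, so every state is reachable from every
    other state in one step. b[p][q] is the non-negative return b(p, q).
    """
    states = range(len(b))
    f = [0.0] * len(b)  # f_0(p) = 0 for every p
    for _ in range(n_stages):
        # f_k(p) = max_q [ b(p, q) + f_{k-1}(T(p, q)) ] with T(p, q) = q
        f = [max(b[p][q] + f[q] for q in states) for p in states]
    return f


def average_returns(b, n_stages):
    """Return f_N(p) / N for every start state p."""
    return [value / n_stages for value in max_total_return(b, n_stages)]


# Arbitrary example returns, chosen only for illustration.
B = [
    [1.0, 4.0, 0.0],
    [2.0, 0.0, 3.0],
    [5.0, 1.0, 2.0],
]

if __name__ == "__main__":
    for n in (3, 30, 300):
        print(n, average_returns(B, n))
```

For this particular matrix the averages for all three start states approach a common value as \(N\) grows, in line with \(\max_q R_N \sim Na\) with \(a\) independent of \(p_1\); here the limit equals the largest mean return around a cycle of states.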

Mathematics Subject Classification ID

90C39

functional equations

dynamic programming

limit theorems

Revision as of 11:46, 30 January 2024 by Import240129110113 (bot): Added link to MaRDI item.
Revision as of 06:20, 20 February 2024 by RedirectionBot (bot): Removed claim: author (P16): Item:Q594813
Property / author
	~~Richard Bellman~~
Property / author: Richard Bellman / rank
	~~Normal rank~~