The Bellman's principle of optimality in the discounted dynamic programming (Q1112737)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | The Bellman's principle of optimality in the discounted dynamic programming |
scientific article |
Statements
The Bellman's principle of optimality in the discounted dynamic programming (English)
0 references
1987
0 references
The author presents a short proof of Bellman's optimality principle in discounted dynamic programming, which states that the policy \(\pi\) is optimal if and only if its reward I(\(\pi)\) satisfies the optimality equation. The given proof is based on the properties of the conditional expectation. Some further applications of the author's technique are proposed.
0 references
Bellman's optimality principle
0 references
discounted dynamic programming
0 references