Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes (Q2375462)

From MaRDI portal
Property / author
 
Property / author: Raúl Montes-De-oca / rank
Normal rank
 
Property / Wikidata QID
 
Property / Wikidata QID: Q59002487 / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1155/2013/271279 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2001225281 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for the uniqueness of optimal policies of discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average Optimality in Markov Control Processes via Discounted-Cost Problems and Linear Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Note—A Note on Dynamic Programming with Unbounded Rewards / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3994363 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4844751 / rank
 
Normal rank
 


scientific article
Language: English
Label: Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes
Description: scientific article

    Statements

    Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes (English)
    0 references
    14 June 2013
    0 references
    Summary: From the classical point of view, it is important to determine whether, in a Markov decision process (MDP), the optimal policies are not only guaranteed to exist but are also unique. It is well known that uniqueness does not always hold in optimization problems (for instance, in linear programming). On the other hand, in such problems a slight perturbation of the cost functional can restore uniqueness. In this paper it is proved that, under adequate conditions, the value functions of an MDP and of its cost-perturbed version stay close to each other, which in some sense is the first concern. We are interested in the stability of Markov decision processes with respect to perturbations of the cost function.
    0 references
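
    The closeness claim in the summary can be illustrated numerically. The following is a minimal sketch, not taken from the paper: the finite MDP, the flat cost, and the random perturbation are illustrative assumptions. For a discounted MDP with discount factor alpha, if two one-stage costs differ by at most eps everywhere, the contraction property of the Bellman operator gives ||V - V_eps||_inf <= eps/(1 - alpha); a small strictly positive perturbation can also break ties among optimal actions, which is the uniqueness-restoring effect the summary alludes to.

import numpy as np

def value_iteration(cost, P, alpha, tol=1e-10, max_iter=100_000):
    """Minimize expected discounted cost; cost has shape (S, A), P has shape (S, A, S)."""
    S, A = cost.shape
    V = np.zeros(S)
    for _ in range(max_iter):
        Q = cost + alpha * (P @ V)        # Q[s, a] = c(s, a) + alpha * E[V(next state)]
        V_new = Q.min(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            V = V_new
            break
        V = V_new
    Q = cost + alpha * (P @ V)            # recompute Q at the (near-)fixed point
    return V, Q.argmin(axis=1)

rng = np.random.default_rng(0)
S, A, alpha = 4, 3, 0.9
P = rng.random((S, A, S))
P /= P.sum(axis=2, keepdims=True)         # normalise to a transition kernel
cost = np.ones((S, A))                    # flat cost: every stationary policy is optimal
eps = 1e-3
h = rng.random((S, A))                    # perturbation with 0 <= h < 1

V, pi = value_iteration(cost, P, alpha)
V_eps, pi_eps = value_iteration(cost + eps * h, P, alpha)

print("||V - V_eps||_inf =", np.max(np.abs(V - V_eps)),
      " bound eps/(1-alpha) =", eps / (1 - alpha))
print("unperturbed greedy actions (ties, argmin picks the first):", pi)
print("perturbed greedy actions (ties broken):", pi_eps)

    Running the sketch prints a sup-norm gap below eps/(1 - alpha), and the perturbed problem selects a single greedy action per state where the unperturbed flat-cost problem has every action tied.
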

    Identifiers