Block-successive approximation for a discounted Markov decision model (Q2265958)

Property / full work available at URL: https://doi.org/10.1016/0304-4149(85)90046-8
Property / OpenAlex ID: W2080054160
Property / cites work: Contraction Mappings in the Theory Underlying Dynamic Programming
Property / cites work: Q4427313
Property / cites work: Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
Property / cites work: Some Bounds for Discounted Sequential Decision Processes
Property / cites work: Q4173220
Property / cites work: Q5342712


Language: English
Label: Block-successive approximation for a discounted Markov decision model
Description: scientific article

    Statements

    Block-successive approximation for a discounted Markov decision model (English)
    Publication year: 1985
    We suggest a new successive approximation method to compute the optimal discounted reward for finite state and action, discrete time, discounted Markov decision chains. The method is based on a block partitioning of the (stochastic) matrices corresponding to the stationary policies. The method is particularly attractive when the transition matrices are jointly nearly decomposable or nearly completely decomposable.
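
The abstract only sketches the idea, so here is a minimal illustration, in Python, of what a block-successive (block Gauss-Seidel) variant of value iteration looks like. This is not the authors' algorithm from the paper: the function name `block_successive_approximation`, the array layout of `P` and `r`, the `blocks` partition argument, and the plain sup-norm stopping rule are all assumptions made for the example, and the sketch does not exploit the near-decomposability structure that the paper identifies as making the method attractive.

```python
import numpy as np

def block_successive_approximation(P, r, beta, blocks, tol=1e-8, max_sweeps=10000):
    """Block Gauss-Seidel variant of successive approximation (value iteration).

    P      -- array of shape (A, S, S); P[a, s, t] is the probability of
              moving from state s to state t under action a
    r      -- array of shape (S, A) of one-step rewards
    beta   -- discount factor, 0 < beta < 1
    blocks -- list of index arrays partitioning {0, ..., S-1}
    """
    A, S, _ = P.shape
    v = np.zeros(S)
    for _ in range(max_sweeps):
        v_prev = v.copy()
        # One sweep: update the blocks in order.  States in blocks already
        # updated during this sweep contribute their *new* values to the
        # expectation, which is the Gauss-Seidel ingredient of the scheme.
        for idx in blocks:
            # q[a, i] = r(s_i, a) + beta * sum_t P[a, s_i, t] * v[t]
            q = r[idx].T + beta * (P[:, idx, :] @ v)
            v[idx] = q.max(axis=0)
        if np.max(np.abs(v - v_prev)) < tol:  # plain sup-norm stopping rule
            break
    return v

# Toy usage: a 2-state, 2-action chain split into two singleton blocks.
P = np.array([[[0.9, 0.1],
               [0.2, 0.8]],
              [[0.5, 0.5],
               [0.7, 0.3]]])
r = np.array([[1.0, 0.0],
              [0.0, 2.0]])
v = block_successive_approximation(P, r, beta=0.9,
                                   blocks=[np.array([0]), np.array([1])])
print(v)  # approximate optimal discounted reward vector
```

For a nearly completely decomposable chain, a natural choice of `blocks` is the weakly coupled groups of states, so that most of the transition probability mass stays inside the block currently being updated.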
    Keywords: successive approximation; optimal discounted reward; finite state and action, discrete time, discounted Markov decision chains; block partitioning; stationary policies