Serial and parallel value iteration algorithms for discounted Markov decision processes (Q2367364): Difference between revisions

This paper extends the work of \textit{L. C. Thomas}, \textit{R. Hartley} and \textit{A. C. Lavercombe} [Oper. Res. Lett. 2, 72-76 (1983; Zbl 0511.90094)], where a number of serial value iteration schemes for the solution of discounted Markov decision processes were appraised. Several of the schemes are re-assessed using carefully chosen test problems. Parallel implementation of the algorithms, based on state space partition, is then discussed. Detailed performance data are given which demonstrate considerable efficiency gains over the serial approach. Throughout, the influence of problem structure is stressed.

0 references

zbMATH Keywords

transputers

0 references

parallel implementation

0 references

serial value iteration schemes

0 references

discounted Markov decision processes

0 references

state space partition

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

On the Generation of Markov Decision Processes

0 references

Q3795523

0 references

On iterative optimization ol structured Markov decision processes with discounted rewards

0 references

Q3266141

0 references

Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems

0 references

Discounting, Ergodicity and Convergence for Markov Decision Processes

0 references

Computing the discounted return in markov and semi-markov chains

0 references

Improved iterative computation of the expected discounted return in Markov and semi-Markov chains

0 references

Some Bounds for Discounted Sequential Decision Processes

0 references

Computational comparison of value iteration algorithms for discounted Markov decision processes

0 references

Identifiers

zbMATH Open document ID

0791.90067

0 references

DOI

10.1016/0377-2217(93)90061-Q

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2367364

@@ Property / cites work @@
+On the Generation of Markov Decision Processes
@@ Property / cites work: On the Generation of Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q3795523
@@ Property / cites work: Q3795523 / rank @@
+Normal rank
@@ Property / cites work @@
+On iterative optimization ol structured Markov decision processes with discounted rewards
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
+Normal rank
@@ Property / cites work @@
+Discounting, Ergodicity and Convergence for Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Computing the discounted return in markov and semi-markov chains
+Normal rank
@@ Property / cites work @@
+Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
+Normal rank
@@ Property / cites work @@
+Some Bounds for Discounted Sequential Decision Processes
+Normal rank
@@ Property / cites work @@
+Computational comparison of value iteration algorithms for discounted Markov decision processes
+Normal rank