Serial and parallel value iteration algorithms for discounted Markov decision processes (Q2367364): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: On the Generation of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3795523 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On iterative optimization ol structured Markov decision processes with discounted rewards / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounting, Ergodicity and Convergence for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computing the discounted return in markov and semi-markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computational comparison of value iteration algorithms for discounted Markov decision processes / rank
 
Normal rank

Latest revision as of 17:25, 17 May 2024

scientific article
Language Label Description Also known as
English
Serial and parallel value iteration algorithms for discounted Markov decision processes
scientific article

    Statements

    Serial and parallel value iteration algorithms for discounted Markov decision processes (English)
    0 references
    0 references
    0 references
    0 references
    13 July 1994
    0 references
    This paper extends the work of \textit{L. C. Thomas}, \textit{R. Hartley} and \textit{A. C. Lavercombe} [Oper. Res. Lett. 2, 72-76 (1983; Zbl 0511.90094)], where a number of serial value iteration schemes for the solution of discounted Markov decision processes were appraised. Several of the schemes are re-assessed using carefully chosen test problems. Parallel implementation of the algorithms, based on state space partition, is then discussed. Detailed performance data are given which demonstrate considerable efficiency gains over the serial approach. Throughout, the influence of problem structure is stressed.
    0 references
    transputers
    0 references
    parallel implementation
    0 references
    serial value iteration schemes
    0 references
    discounted Markov decision processes
    0 references
    state space partition
    0 references

    Identifiers