Serial and parallel value iteration algorithms for discounted Markov decision processes (Q2367364)

Property / author: Lyn C. Thomas
Property / MaRDI profile type: MaRDI publication profile
Property / cites work: On the Generation of Markov Decision Processes
Property / cites work: Q3795523
Property / cites work: On iterative optimization of structured Markov decision processes with discounted rewards
Property / cites work: Q3266141
Property / cites work: Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
Property / cites work: Discounting, Ergodicity and Convergence for Markov Decision Processes
Property / cites work: Computing the discounted return in Markov and semi-Markov chains
Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
Property / cites work: Some Bounds for Discounted Sequential Decision Processes
Property / cites work: Computational comparison of value iteration algorithms for discounted Markov decision processes

Latest revision as of 18:25, 17 May 2024

Language: English
Label: Serial and parallel value iteration algorithms for discounted Markov decision processes
Description: scientific article

    Statements

    Serial and parallel value iteration algorithms for discounted Markov decision processes (English)
    13 July 1994
    This paper extends the work of L. C. Thomas, R. Hartley and A. C. Lavercombe [Oper. Res. Lett. 2, 72-76 (1983; Zbl 0511.90094)], where a number of serial value iteration schemes for the solution of discounted Markov decision processes were appraised. Several of the schemes are re-assessed using carefully chosen test problems. Parallel implementation of the algorithms, based on state space partition, is then discussed. Detailed performance data are given which demonstrate considerable efficiency gains over the serial approach. Throughout, the influence of problem structure is stressed.
    transputers
    parallel implementation
    serial value iteration schemes
    discounted Markov decision processes
    state space partition
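
The review mentions parallel value iteration based on state space partition but does not describe the schemes themselves. As a rough illustration of the general idea only, the sketch below splits the states into blocks and updates every block from the same previous iterate (a Jacobi-style sweep), so the blocks do not depend on one another within a sweep. This is an assumption made for illustration, not the paper's algorithm: the function names, the data layout (`rewards[s][a]` and `transitions[s][a]` as a dict of next-state probabilities), the stopping test, and the two-state example are all hypothetical, and Python threads stand in for the transputer processors purely to show the structure.

```python
from concurrent.futures import ThreadPoolExecutor


def backup(state, values, rewards, transitions, discount):
    """One Bellman backup: best immediate reward plus discounted expected value."""
    return max(
        rewards[state][a]
        + discount * sum(p * values[j] for j, p in transitions[state][a].items())
        for a in range(len(rewards[state]))
    )


def partition(num_states, num_blocks):
    """Split state indices 0..num_states-1 into contiguous, nearly equal blocks."""
    size, extra = divmod(num_states, num_blocks)
    blocks, start = [], 0
    for b in range(num_blocks):
        end = start + size + (1 if b < extra else 0)
        blocks.append(range(start, end))
        start = end
    return blocks


def partitioned_value_iteration(rewards, transitions, discount,
                                num_blocks=2, tol=1e-8, max_sweeps=10_000):
    """Jacobi-style value iteration with one worker per block of states.

    Returns (values, sweeps_used). The stopping rule is a plain sup-norm
    test on successive iterates, chosen for simplicity (an assumption).
    """
    values = [0.0] * len(rewards)
    blocks = partition(len(rewards), num_blocks)
    with ThreadPoolExecutor(max_workers=num_blocks) as pool:
        for sweep in range(1, max_sweeps + 1):
            snapshot = values  # read-only during this sweep

            def update_block(block):
                # Each worker reads only the previous iterate.
                return [backup(s, snapshot, rewards, transitions, discount)
                        for s in block]

            # pool.map preserves block order, so results line up with states.
            results = list(pool.map(update_block, blocks))
            new_values = [v for block_values in results for v in block_values]
            change = max(abs(n - o) for n, o in zip(new_values, values))
            values = new_values
            if change < tol:
                return values, sweep
    return values, max_sweeps


# Hypothetical two-state example (not taken from the paper).
# State 0: action 0 stays in 0 with reward 1; action 1 moves to 1 with reward 0.
# State 1: a single action stays in 1 with reward 2.
example_rewards = [[1.0, 0.0], [2.0]]
example_transitions = [[{0: 1.0}, {1: 1.0}], [{1: 1.0}]]

if __name__ == "__main__":
    values, sweeps = partitioned_value_iteration(
        example_rewards, example_transitions, discount=0.9)
    print(values, sweeps)
```

With a discount factor of 0.9 the example can be checked by hand: state 1 is worth 2 / (1 - 0.9) = 20, and state 0 is worth max(1 / (1 - 0.9), 0.9 * 20) = 18, so the optimal policy moves from state 0 to state 1. Because every block reads the same snapshot, the result does not depend on how many blocks are used. In CPython the threads are unlikely to run faster than a serial loop; they only mark where separate processors would each take a block.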