(Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Normalize DOI.
 
(4 intermediate revisions by 4 users not shown)
Property / DOI
 
Property / DOI: 10.1007/s10479-012-1073-x / rank
Normal rank
 
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s10479-012-1073-x / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2014981566 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Truncated policy iteration methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Contraction Mappings in the Theory Underlying Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Affine Structure and Invariant Policies for Dynamic Programs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Block-successive approximation for a discounted Markov decision model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3313617 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4547434 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Policy Iteration in Stationary Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Modified Policy Iteration Algorithms for Discounted Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Action Elimination Procedures for Modified Policy Iteration Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted Markov games: Generalized policy iteration method / rank
 
Normal rank
Property / cites work
 
Property / cites work: A set of successive approximation methods for discounted Markovian decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4190426 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1007/S10479-012-1073-X / rank
 
Normal rank

Latest revision as of 15:46, 9 December 2024

scientific article
Language Label Description Also known as
English
(Approximate) iterated successive approximations algorithm for sequential decision processes
scientific article

    Statements

    (Approximate) iterated successive approximations algorithm for sequential decision processes (English)
    0 references
    0 references
    0 references
    12 November 2013
    0 references
    sequential decision processes
    0 references
    Markov decision chains
    0 references
    successive approximations
    0 references
    modified policy iteration
    0 references

    Identifiers