(Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Truncated policy iteration methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Contraction Mappings in the Theory Underlying Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Affine Structure and Invariant Policies for Dynamic Programs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Block-successive approximation for a discounted Markov decision model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3313617 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4547434 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Policy Iteration in Stationary Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Modified Policy Iteration Algorithms for Discounted Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Action Elimination Procedures for Modified Policy Iteration Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted Markov games: Generalized policy iteration method / rank
 
Normal rank
Property / cites work
 
Property / cites work: A set of successive approximation methods for discounted Markovian decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4190426 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank

Revision as of 00:48, 7 July 2024

scientific article
Language Label Description Also known as
English
(Approximate) iterated successive approximations algorithm for sequential decision processes
scientific article

    Statements

    (Approximate) iterated successive approximations algorithm for sequential decision processes (English)
    0 references
    0 references
    0 references
    12 November 2013
    0 references
    sequential decision processes
    0 references
    Markov decision chains
    0 references
    successive approximations
    0 references
    modified policy iteration
    0 references

    Identifiers