A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes (Q1266643): Difference between revisions

From MaRDI portal
 
Property / cites work
 
Property / cites work: Truncated policy iteration methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4193284 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Accelerating Procedures of the Value Iteration Algorithm for Discounted Markov Decision Processes, Based on a One-Step Lookahead Analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4284156 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Branch-and-Bound Strategies for Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4190426 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A set of successive approximation methods for discounted Markovian decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computing Optimal Policies for Controlled Tandem Queueing Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete versions of an algorithm due to Varaiya / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Technical Note—Accelerated Computation of the Expected Discounted Return in a Markov Chain / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bounds and Transformations for Discounted Finite Markov Decision Chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Modified Policy Iteration Algorithms for Discounted Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Action Elimination Procedures for Modified Policy Iteration Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: The convergence of value iteration in discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Iterative solution of the functional equations of undiscounted Markov renewal programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: A simple technique in Markovian control with applications to resource allocation in communication networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computational comparison of value iteration algorithms for discounted Markov decision processes / rank
 
Normal rank

Latest revision as of 16:36, 28 May 2024

Language: English
Label: A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes
Description: scientific article

    Statements

    A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes (English)
    31 May 1999
    modified policy iteration
    adaptive relaxation factor
    look-ahead analysis
    value iteration
    discounted and undiscounted Markov decision processes