The method of value oriented successive approximations for the average reward Markov decision process (Q1144501): Difference between revisions

From MaRDI portal
Import240304020342: Set profile property.
ReferenceBot: Changed an Item
Property / cites work (rank: Normal rank for each entry)

    Optimal decision procedures for finite Markov chains. Part II: Communicating systems
    Q3245701
    Q3251743
    Technical Note—Bounds on the Gain of a Markov Decision Process
    Technical Note—The Method of Successive Approximations and Markovian Decision Problems
    Q3266141
    Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations
    Discounting, Ergodicity and Convergence for Markov Decision Processes
    A set of successive approximation methods for discounted Markovian decision problems
    Q4190426
    On Finding the Maximal Gain for Markov Decision Processes
    Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming
    Some Bounds for Discounted Sequential Decision Processes
    Iterative solution of the functional equations of undiscounted Markov renewal programming
    The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems
    Geometric convergence of value-iteration in multichain Markov decision problems
    A successive approximation algorithm for an undiscounted Markov decision process
    Dynamic programming, Markov chains, and the method of successive approximations

Revision as of 10:21, 13 June 2024

scientific article
Language: English
Label: The method of value oriented successive approximations for the average reward Markov decision process
Description: scientific article

    Statements

    The method of value oriented successive approximations for the average reward Markov decision process (English)
    1980
    value oriented successive approximations
    average reward
    finite state space
    finite action space
    almost optimal solutions
    convergence