Variational characterizations in Markov decision processes (Q1077334)

From MaRDI portal
Property / author: Awi Federgruen
Property / reviewed by: Howard J. Weiner
Property / MaRDI profile type: Publication
Property / cites work:
    Contraction Mappings in the Theory Underlying Dynamic Programming
    Markov Renewal Programs with Small Interest Rates
    Multichain Markov Renewal Programs
    Q3206684
    Successive Approximation Methods for Solving Nested Functional Equations in Markov Decision Problems
    Technical Note—Elimination of Suboptimal Actions in Markov Decision Problems
    Technical Note—Bounds on the Gain of a Markov Decision Process
    Note—A Test for Nonoptimal Actions in Undiscounted Finite Markov Decision Chains
    Tests for Suboptimal Actions in Discounted Markov Programming
    Linear Programming and Markov Decision Chains
    Q3266141
    Markov-Renewal Programming. I: Formulation, Finite Return Models
    Potentials for denumerable Markov chains
    A modified dynamic programming method for Markovian decision problems
    Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
    Discrete Dynamic Programming with a Small Interest Rate
    Q5821083
    On Finding the Maximal Gain for Markov Decision Processes
    Some Bounds for Discounted Sequential Decision Processes
    Action Elimination Procedures for Modified Policy Iteration Algorithms
    On the solvability of Bellman's functional equation for a Markovian decision process
    Perturbation theory and finite Markov chains
    Iterative solution of the functional equations of undiscounted Markov renewal programming
    On the solvability of Bellman's functional equations for Markov renewal programming
    The Functional Equations of Undiscounted Markov Renewal Programming
    Geometric convergence of value-iteration in multichain Markov decision problems
    Discrete Dynamic Programming with Sensitive Discount Optimality Criteria
    Q3908795
    A UNIFIED APPROACH TO ALGORITHMS WITH A SUBOPTIMALITY TEST IN DISCOUNTED SEMI-MARKOV DECISION PROCESSES

Latest revision as of 13:44, 17 June 2024

Language: English
Label: Variational characterizations in Markov decision processes
Description: scientific article

    Statements

    Variational characterizations in Markov decision processes (English)
    1986
    Variational characterizations and bounds for the solutions of systems of functional equations in discounted and undiscounted semi-Markov decision processes are obtained. Such upper and lower bounds can be used to measure the deviation of the current solution from optimality. The variational characterizations suggest numerical algorithms.
    Variational characterizations
    discounted and undiscounted semi-Markov decision processes
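
    The summary above says that upper and lower bounds can measure how far a current solution is from optimality. As a rough illustration of that idea only, the sketch below applies the classical MacQueen-type bounds to a finite discounted Markov decision process with maximization. It is not the construction of the article, which treats semi-Markov models and the undiscounted case as well. The function names, the data layout (P[a][s][t], r[a][s]) and the two-state example are assumptions made for this sketch.

```python
def bellman_update(P, r, beta, v):
    """One value-iteration step: (Tv)(s) = max_a [ r[a][s] + beta * sum_t P[a][s][t] * v[t] ]."""
    n = len(v)
    return [
        max(
            r[a][s] + beta * sum(P[a][s][t] * v[t] for t in range(n))
            for a in range(len(P))
        )
        for s in range(n)
    ]


def macqueen_bounds(P, r, beta, v):
    """Return (lower, upper) vectors that bracket the optimal value function.

    With d = Tv - v, the optimal value v* satisfies
    Tv + beta/(1-beta) * min(d) <= v* <= Tv + beta/(1-beta) * max(d).
    """
    tv = bellman_update(P, r, beta, v)
    diffs = [x - y for x, y in zip(tv, v)]
    scale = beta / (1.0 - beta)
    lower = [x + scale * min(diffs) for x in tv]
    upper = [x + scale * max(diffs) for x in tv]
    return lower, upper


# Hypothetical two-state, two-action model (illustrative numbers only).
P = [
    [[0.9, 0.1], [0.2, 0.8]],  # transition matrix under action 0
    [[0.5, 0.5], [0.6, 0.4]],  # transition matrix under action 1
]
r = [[1.0, 0.0], [0.5, 2.0]]   # r[a][s]: one-step reward of action a in state s
beta = 0.9                      # discount factor, must satisfy 0 <= beta < 1

v = [0.0, 0.0]
for _ in range(200):
    lower, upper = macqueen_bounds(P, r, beta, v)
    # The gap between the bounds is a computable measure of suboptimality.
    if max(u - l for u, l in zip(upper, lower)) < 1e-6:
        break
    v = bellman_update(P, r, beta, v)
```

    The gap between `upper` and `lower` gives a stopping rule that needs no knowledge of the optimal value function, which is the practical use of such bounds in iterative algorithms.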

    Identifiers