Exact decomposition approaches for Markov decision processes: a survey (Q606196): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(5 intermediate revisions by 4 users not shown)
Property / author
 
Property / author: Mohammed Abbad / rank
Normal rank
 
Property / author
 
Property / author: Mohammed Abbad / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q58650427 / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2091387107 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3262596 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4144754 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Calculating availability and performability measures of repairable computer systems using randomization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bounds for the Positive Eigenvectors of Nonnegative Matrices and for their Approximations by Decomposition / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3321201 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision processes with exponentially representable discounting / rank
 
Normal rank
Property / cites work
 
Property / cites work: On some algorithms for limiting average Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4427313 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3174169 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3135096 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample-path optimality and variance-maximization for Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average optimality for continuous-time Markov decision processes with a policy iteration approach / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Decision Processes with Variance Minimization: A New Condition and Approach / rank
 
Normal rank
Property / cites work
 
Property / cites work: An improved algorithm for solving communicating average reward Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Planning and acting in partially observable stochastic domains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3960718 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4699290 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Complexity of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms for aggregated limiting average Markov decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Hierarchical algorithms for discounted and weighted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A decomposition algorithm for limiting average Markov decision problems. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal decision procedures for finite Markov chains. Part III: General convex systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach / rank
 
Normal rank
Property / cites work
 
Property / cites work: Abstraction and approximate decision-theoretic planning. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4506458 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3241581 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Using Expectation-Maximization for Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4527272 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5630824 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Decomposition Principle for Linear Programs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Decomposition of systems governed by Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite state Markovian decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4739658 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Weighted reward criteria in Competitive Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5463020 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A decomposition approach for undiscounted two-person zero-sum stochastic games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4434179 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3807647 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Planning in a hierarchy of abstraction spaces / rank
 
Normal rank
Property / cites work
 
Property / cites work: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4258591 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3666564 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3998396 / rank
 
Normal rank

Latest revision as of 12:10, 3 July 2024

scientific article
Language Label Description Also known as
English
Exact decomposition approaches for Markov decision processes: a survey
scientific article

    Statements

    Exact decomposition approaches for Markov decision processes: a survey (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    16 November 2010
    0 references
    Summary: As classical methods are intractable for solving Markov decision processes (MDPs) requiring a large state space, decomposition and aggregation techniques are very useful to cope with large problems. These techniques are in general a special case of the classic Divide-and-Conquer framework to split a large, unwieldy problem into smaller components and solving the parts in order to construct the global solution. This paper reviews most of decomposition approaches encountered in the associated literature over the past two decades, weighing their pros and cons. We consider several categories of MDPs (average, discounted, and weighted MDPs), and we present briefly a variety of methodologies to find or approximate optimal strategies.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references