Exact decomposition approaches for Markov decision processes: a survey
From MaRDI portal
Recommendations
- A decomposition algorithm for limiting average Markov decision problems.
- Algorithms for aggregated limiting average Markov decision problems
- scientific article; zbMATH DE number 3961379
- The control of a two-level Markov decision process by time aggregation
- A methodology for computation reduction for specially structured large scale Markov decision problems
Cites work
- scientific article; zbMATH DE number 420890
- scientific article; zbMATH DE number 5957504
- scientific article; zbMATH DE number 3126094
- scientific article; zbMATH DE number 3145626
- scientific article; zbMATH DE number 3148886
- scientific article; zbMATH DE number 3852171
- scientific article; zbMATH DE number 3819432
- scientific article; zbMATH DE number 4076680
- scientific article; zbMATH DE number 3783030
- scientific article; zbMATH DE number 47262
- scientific article; zbMATH DE number 3574935
- scientific article; zbMATH DE number 1335900
- scientific article; zbMATH DE number 700091
- scientific article; zbMATH DE number 1983335
- scientific article; zbMATH DE number 2000828
- scientific article; zbMATH DE number 1509479
- scientific article; zbMATH DE number 1560499
- scientific article; zbMATH DE number 1361472
- scientific article; zbMATH DE number 3793773
- scientific article; zbMATH DE number 2189770
- scientific article; zbMATH DE number 3356467
- A decomposition algorithm for limiting average Markov decision problems.
- A decomposition approach for undiscounted two-person zero-sum stochastic games
- Abstraction and approximate decision-theoretic planning.
- Algorithms for aggregated limiting average Markov decision problems
- An improved algorithm for solving communicating average reward Markov decision processes
- Average optimality for continuous-time Markov decision processes with a policy iteration approach
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Bounds for the Positive Eigenvectors of Nonnegative Matrices and for their Approximations by Decomposition
- Calculating availability and performability measures of repairable computer systems using randomization
- Decomposition Principle for Linear Programs
- Decomposition of systems governed by Markov chains
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Finite state Markovian decision processes
- Hierarchical algorithms for discounted and weighted Markov decision processes
- Markov Decision Processes with Variance Minimization: A New Condition and Approach
- Markov decision processes with exponentially representable discounting
- Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach
- On some algorithms for limiting average Markov decision processes
- Optimal decision procedures for finite Markov chains. Part III: General convex systems
- Planning and acting in partially observable stochastic domains
- Planning in a hierarchy of abstraction spaces
- Sample-path optimality and variance-maximization for Markov decision processes
- The Complexity of Markov Decision Processes
- Using Expectation-Maximization for Reinforcement Learning
- Weighted reward criteria in Competitive Markov Decision Processes
Cited in (11)
- Approximated timed reachability graphs for the robust control of discrete event systems
- Time aggregated Markov decision processes via standard dynamic programming
- Decomposable Markov decision processes: A fluid optimization approach
- Aggregation of the policy iteration method for nearly completely decomposable Markov chains
- Temporal concatenation for Markov decision processes
- Algorithms for aggregated limiting average Markov decision problems
- A decomposition algorithm for limiting average Markov decision problems.
- A new parallelized of hierarchical value iteration algorithm for discounted Markov decision processes
- On some algorithms for limiting average Markov decision processes
- Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach
- scientific article; zbMATH DE number 7529534