Exact decomposition approaches for Markov decision processes: a survey

DOI: 10.1155/2010/659432; MaRDI QID: Q606196

Authors Cherki Daoui, Mohamed Tkiouat, Mohammed Abbad

Publication date 16 November 2010

Published in Advances in Operations Research

Full work available at URL https://eudml.org/doc/227280

Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02); Markov and semi-Markov decision processes (90C40)

Recommendations

A decomposition algorithm for limiting average Markov decision problems.
Algorithms for aggregated limiting average Markov decision problems
scientific article; zbMATH DE number 3961379
The control of a two-level Markov decision process by time aggregation
A methodology for computation reduction for specially structured large scale Markov decision problems

Cites work

scientific article; zbMATH DE number 420890
scientific article; zbMATH DE number 5957504
scientific article; zbMATH DE number 3126094
scientific article; zbMATH DE number 3145626
scientific article; zbMATH DE number 3148886
scientific article; zbMATH DE number 3852171
scientific article; zbMATH DE number 3819432
scientific article; zbMATH DE number 4076680
scientific article; zbMATH DE number 3783030
scientific article; zbMATH DE number 47262
scientific article; zbMATH DE number 3574935
scientific article; zbMATH DE number 1335900
scientific article; zbMATH DE number 700091
scientific article; zbMATH DE number 1983335
scientific article; zbMATH DE number 2000828
scientific article; zbMATH DE number 1509479
scientific article; zbMATH DE number 1560499
scientific article; zbMATH DE number 1361472
scientific article; zbMATH DE number 3793773
scientific article; zbMATH DE number 2189770
scientific article; zbMATH DE number 3356467
A decomposition algorithm for limiting average Markov decision problems.
A decomposition approach for undiscounted two-person zero-sum stochastic games
Abstraction and approximate decision-theoretic planning.
Algorithms for aggregated limiting average Markov decision problems
An improved algorithm for solving communicating average reward Markov decision processes
Average optimality for continuous-time Markov decision processes with a policy iteration approach
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
Bounds for the Positive Eigenvectors of Nonnegative Matrices and for their Approximations by Decomposition
Calculating availability and performability measures of repairable computer systems using randomization
Decomposition Principle for Linear Programs
Decomposition of systems governed by Markov chains
Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
Finite state Markovian decision processes
Hierarchical algorithms for discounted and weighted Markov decision processes
Markov Decision Processes with Variance Minimization: A New Condition and Approach
Markov decision processes with exponentially representable discounting
Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach
On some algorithms for limiting average Markov decision processes
Optimal decision procedures for finite Markov chains. Part III: General convex systems
Planning and acting in partially observable stochastic domains
Planning in a hierarchy of abstraction spaces
Sample-path optimality and variance-maximization for Markov decision processes
The Complexity of Markov Decision Processes
Using Expectation-Maximization for Reinforcement Learning
Weighted reward criteria in Competitive Markov Decision Processes

Cited in (11)

Approximated timed reachability graphs for the robust control of discrete event systems
Time aggregated Markov decision processes via standard dynamic programming
Decomposable Markov decision processes: A fluid optimization approach
Aggregation of the policy iteration method for nearly completely decomposable Markov chains
Temporal concatenation for Markov decision processes
Algorithms for aggregated limiting average Markov decision problems
A decomposition algorithm for limiting average Markov decision problems.
A new parallelized of hierarchical value iteration algorithm for discounted Markov decision processes
On some algorithms for limiting average Markov decision processes
Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach
scientific article; zbMATH DE number 7529534
