Markov decision processes
From MaRDI portal
Publication:5904001
DOI10.1016/0377-2217(89)90348-2zbMath0677.90086MaRDI QIDQ5904001
Douglas J. White, Chelsea C. III White
Publication date: 1989
Published in: European Journal of Operational Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0377-2217(89)90348-2
introduction; adaptive; multiobjective; discrete event dynamic systems; constrained models; semi-Markov; partially observed
Related Items
Continuous time shock markov decision processes with discounted criterion, Unnamed Item, An Heuristic for Multi-Dimensional Markov Decision Processes, A survey of solution techniques for the partially observed Markov decision process, Optimal recovery strategies for manufacturing systems, Optimal cost and policy for a Markovian replacement problem, Sequential process control under capacity constraints., Multiaction maintenance subject to action-dependent risk and stochastic failure, A multi-period TSP with stochastic regular and urgent demands, On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds
- Reward revision and the average reward Markov decision process
- Optimality and efficiency. I
- Stochastic optimal control. The discrete time case
- Multi-objective infinite-horizon discounted Markov decision processes
- Infinite horizon Markov decision processes with unknown or variable discount factors
- Mean, variance and probabilistic criteria in finite Markov decision processes: A review
- Dynamic programming, Markov chains, and the method of successive approximations
- Sufficient statistics in the optimum control of stochastic systems
- A modified dynamic programming method for Markovian decision problems
- Finite state Markovian decision processes
- Vector-Valued Dynamic Programming
- Iterative Aggregation-Disaggregation Procedures for Discounted Semi-Markov Reward Processes
- Reward Revision for Discounted Markov Decision Problems
- Parameter Imprecision in Finite State, Finite Action Dynamic Programs
- Performance evaluation and perturbation analysis of discrete event dynamic systems
- Suboptimal Design for Large Scale, Multimodule Systems
- Action Elimination Procedures for Modified Policy Iteration Algorithms
- An Iterative Aggregation Procedure for Markov Decision Processes
- State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
- Convergence of Dynamic Programming Models
- On the Optimality of Myopic Policies in Sequential Decision Problems
- Bounds and Transformations for Discounted Finite Markov Decision Chains
- Minimizing a Submodular Function on a Lattice
- Sequential Decision Problems with Expected Utility Criteria. III: Upper and Lower Transience
- Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
- Approximations of Dynamic Programs, I
- The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems
- Approximations of Dynamic Programs, II
- A Survey of Applications of Markov Decision Processes
- Markov Decision Processes with Imprecise Transition Probabilities
- Discounted Dynamic Programming
- Contraction Mappings in the Theory Underlying Dynamic Programming
- Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
- On Finding the Maximal Gain for Markov Decision Processes
- Some Bounds for Discounted Sequential Decision Processes
- Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation