A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
From MaRDI portal
Publication:705478
DOI10.1016/J.AUTOMATICA.2004.05.003zbMATH Open1059.90141OpenAlexW2051442802MaRDI QIDQ705478FDOQ705478
Publication date: 31 January 2005
Published in: Automatica (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.automatica.2004.05.003
Recommendations
- A unified approach to Markov decision problems and performance sensitivity analysis
- scientific article; zbMATH DE number 1944273
- A unified approach to time-aggregated Markov decision processes
- scientific article; zbMATH DE number 2159039
- From perturbation analysis to Markov decision processes and reinforcement learning
Sensitivity (robustness) (93B35) Markov and semi-Markov decision processes (90C40) Optimal stochastic control (93E20)
Cites Work
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- Discrete Dynamic Programming
- Single sample path-based optimization of Markov chains
- Simulation-based optimization of Markov reward processes
- CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
- The Maclaurin series for performance functions of Markov chains
- Title not available (Why is that?)
- On Finding Optimal Policies in Discrete Dynamic Programming with No Discounting
- Title not available (Why is that?)
- Limiting average criteria for nonstationary Markov decision processes
- Realization probabilities. The dynamics of queuing systems
- From perturbation analysis to Markov decision processes and reinforcement learning
- A unified approach to Markov decision problems and performance sensitivity analysis
- Minimax control for discrete-time time-varying stochastic systems
- A note on policy algorithms for discounted Markov decision problems
- A Brouwer fixed-point mapping approach to communicating Markov decision processes
- Foolproof convergence in multichain policy iteration
- The relations among potentials, perturbation analysis, and Markov decision processes
- A Fixed Point Approach to Undiscounted Markov Renewal Programs
Cited In (10)
- Bias optimality for multichain continuous-time Markov decision processes
- Performance analysis of AIMD mechanisms over a multi-state Markovian path
- Temporal difference-based policy iteration for optimal control of stochastic systems
- Basic ideas for event-based optimization of Markov systems
- Title not available (Why is that?)
- Continuous-time Markov decision processes with \(n\)th-bias optimality criteria
- Title not available (Why is that?)
- Optimization of a special case of continuous-time Markov decision processes with compact action set
- Completion-of-squares: revisited and extended
- The risk probability criterion for discounted continuous-time Markov decision processes
This page was built for publication: A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q705478)