Single sample path-based optimization of Markov chains
From MaRDI portal
Recommendations
Cites work
- scientific article; zbMATH DE number 3906232 (Why is no real title available?)
- scientific article; zbMATH DE number 700091 (Why is no real title available?)
- scientific article; zbMATH DE number 1095138 (Why is no real title available?)
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Feature-based methods for large scale dynamic programming
- Likelihood ratio gradient estimation for stochastic recursions
- Markov chains and stochastic stability
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- Perturbation theory and finite Markov chains
- Sample-path optimization of convex stochastic performance functions
- Stochastic optimization of regenerative systems using infinitesimal perturbation analysis
- The policy iteration algorithm for average reward Markov decision processes with general state space
- Using the QR Factorization and Group Inversion to Compute, Differentiate, and Estimate the Sensitivity of Stationary Probabilities for Markov Chains
Cited in
(11)- scientific article; zbMATH DE number 1961138 (Why is no real title available?)
- Generalized estimates for performance sensitivities of stochastic systems
- Potential-based least-squares policy iteration for a parameterized feedback control system
- Temporal difference-based policy iteration for optimal control of stochastic systems
- Computing MDP cost function for high speed networks with sample-path and quantization
- A time aggregation approach to Markov decision processes
- Basic ideas for event-based optimization of Markov systems
- The control of a two-level Markov decision process by time aggregation
- scientific article; zbMATH DE number 3901881 (Why is no real title available?)
- How to optimize discrete-event systems from a single sample path by the score function method
- A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
This page was built for publication: Single sample path-based optimization of Markov chains
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1289394)