Coupling based estimation approaches for the average reward performance potential in Markov chains
Publication:1796998
DOI: 10.1016/j.automatica.2018.03.011; zbMath: 1414.60061; OpenAlex: W2794916412; Wikidata: Q130046852; Scholia: Q130046852; MaRDI QID: Q1796998
Jiangang Li, Yunjiang Lou, Haoyao Chen, Xin-Yu Wu, Yan-Jie Li
Publication date: 17 October 2018
Published in: Automatica
Full work available at URL: https://doi.org/10.1016/j.automatica.2018.03.011
Keywords: value function; performance potential; coupling techniques; estimation with geometric variance reduction; perturbation realization factor
Classification: Estimation and detection in stochastic control theory (93E10); Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)
Cites Work
- A unified approach to time-aggregated Markov decision processes
- Performance optimization of queueing systems with perturbation realization
- Parameterized Markov decision process and its application to service rate control
- Performance analysis for controlled semi-Markov systems with application to maintenance
- Simulation-based algorithms for Markov decision processes
- Continuous-time Markov decision processes. Theory and applications
- Regenerative structure of Markov chains simulated via common random numbers
- Bivariate distributions with given marginals
- Adaptive importance sampling on discrete Markov chains
- Sequential Monte Carlo techniques for the solution of linear systems
- Stability analysis of a class of hybrid stochastic retarded systems under asynchronous switching
- Error bounds of optimization algorithms for semi-Markov decision processes
- A simple coupling of renewal processes
- A maximal coupling for Markov chains
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- Convergence of simulation-based policy iteration
- Perturbation analysis via coupling
- Simulation-based optimization of Markov reward processes
- On solving event-based optimization with average reward over infinite stages
- Sequential control variates for functionals of Markov processes