Sample-path optimality in average Markov decision chains under a double Lyapunov function condition
Publication:4593600
DOI: 10.1007/978-0-8176-8337-5_3
zbMATH Open: 1374.90400
OpenAlex: W159673498
MaRDI QID: Q4593600
FDO: Q4593600
Authors: Rolando Cavazos-Cadena, Raúl Montes-de-Oca
Publication date: 22 November 2017
Published in: Optimization, Control, and Applications of Stochastic Systems
Full work available at URL: https://doi.org/10.1007/978-0-8176-8337-5_3
Recommendations
- Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion
- A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
- Denumerable controlled Markov chains with average reward criterion: Sample path optimality
- Sample-path average optimality for Markov control processes
- Sample path optimality for a Markov optimization problem
Cited In (3)
- Another set of conditions for Markov decision processes with average sample-path costs
- A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
- Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion