Sample-path optimality in average Markov decision chains under a double Lyapunov function condition
From MaRDI portal
Publication:4593600
Recommendations
- Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion
- A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
- Denumerable controlled Markov chains with average reward criterion: Sample path optimality
- Sample-path average optimality for Markov control processes
- Sample path optimality for a Markov optimization problem
Cited in
(3)- Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion
- Another set of conditions for Markov decision processes with average sample-path costs
- A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
This page was built for publication: Sample-path optimality in average Markov decision chains under a double Lyapunov function condition
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4593600)