scientific article; zbMATH DE number 1786117
From MaRDI portal
Publication:4547437
Recommendations
- Existence of optimal stationary policies in average reward Markov decision processes with a recurrent state
- Average Optimality in Dynamic Programming with General State Space
- scientific article; zbMATH DE number 94713
- Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards
- An expected average reward criterion
Cited in
(7)- Average optimality for Markov decision processes in borel spaces: a new condition and approach
- A Weighted Markov Decision Process
- The Optimal Reward Operator in Negative Dynamic Programming
- A semimartingale characterization of average optimal stationary policies for Markov decision processes
- Another set of conditions for Markov decision processes with average sample-path costs
- Another set of verifiable conditions for average Markov decision processes with Borel spaces.
- Sample-path optimality and variance-maximization for Markov decision processes
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4547437)