Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards
From MaRDI portal
Publication:1974589
Cited in (14)
- Markov decision processes with state-dependent discount factors and unbounded rewards/costs
- Blackwell optimality in the class of Markov policies for continuous-time controlled Markov chains
- Blackwell optimal policies in a Markov decision process with a Borel state space
- First passage optimality and variance minimisation of Markov decision processes with varying discount factors
- A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder)
- Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards
- Markov control models with unknown random state-action-dependent discount factors
- Optimality of mixed policies for average continuous-time Markov decision processes with constraints
- scientific article; zbMATH DE number 7232788
- Ergodic Control, Bias, and Sensitive Discount Optimality for Markov Diffusion Processes
- Are limits of α-discounted optimal policies Blackwell optimal? A counterexample
- Characterization and sufficient conditions for normed ergodicity of Markov chains
- Strong bounds on perturbations
- Average optimality for Markov decision processes in Borel spaces: a new condition and approach