scientific article; zbMATH DE number 4170671
From MaRDI portal
Publication:3496174
Recommendations
- scientific article; zbMATH DE number 4003943
- scientific article; zbMATH DE number 3898637
- scientific article; zbMATH DE number 1536370
- Publication:3496172
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- On discounted dynamic programming with unbounded returns
- Discounted and average Markov decision processes with unbounded rewards: New conditions
- Structures of optimal policies in MDPs with unbounded jumps: the state of our art
- Strong 0-discount optimal policies in a Markov decision process with a Borel state space
- scientific article; zbMATH DE number 877674
Cited in
(9)- Existence of optimal stationary policies in discounted Markov decision processes: Approaches by occupation measures
- scientific article; zbMATH DE number 4003943 (Why is no real title available?)
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- scientific article; zbMATH DE number 3898637 (Why is no real title available?)
- On the properties of ( 0) optimal policies in discounted unbounded return model
- PAC Bounds for Discounted MDPs
- scientific article; zbMATH DE number 4158414 (Why is no real title available?)
- scientific article; zbMATH DE number 4170670 (Why is no real title available?)
- Bounded Parameter Markov Decision Processes with Average Reward Criterion
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3496174)