scientific article; zbMATH DE number 4170671
From MaRDI portal
Publication:3496174
zbMATH Open0711.90084MaRDI QIDQ3496174FDOQ3496174
Authors: Jinjong Zhan, Yun Zhang
Publication date: 1987
Title of this publication is not available (Why is that?)
Recommendations
- scientific article; zbMATH DE number 4003943
- scientific article; zbMATH DE number 3898637
- scientific article; zbMATH DE number 1536370
- Publication:3496172
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- On discounted dynamic programming with unbounded returns
- Discounted and average Markov decision processes with unbounded rewards: New conditions
- Structures of optimal policies in MDPs with unbounded jumps: the state of our art
- Strong 0-discount optimal policies in a Markov decision process with a Borel state space
- scientific article; zbMATH DE number 877674
Cited In (9)
- Title not available (Why is that?)
- Existence of optimal stationary policies in discounted Markov decision processes: Approaches by occupation measures
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- Title not available (Why is that?)
- On the properties of \(\epsilon\) (\(\geq 0)\) optimal policies in discounted unbounded return model
- PAC Bounds for Discounted MDPs
- Title not available (Why is that?)
- Title not available (Why is that?)
- Bounded Parameter Markov Decision Processes with Average Reward Criterion
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3496174)