MDPs with setwise continuous transition probabilities
From MaRDI portal
Publication:2060367
DOI10.1016/J.ORL.2021.07.011OpenAlexW3190303348MaRDI QIDQ2060367FDOQ2060367
Authors: Pavlo O. Kasyanov, Eugene A. Feinberg
Publication date: 13 December 2021
Published in: Operations Research Letters (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2011.01325
Recommendations
- Average cost Markov decision processes with weakly continuous transition probabilities
- On a set of optimal policies in continuous time Markovian decision problem
- On some continuous time discounted Markov decision process.
- Continuous-Time Markov Decision Processes with Discounted Rewards: The Case of Polish Spaces
- scientific article; zbMATH DE number 700091
Cites Work
- Title not available (Why is that?)
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Measurable selections of extrema
- Sufficient Classes of Strategies in Discrete Dynamic Programming I: Decomposition of Randomized Strategies and Embedded Models
- Average Optimality in Dynamic Programming with General State Space
- Title not available (Why is that?)
- On Stationary Strategies in Borel Dynamic Programming
- Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls
- Average cost Markov decision processes with weakly continuous transition probabilities
- Examples concerning Abel and Cesàro limits
- Negative Dynamic Programming
- Partially observable total-cost Markov decision processes with weakly continuous transition probabilities
- Optimality Inequalities for Average Cost Markov Decision Processes and the Stochastic Cash Balance Problem
- Berge's theorem for noncompact image sets
- Optimal Plans for Dynamic Programming Problems
- Measurable selection theorems for optimization problems
- Measurable Selection and Dynamic Programming
- Title not available (Why is that?)
- Stationary policies and Markov policies in Borel dynamic programming
- Title not available (Why is that?)
- On convergence of value iteration for a class of total cost Markov decision processes
- Title not available (Why is that?)
- Sufficiency of deterministic policies for atomless discounted and uniformly absorbing MDPs with multiple criteria
- Fatou's lemma in its classical form and Lebesgue's convergence theorems for varying measures with applications to Markov decision processes
Cited In (4)
This page was built for publication: MDPs with setwise continuous transition probabilities
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2060367)