Markov decision processes with state-dependent discount factors and unbounded rewards/costs
From MaRDI portal
Publication:408405
DOI10.1016/j.orl.2011.06.014zbMath1235.90178OpenAlexW2055647224MaRDI QIDQ408405
Publication date: 5 April 2012
Published in: Operations Research Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.orl.2011.06.014
Markov decision processesoptimal value functionoptimal stationary policystate-dependent discount factorsunbounded costs/rewards
Related Items
Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach, First passage problems for nonstationary discrete-time stochastic control systems, First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors, Certified reinforcement learning with logic guidance, Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors, Delay-Minimizing Capacity Allocation in an Infinite Server-Queueing System, An average-value-at-risk criterion for Markov decision processes with unbounded costs, Discrete-time control with non-constant discount factor, Zero-sum Markov games with random state-actions-dependent discount factors: existence of optimal strategies, Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors, Deep reinforcement learning with temporal logics, Convergence of Markov decision processes with constraints and state-action dependent discount factors, A mean field absorbing control model for interacting objects systems, Semi-Markov decision processes with variance minimization criterion, Zero-sum semi-Markov games with state-action-dependent discount factors, First passage Markov decision processes with constraints and varying discount factors
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Markov decision processes with exponentially representable discounting
- A convex analytic approach to Markov decision processes
- Discounted cost Markov decision processes on Borel spaces: The linear programming formulation
- Discounting the distant future: How much do uncertain rates increase valuations?
- Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards
- Markov control processes with randomized discounted cost
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Markov Decision Models with Weighted Discounted Criteria