Value iteration in average cost Markov control processes on Borel spaces
From MaRDI portal
Publication:1906804
DOI10.1007/BF00047169zbMath0843.93093MaRDI QIDQ1906804
Onésimo Hernández-Lerma, Raúl Montes-De-oca
Publication date: 6 February 1996
Published in: Acta Applicandae Mathematicae (Search for Journal in Brave)
Related Items
Approximate dynamic programming for stochastic linear control problems on compact state spaces ⋮ Approximation of average cost optimal policies for general Markov decision processes with unbounded costs ⋮ A pause control approach to the value iteration scheme in average Markov decision processes ⋮ Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion ⋮ Open Problem—Convergence and Asymptotic Optimality of the Relative Value Iteration in Ergodic Control
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Value iteration in countable state average cost Markov decision processes with unbounded costs
- Discretization procedures for adaptive Markov control processes
- On optimality criteria for dynamic programs with long finite horizons
- Average cost Markov decision processes: Optimality conditions
- Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls
- Measurable selection theorems for optimization problems
- Value iteration and rolling plans for Markov control processes with unbounded rewards
- The average cost optimality equation for Markov control processes on Borel spaces
- Adaptive Markov control processes
- Invariant probabilities for Feller-Markov chains
- Dynamic programming, Markov chains, and the method of successive approximations
- Recurrence conditions for Markov decision processes with Borel state space: A survey
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
- SUR UN MODÉLE AUTORÉGRESSIF NON LINÉAIRE, ERGODICITÉ ET ERGODICITÉ GÉOMÉTRIQUE
- On a Class of Strategies in General Markov Decision Models
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces—Unbounded Costs
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Average Optimality in Dynamic Programming with General State Space
- Average cost Markov control processes with weighted norms: existence of canonical policies