Nonstationary value-iteration and adaptive control of discounted semi- Markov processes
From MaRDI portal
Publication:1068732
DOI10.1016/0022-247X(85)90253-7zbMath0581.90096MaRDI QIDQ1068732
Publication date: 1985
Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0022-247x(85)90253-7
semi-Markov decision processes; unknown parameters; nonstationary value iteration; denumerable state space; adaptive control of semi-Markov processes; discounted-reward; principle of estimation and control
90C40: Markov and semi-Markov decision processes
Related Items
Adaptive discounted control for piecewise deterministic Markov processes, Performance analysis for controlled semi-Markov systems with application to maintenance, Finite-state approximations for denumerable multidimensional state discounted Markov decision processes, Density estimation and adaptive control of Markov processes: Average and discounted criteria, Unnamed Item
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Adaptive control of service in queueing systems
- Adaptive control of discounted Markov decision chains
- Dynamic programming and stochastic control
- Nonstationary Markov decision problems with converging parameters
- The average-optimal adaptive control of a Markov renewal model in presence of an unknown parameter
- Estimation and control in discounted stochastic dynamic programming
- Strongly consistent estimation in a controlled Markov renewal model
- On Dynamic Programming with Unbounded Rewards
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Estimation and control in Markov chains