Density estimation and adaptive control of Markov processes: Average and discounted criteria
DOI: 10.1007/BF00049572, zbMath: 0717.93066, MaRDI QID: Q2639029
Rolando Cavazos-Cadena, Onésimo Hernández-Lerma
Publication date: 1990
Published in: Acta Applicandae Mathematicae
adaptive control; density estimation; nonstationary value iteration; discrete-time Markov control processes; principle of estimation and control; unknown distribution; average-reward criterion; nonparametric adaptive policies
Nonparametric estimation (62G05); Estimation and detection in stochastic control theory (93E10); Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40)
Related Items (12)
Cites Work
- Adaptive control of discounted Markov decision chains
- Dynamic programming and stochastic control
- Nonparametric adaptive control of discounted stochastic systems with compact state space
- Nonstationary value-iteration and adaptive control of discounted semi-Markov processes
- Finite-state approximations for denumerable state discounted Markov decision processes
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Continuous dependence of stochastic control models on the noise distribution
- Stochastic optimal control. The discrete time case
- Conditions for the equivalence of optimality criteria in dynamic programming
- Adaptive Markov control processes
- Adaptive Strategies for Certain Classes of Controlled Markov Processes
- Estimation and control in discounted stochastic dynamic programming
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Optimal Plans for Dynamic Programming Problems
- Estimation and control in Markov chains
- On Two Recent Papers on Ergodicity in Nonhomogeneous Markov Chains
- Unnamed Item (19 entries)