Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria
From MaRDI portal
Publication:2474553
DOI10.1007/s00186-005-0024-6zbMath1137.93055MaRDI QIDQ2474553
J. Adolfo Minjárez-Sosa, Nadine Hilgert
Publication date: 6 March 2008
Published in: Mathematical Methods of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s00186-005-0024-6
empirical distribution; discrete-time stochastic systems; discounted cost criteria; optimal adaptive policy
60J05: Discrete-time Markov processes on general state spaces
93C40: Adaptive control/observation systems
93C55: Discrete-time control/observation systems
93E10: Estimation and detection in stochastic control theory
93E20: Optimal stochastic control
90C40: Markov and semi-Markov decision processes
Related Items
Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs, Discrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approach, Markov control models with unknown random state-action-dependent discount factors, Empirical estimation in average Markov control processes, Unnamed Item, Adaptive control of diffusion processes with a discounted reward criterion
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Adaptive control of discounted Markov decision chains
- Nonparametric adaptive control of discounted stochastic systems with compact state space
- Measurable selection theorems for optimization problems
- Empirical processes: A survey of results for independent and identically distributed random variables
- A note on the Ross-Taylor theorem
- Adaptive policies for time-varying stochastic systems under discounted criterion
- Limiting optimal discounted-cost control of a class of time-varying stochastic systems
- Adaptive Markov control processes
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
- Estimation and control in discounted stochastic dynamic programming
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Note—A Note on Dynamic Programming with Unbounded Rewards
- Estimation and control in Markov chains
- Uniformity in weak convergence
- Relations between Weak and Uniform Convergence of Measures with Applications