Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
From MaRDI portal
Publication:1099125
DOI10.1016/0167-6911(87)90055-7zbMath0637.93075MaRDI QIDQ1099125
Onésimo Hernández-Lerma, Steven I. Marcus
Publication date: 1987
Published in: Systems \& Control Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0167-6911(87)90055-7
discounted reward criterion; Adaptive control policies; Discrete-time stochastic control systems; driving process; independent and identically distributed (i.i.d.) random elements
93C10: Nonlinear systems in control theory
93C40: Adaptive control/observation systems
93C55: Discrete-time control/observation systems
93E10: Estimation and detection in stochastic control theory
93E20: Optimal stochastic control
60E99: Distribution theory
Related Items
Unnamed Item, Unnamed Item, Nonparametric adaptive control of discounted stochastic systems with compact state space, Continuous dependence of stochastic control models on the noise distribution, Nonparametric adaptive control of discrete-time partially observable stochastic systems, Discretization procedures for adaptive Markov control processes, Adaptive control of constrained Markov chains: Criteria and policies, Nonparametric estimation and adaptive control in a class of finite Markov decision chains, Limiting optimal discounted-cost control of a class of time-varying stochastic systems, Density estimation and adaptive control of Markov processes: Average and discounted criteria, Unnamed Item
Cites Work
- Optimal adaptive control of priority assignment in queueing systems
- Adaptive control of discounted Markov decision chains
- Nonstationary Markov decision problems with converging parameters
- Empirical processes: A survey of results for independent and identically distributed random variables
- Adaptive Strategies for Certain Classes of Controlled Markov Processes
- Estimation and control in discounted stochastic dynamic programming
- Optimal Plans for Dynamic Programming Problems
- Uniformity in weak convergence
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item