scientific article
Publication:3835399
zbMath: 0678.93065; MaRDI QID: Q3835399
Publication date: 1987
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Keywords: Markov decision processes; asymptotic optimality; unknown parameters; adaptive policies; optimal total expected discounted reward
Adaptive control/observation systems (93C40); Estimation and detection in stochastic control theory (93E10); Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40); Stochastic systems in control theory (general) (93E03)
Related Items (4)
Density estimation and adaptive control of Markov processes: Average and discounted criteria ⋮ Discretization procedures for adaptive Markov control processes ⋮ Nonparametric estimation and adaptive control in a class of finite Markov decision chains ⋮ Recursive adaptive control of Markov decision processes with the average reward criterion