scientific article

From MaRDI portal

Publication:3835399

Jump to:navigation, search

zbMath0678.93065MaRDI QIDQ3835399

Rolando Cavazos-Cadena

Publication date: 1987

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

Markov decision processes asymptotic optimality unknown parameters adaptive policies optimal total expected discounted reward

Mathematics Subject Classification ID

Adaptive control/observation systems (93C40) Estimation and detection in stochastic control theory (93E10) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40) Stochastic systems in control theory (general) (93E03)

Related Items (4)

Density estimation and adaptive control of Markov processes: Average and discounted criteria ⋮ Discretization procedures for adaptive Markov control processes ⋮ Nonparametric estimation and adaptive control in a class of finite Markov decision chains ⋮ Recursive adaptive control of Markov decision processes with the average reward criterion

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3835399&oldid=17430575"