Density estimation and adaptive control of Markov processes: Average and discounted criteria

From MaRDI portal

Publication:2639029

Jump to:navigation, search

DOI10.1007/BF00049572zbMath0717.93066MaRDI QIDQ2639029

Rolando Cavazos-Cadena, Onésimo Hernández-Lerma

Publication date: 1990

Published in: Acta Applicandae Mathematicae (Search for Journal in Brave)

zbMATH Keywords

adaptive control density estimation nonstationary value iteration discrete-time Markov control processes principle of estimation and control unknown distribution average-reward criterion nonparametric adaptive policies

Mathematics Subject Classification ID

Nonparametric estimation (62G05) Estimation and detection in stochastic control theory (93E10) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40)

Related Items (12)

Bayesian estimation of the mean holding time in average semi-Markov control processes ⋮ Unnamed Item ⋮ Dynamic Pricing and Learning with Finite Inventories ⋮ Value iteration in average cost Markov control processes on Borel spaces ⋮ Empirical estimation in average Markov control processes ⋮ Adaptive discounted control for piecewise deterministic Markov processes ⋮ Nonparametric estimation and adaptive control in a class of finite Markov decision chains ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Limiting optimal discounted-cost control of a class of time-varying stochastic systems ⋮ Unnamed Item

Cites Work

This page was built for publication: Density estimation and adaptive control of Markov processes: Average and discounted criteria

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2639029&oldid=15448151"