Adaptive control of Markov processes with incomplete state information and unknown parameters

Publication:1071659

DOI: 10.1007/BF00941283; zbMath: 0585.90090; MaRDI QID: Q1071659

Onésimo Hernández-Lerma, Steven I. Marcus

Publication date: 1987

Published in: Journal of Optimization Theory and Applications

zbMATH Keywords

approximations; unknown parameters; nonstationary value iteration; partially observed Markov decision processes; principle of estimation and control; asymptotically optimal adaptive policies; discounted reward criterion; optimal reward function

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Related Items (5)

On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes ⋮ Nonparametric adaptive control of discrete-time partially observable stochastic systems ⋮ Adaptive control of constrained Markov chains: Criteria and policies ⋮ Adaptive control of service in queueing systems ⋮ Optimal cost and policy for a Markovian replacement problem
