Steering policies for controlled Markov chains under a recurrence condition

From MaRDI portal

Publication:4506874

Jump to:navigation, search

DOI10.1109/9.780427zbMath0955.93061MaRDI QIDQ4506874

Armand M. Makowski, Dye-Jyun Ma

Publication date: 17 October 2000

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/9.780427

zbMATH Keywords

adaptive control; Markov decision processes; sample path arguments; controlled Markov chains; recurrence condition; sample average costs

Mathematics Subject Classification ID

93E20: Optimal stochastic control

93E35: Stochastic learning and adaptive control

90C40: Markov and semi-Markov decision processes

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4506874&oldid=18602697"