The control of a two-level Markov decision process by time aggregation

From MaRDI portal

Publication:2641752

Jump to:navigation, search

DOI10.1016/j.automatica.2005.11.006zbMath1127.90074MaRDI QIDQ2641752

Yat-Wah Wan, Cao, Xiren

Publication date: 23 August 2007

Published in: Automatica (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.automatica.2005.11.006

zbMATH Keywords

Markov decision processes; policy iteration; performance potentials; time aggregation; coupled decisions; two-level systems

Mathematics Subject Classification ID

93E20: Optimal stochastic control

90C40: Markov and semi-Markov decision processes

Related Items

Power and delay optimisation in multi-hop wireless networks

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2641752&oldid=15446608"