Finding optimal memoryless policies of POMDPs under the expected average reward criterion
From MaRDI portal
Recommendations
- Finding Optimal Observation-Based Policies for Constrained POMDPs Under the Expected Average Reward Criterion
- Finite-memory strategies in POMDPs with long-run average objectives
- Average optimality for continuous-time Markov decision processes with a policy iteration approach
- On Finding Optimal Policies for Markov Decision Chains: A Unifying Framework for Mean-Variance-Tradeoffs
- The policy iteration algorithm for average reward Markov decision processes with general state space
- Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion
- Policy iteration for bounded-parameter POMDPs
- On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
Cites work
- scientific article; zbMATH DE number 1321699
- scientific article; zbMATH DE number 700091
- scientific article; zbMATH DE number 1753152
- scientific article; zbMATH DE number 1753153
- A survey of algorithmic methods for partially observed Markov decision processes
- Basic ideas for event-based optimization of Markov systems
- Convergence of simulation-based policy iteration
- Event-Based Optimization of Markov Systems
- Optimization of a special case of continuous-time Markov decision processes with compact action set
- Performance optimization algorithms based on potentials for semi-Markov control processes
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- Potential-Based Online Policy Iteration Algorithms for Markov Decision Processes
- Simulation-based optimization of Markov reward processes
- Stochastic learning and optimization. A sensitivity-based approach.
- The $n$th-Order Bias Optimality for Multichain Markov Decision Processes
- The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
Cited in (6 documents)
- Mean-payoff optimization in continuous-time Markov chains with parametric alarms
- Finite-memory strategies in POMDPs with long-run average objectives
- Geometry of policy improvement
- Centralized Optimization for Dec-POMDPs Under the Expected Average Reward Criterion
- scientific article; zbMATH DE number 7625165
- Future memories are not needed for large classes of POMDPs
This page was built for publication: Finding optimal memoryless policies of POMDPs under the expected average reward criterion