The LP approach in average reward MDPs with multiple cost constraints: The countable state case
DOI: 10.1080/02522667.1997.10699311
zbMath: 0886.90174
OpenAlex: W2018101745
MaRDI QID: Q4354089
Publication date: 10 September 1997
Published in: Journal of Information and Optimization Sciences
Full work available at URL: https://doi.org/10.1080/02522667.1997.10699311
Keywords: duality gap; constrained optimal stationary policy; countable state average reward Markov decision processes; multiple cost constraints
Related Items
- On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs
- MF-OMO: An Optimization Formulation of Mean-Field Games
- The Lagrange and the vanishing discount techniques to controlled diffusions with cost constraints
Cites Work
- Optimal policies for controlled Markov chains with a constraint
- Average cost Markov decision processes: Optimality conditions
- Finite state Markovian decision processes
- Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach
- Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints
- Ergodic Control of Markov Chains with Constraints—the General Case
- Denumerable Constrained Markov Decision Processes and Finite Approximations
- Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory
- Linear programming formulation of MDPs in countable state space: The multichain case
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
- Markov Decision Processes with Sample Path Constraints: The Communicating Case