A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion

From MaRDI portal
Publication:481787

DOI10.1007/S10957-013-0474-6zbMATH Open1302.90241OpenAlexW1999785104WikidataQ124799376 ScholiaQ124799376MaRDI QIDQ481787FDOQ481787


Authors: Rolando Cavazos-Cadena, Karel Sladký, Raúl Montes-de-Oca Edit this on Wikidata


Publication date: 15 December 2014

Published in: Journal of Optimization Theory and Applications (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10957-013-0474-6




Recommendations




Cites Work


Cited In (6)





This page was built for publication: A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q481787)