A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion

From MaRDI portal

This page was built for publication: A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
