scientific article; zbMATH DE number 4031438

From MaRDI portal

Publication:3772003

MaRDI QID: Q3772003

Authors: Onésimo Hernández-Lerma

Publication date: 1987

Full work available at URL: https://eudml.org/doc/28802

zbMATH Keywords

unknown parameters; approximation procedures; value-iteration; average-reward controlled Markov processes; Borel state and control spaces; optimal adaptive policies

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Cited in (14)
