scientific article

From MaRDI portal

Revision as of 12:22, 5 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:3772003

Jump to:navigation, search

zbMath0633.90091MaRDI QIDQ3772003

Onésimo Hernández-Lerma

Publication date: 1987

Full work available at URL: https://eudml.org/doc/28802

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

unknown parameters approximation procedures value-iteration average-reward controlled Markov processes Borel state and control spaces optimal adaptive policies

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Related Items (2)

Density estimation and adaptive control of Markov processes: Average and discounted criteria ⋮ A forecast horizon and a stopping rule for general Markov decision processes

Cites Work

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3772003&oldid=17312770"