scientific article; zbMATH DE number 4170671

From MaRDI portal

Publication:3496174

Jump to:navigation, search

MaRDI QIDQ3496174zbMATH OpenFDO

Authors Jinjong Zhan, Yun Zhang

Publication date 1987

zbMATH Keywords

stationary policy unbounded rewards \(\beta \) -optimal policy Borel decision

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Recommendations

Cited in

(9)

Existence of optimal stationary policies in discounted Markov decision processes: Approaches by occupation measures
scientific article; zbMATH DE number 4003943 (Why is no real title available?)
Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
scientific article; zbMATH DE number 3898637 (Why is no real title available?)
On the properties of ( 0) optimal policies in discounted unbounded return model
PAC Bounds for Discounted MDPs
scientific article; zbMATH DE number 4158414 (Why is no real title available?)
scientific article; zbMATH DE number 4170670 (Why is no real title available?)
Bounded Parameter Markov Decision Processes with Average Reward Criterion

This page was built for publication:

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3496174)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3496174&oldid=16843547"