Finite state approximation algorithms for average cost denumerable state Markov decision processes (Q1066819)

From MaRDI portal

Revision as of 10:08, 30 July 2024 by Openalex240730090724 (talk | contribs) (Set OpenAlex properties.)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Finite state approximation algorithms for average cost denumerable state Markov decision processes	scientific article

Statements

scholarly article

0 references

Finite state approximation algorithms for average cost denumerable state Markov decision processes (English)

0 references

0 references

0 references

publication date

1985

0 references

This paper describes six algorithms for finding approximate solutions to infinite state space Markov decision processes under the average cost criteria. Three of the algorithms are variants on value iteration and three are variants on policy iteration. The convergence of these algorithms is guaranteed by a scrambling-type recurrency condition, (which ensures the average cost is independent of the starting state), and ''tail'' conditions (which allows states in the tail to be ignored). Computational results on the various algorithms are given.

0 references

zbMATH Keywords

approximate solutions

0 references

infinite state space Markov decision processes

0 references

average cost criteria

0 references

value iteration

0 references

policy iteration

0 references

convergence

0 references

MaRDI profile type

MaRDI publication profile

0 references

Contraction Mappings in the Theory Underlying Dynamic Programming

0 references

Contraction mappings underlying undiscounted Markov decision problems

0 references

The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms

0 references

0 references

A note on simultaneous recurrence conditions on a set of denumerable stochastic matrices

0 references

Finite-state approximations to denumerable-state dynamic programs

0 references

0 references

0 references

The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model

0 references

An approximation procedure for stochastic dynamic programming in countable state space

0 references

Approximation procedure for stochastic dynamic programming based on clustering of state and action spaces

0 references

0 references

Iterative solution of the functional equations of undiscounted Markov renewal programming

0 references

The blast furnaces problem

0 references

Dynamic programming, Markov chains, and the method of successive approximations

0 references

Finite-state approximations for denumerable-state infinite-horizon discounted Markov decision processes

0 references

Finite state approximation for denumerable-state infinite horizon contracted Markov decision processes: The policy space method

0 references

Approximations of Dynamic Programs, I

0 references

Approximations of Dynamic Programs, II

0 references

full work available at URL

https://doi.org/10.1007/bf01719758

0 references

Identifiers

zbMATH Open document ID

0 references

10.1007/BF01719758

0 references

Mathematics Subject Classification ID

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1066819

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1066819&oldid=37218378"