The method of value oriented successive approximations for the average reward Markov decision process (Q1144501)

scientific article

Language	Label	Description	Also known as
English	The method of value oriented successive approximations for the average reward Markov decision process	scientific article

Statements

instance of

scholarly article

0 references

title

The method of value oriented successive approximations for the average reward Markov decision process (English)

0 references

0 references

0 references

1980

0 references

zbMATH Keywords

value oriented successive approximations

0 references

average reward

0 references

finite state space

0 references

finite action space

0 references

almost optimal solutions

0 references

convergence

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Optimal decision procedures for finite Markov chains. Part II: Communicating systems

0 references

Q3245701

0 references

Q3251743

0 references

Technical Note—Bounds on the Gain of a Markov Decision Process

0 references

Technical Note—The Method of Successive Approximations and Markovian Decision Problems

0 references

Q3266141

0 references

Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations

0 references

Discounting, Ergodicity and Convergence for Markov Decision Processes

0 references

A set of successive approximation methods for discounted Markovian decision problems

0 references

Q4190426

0 references

On Finding the Maximal Gain for Markov Decision Processes

0 references

Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming

0 references

Some Bounds for Discounted Sequential Decision Processes

0 references

Iterative solution of the functional equations of undiscounted Markov renewal programming

0 references

The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

0 references

Geometric convergence of value-iteration in multichain Markov decision problems

0 references

A successive approximation algorithm for an undiscounted Markov decision process

0 references

Dynamic programming, Markov chains, and the method of successive approximations

0 references

full work available at URL

https://doi.org/10.1007/bf01719500

0 references

Identifiers

zbMATH Open document ID

0443.90109

0 references

DOI

10.1007/BF01719500

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1144501