The method of value oriented successive approximations for the average reward Markov decision process (Q1144501): Difference between revisions

From MaRDI portal

Jump to:navigation, search

Latest revision as of 08:43, 30 July 2024

scientific article

Language	Label	Description	Also known as
English	The method of value oriented successive approximations for the average reward Markov decision process	scientific article

Statements

scholarly article

0 references

The method of value oriented successive approximations for the average reward Markov decision process (English)

0 references

0 references

0 references

publication date

1980

0 references

zbMATH Keywords

value oriented successive approximations

0 references

average reward

0 references

finite state space

0 references

finite action space

0 references

almost optimal solutions

0 references

convergence

0 references

MaRDI profile type

MaRDI publication profile

0 references

Optimal decision procedures for finite Markov chains. Part II: Communicating systems

0 references

0 references

0 references

Technical Note—Bounds on the Gain of a Markov Decision Process

0 references

Technical Note—The Method of Successive Approximations and Markovian Decision Problems

0 references

0 references

Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations

0 references

Discounting, Ergodicity and Convergence for Markov Decision Processes

0 references

A set of successive approximation methods for discounted Markovian decision problems

0 references

0 references

On Finding the Maximal Gain for Markov Decision Processes

0 references

Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming

0 references

Some Bounds for Discounted Sequential Decision Processes

0 references

Iterative solution of the functional equations of undiscounted Markov renewal programming

0 references

The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

0 references

Geometric convergence of value-iteration in multichain Markov decision problems

0 references

A successive approximation algorithm for an undiscounted Markov decision process

0 references

Dynamic programming, Markov chains, and the method of successive approximations

0 references

full work available at URL

https://doi.org/10.1007/bf01719500

0 references

Identifiers

zbMATH Open document ID

0 references

10.1007/BF01719500

0 references

Mathematics Subject Classification ID

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1144501

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1144501&oldid=36934337"