Action-dependent stopping times and Markov decision process with unbounded rewards (Q1158111)

From MaRDI portal


scientific article

Language	Label	Description	Also known as
English	Action-dependent stopping times and Markov decision process with unbounded rewards	scientific article

Statements

scholarly article

Action-dependent stopping times and Markov decision process with unbounded rewards (English)

publication date

1981

zbMATH Keywords

successive-approximation method

semi Markov decision processes

unbounded rewards

actions-dependent stopping time

algorithm

equal-row-sum property

lower bounds

elimination of non-optimal actions

upper bounds

J. A. E. E. Van Nunen

Shaler Stidham Jr.

MaRDI profile type

MaRDI publication profile


Note—A Test for Nonoptimal Actions in Undiscounted Finite Markov Decision Chains

Markov decision processes and strongly excessive functions

Applying a New Device in the Optimization of Exponential Queuing Systems

On Dynamic Programming with Unbounded Rewards

Discounting, Ergodicity and Convergence for Markov Decision Processes

A set of successive approximation methods for discounted Markovian decision problems

Note—A Note on Dynamic Programming with Unbounded Rewards

Successive approximations for Markov decision processes and Markov games with unbounded rewards

On theory and algorithms for Markov decision problems with the total reward criterion

Bounds and Transformations for Discounted Finite Markov Decision Chains

Iterative solution of the functional equations of undiscounted Markov renewal programming

Technical Note—An Equivalence Between Continuous and Discrete Time Markov Decision Processes

Markov programming by successive approximations with respect to weighted supremum norms

full work available at URL

https://doi.org/10.1007/bf01783952


Identifiers

zbMATH Open document ID

DOI

10.1007/BF01783952

Mathematics Subject Classification ID

zbMATH DE Number

Sitelinks

Mathematics(1 entry)

mardi Publication:1158111
