Optimal threshold probability in undiscounted Markov decision processes with a target set. (Q1427885): Difference between revisions

The paper is about minimizing risk models with a threshold criterion \(P_i^\pi(Z \leq r)\) where \(Z=\sum_{k=1}^{\tau-1} Y_k\) and \(\tau\) is a first passage time to a target set. The problem is modeled as undiscounted Markov decision process with discrete time space and countable state, action, and reward space. The main results are that the optimal value function is a unique solution to an optimality equation and that an optimal right continuous stationary policy exists. Some value iteration methods and a policy space iteration method are presented.

0 references

reviewed by

Matthias Ehrgott

0 references

zbMATH Keywords

minimizing risk model

0 references

opimal value function

0 references

stationary policy

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

An Analysis of Stochastic Shortest Path Problems

0 references

Discrete Dynamic Programming

0 references

Target-level criterion in Markov decision processes

0 references

Contraction Mappings in the Theory Underlying Dynamic Programming

0 references

On Sequential Decisions and Markov Chains

0 references

Finite state Markovian decision processes

0 references

Percentile performance criteria for limiting average Markov decision processes

0 references

Q4255598

0 references

Optimal policy for minimizing risk models in Markov decision processes

0 references

Equivalence classes for optimizing risk models in Markov decision processes.

0 references

Minimizing risk models in stochastic shortest path problems

0 references

The variance of discounted Markov decision processes

0 references

Discrete Dynamic Programming with Sensitive Discount Optimality Criteria

0 references

A Survey of Applications of Markov Decision Processes

0 references

Minimising a threshold probability in discounted Markov decision processes

0 references

Minimizing risk models in Markov decision processes with policies depending on target values

0 references

full work available at URL

https://doi.org/10.1016/s0096-3003(03)00158-9

0 references

Identifiers

zbMATH Open document ID

1084.91022

0 references

DOI

10.1016/S0096-3003(03)00158-9

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1427885

@@ Property / full work available at URL @@
+https://doi.org/10.1016/s0096-3003(03)00158-9
+Normal rank
@@ Property / OpenAlex ID @@
+W2083611438
@@ Property / OpenAlex ID: W2083611438 / rank @@
+Normal rank