Optimal threshold probability in undiscounted Markov decision processes with a target set. (Q1427885): Difference between revisions
From MaRDI portal
ReferenceBot (talk | contribs) Changed an Item |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1016/s0096-3003(03)00158-9 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2083611438 / rank | |||
Normal rank |
Latest revision as of 11:10, 30 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Optimal threshold probability in undiscounted Markov decision processes with a target set. |
scientific article |
Statements
Optimal threshold probability in undiscounted Markov decision processes with a target set. (English)
0 references
14 March 2004
0 references
The paper is about minimizing risk models with a threshold criterion \(P_i^\pi(Z \leq r)\) where \(Z=\sum_{k=1}^{\tau-1} Y_k\) and \(\tau\) is a first passage time to a target set. The problem is modeled as undiscounted Markov decision process with discrete time space and countable state, action, and reward space. The main results are that the optimal value function is a unique solution to an optimality equation and that an optimal right continuous stationary policy exists. Some value iteration methods and a policy space iteration method are presented.
0 references
minimizing risk model
0 references
opimal value function
0 references
stationary policy
0 references