Action-dependent stopping times and Markov decision process with unbounded rewards (Q1158111)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Action-dependent stopping times and Markov decision process with unbounded rewards |
scientific article |
Statements
Action-dependent stopping times and Markov decision process with unbounded rewards (English)
0 references
1981
0 references
successive-approximation method
0 references
semi Markov decision processes
0 references
unbounded rewards
0 references
actions-dependent stopping time
0 references
algorithm
0 references
equal-row- sum property
0 references
lower bounds
0 references
elimination of non-optimal actions
0 references
upper bounds
0 references
0 references
0 references
0 references