On bidecision processes (Q1340581)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On bidecision processes |
scientific article |
Statements
On bidecision processes (English)
0 references
14 December 1994
0 references
The author studies a (so-called) Markov bidecision process resulting from the standard Markov decision process by incorporating steps of maximization as well as minimization. With the help of an extended optimality equation he constructs a pair of policies, maximizing (resp. minimizing) the total reward in some sense. The pair of policies is found by a policy iteration method.
0 references
Markov bidecision process
0 references
extended optimality equation
0 references
policy iteration
0 references