On bidecision processes (Q1340581): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1006/jmaa.1994.1382 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W1976430771 / rank | |||
Normal rank |
Latest revision as of 01:32, 20 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On bidecision processes |
scientific article |
Statements
On bidecision processes (English)
0 references
14 December 1994
0 references
The author studies a (so-called) Markov bidecision process resulting from the standard Markov decision process by incorporating steps of maximization as well as minimization. With the help of an extended optimality equation he constructs a pair of policies, maximizing (resp. minimizing) the total reward in some sense. The pair of policies is found by a policy iteration method.
0 references
Markov bidecision process
0 references
extended optimality equation
0 references
policy iteration
0 references