On bidecision processes (Q1340581): Difference between revisions

From MaRDI portal
Property / MaRDI profile type: MaRDI publication profile (normal rank)
Property / full work available at URL: https://doi.org/10.1006/jmaa.1994.1382 (normal rank)
Property / OpenAlex ID: W1976430771 (normal rank)

Latest revision as of 02:32, 20 March 2024

Language: English
Label: On bidecision processes
Description: scientific article

    Statements

    On bidecision processes (English)
    14 December 1994
    The author studies a so-called Markov bidecision process, obtained from the standard Markov decision process by incorporating maximization steps as well as minimization steps. Using an extended optimality equation, he constructs a pair of policies that maximize (resp. minimize) the total reward in some sense. The pair of policies is found by a policy iteration method.
    Markov bidecision process
    extended optimality equation
    policy iteration
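
    Illustration (not taken from the article): the review above mentions an optimality equation that mixes maximization and minimization steps, solved by policy iteration. The sketch below shows one simple way such a scheme can look. Everything in it is an assumption made for illustration: a finite state space, a discount factor gamma < 1, a hypothetical flag is_max marking each state as a maximization or a minimization step, and the names evaluate and policy_iteration. The article's actual model and method may differ, and this naive simultaneous improvement is not guaranteed to terminate in general, hence the sweep limit.

```python
import numpy as np


def evaluate(policy, P, R, gamma):
    """Solve v = r_pi + gamma * P_pi v exactly for a fixed policy."""
    n = len(policy)
    P_pi = np.array([P[s][policy[s]] for s in range(n)], dtype=float)
    r_pi = np.array([R[s][policy[s]] for s in range(n)], dtype=float)
    return np.linalg.solve(np.eye(n) - gamma * P_pi, r_pi)


def policy_iteration(P, R, is_max, gamma, max_iter=100):
    """Greedy improvement: argmax at max-states, argmin at min-states.

    P[s][a] is a probability vector over next states, R[s][a] a reward.
    Assumed model for illustration only, not the article's definition.
    """
    n = len(P)
    policy = [0] * n
    for _ in range(max_iter):
        v = evaluate(policy, P, R, gamma)
        new_policy = []
        for s in range(n):
            q = np.array([R[s][a] + gamma * np.dot(P[s][a], v)
                          for a in range(len(P[s]))])
            best = q.max() if is_max[s] else q.min()
            if abs(q[policy[s]] - best) < 1e-12:
                # Keep the current action on ties to avoid chattering.
                new_policy.append(policy[s])
            else:
                new_policy.append(int(q.argmax() if is_max[s] else q.argmin()))
        if new_policy == policy:
            # v now satisfies the mixed max/min optimality equation.
            return policy, v
        policy = new_policy
    raise RuntimeError("no stable policy found within max_iter sweeps")


# Toy instance: state 0 maximizes, state 1 minimizes, state 2 is absorbing.
P = [
    [[0, 0, 1], [0, 1, 0]],  # state 0: stop (go to 2) or move to state 1
    [[0, 0, 1], [1, 0, 0]],  # state 1: stop (go to 2) or move to state 0
    [[0, 0, 1]],             # state 2: single action, stays put
]
R = [[1.0, 0.0], [4.0, 1.0], [0.0]]
is_max = [True, False, True]

policy, v = policy_iteration(P, R, is_max, gamma=0.5)
print(policy, v)
```

    In this toy instance the maximizing state stops immediately (value 1), and the minimizing state prefers paying 1 and moving to state 0 (value 1 + 0.5 * 1 = 1.5) over paying 4.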