Action-dependent stopping times and Markov decision process with unbounded rewards (Q1158111)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Action-dependent stopping times and Markov decision process with unbounded rewards
scientific article

    Statements

    Action-dependent stopping times and Markov decision process with unbounded rewards (English)
    0 references
    0 references
    1981
    0 references
    successive-approximation method
    0 references
    semi Markov decision processes
    0 references
    unbounded rewards
    0 references
    actions-dependent stopping time
    0 references
    algorithm
    0 references
    equal-row- sum property
    0 references
    lower bounds
    0 references
    elimination of non-optimal actions
    0 references
    upper bounds
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references