Solving Markovian decision processes by successive elimination of variables (Q1105497)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Solving Markovian decision processes by successive elimination of variables
scientific article

    Statements

    Solving Markovian decision processes by successive elimination of variables (English)
    0 references
    1988
    0 references
    Semi-Markov decision processes are solved by some algorithms which are interesting mainly for formal analytical reasons and only in very special cases for computational purposes, too. The core of the algorithms consists in eliminating one component of the (relative) value function whereas the set of actions increases in dimensionality, finally arriving at optimization over the whole set of stationary policies. The discounted case is treated only to show this main feature. In the undiscounted case the first algorithm shows that the optimality equations (with gain rate independent of the state) has a solution if the underlying problem has a constant maximal gain rate (without any structural condition). The second algorithm constructs such a solution, possibly with some free parameters, whenever a solution exists.
    0 references
    successive elimination of variables
    0 references
    estimation of state variables
    0 references
    Semi- Markov decision processes
    0 references
    algorithms
    0 references
    discounted case
    0 references
    undiscounted case
    0 references
    0 references

    Identifiers