Multiaction learning automata possessing ergodicity of the mean (Q1067787)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Multiaction learning automata possessing ergodicity of the mean
scientific article

    Statements

    Multiaction learning automata possessing ergodicity of the mean (English)
    0 references
    0 references
    0 references
    1985
    0 references
    Multiaction learning automata which update their action probabilities on the basis of the responses they get from an environment are considered in this paper. The automata update the probabilities according to whether the environment responds with a reward or a penalty. Learning automata are said to possess ergodicity of the mean if the mean action probability is the state probability (or unconditional probability) of an ergodic Markov chain. In an earlier paper [IEEE Trans. Syst. Man. Cybern. SMC-13, 1143-1148 (1983; Zbl 0537.68052)] we considered the problem of a two- action learning automaton being ergodic in the mean (EM). The family of such automata was characterized completely by proving the necessary and sufficient conditions for automata to be EM. In this paper, we generalize our earlier results [loc. cit.] and obtain necessary and sufficient conditions for the multiaction learning automaton to be EM. These conditions involve two families of probability updating functions. It is shown that for the automaton to be EM the two families must be linearly dependent. The vector defining the linear dependence is the only vector parameter which controls the rate of convergence of the automaton. Further, the technique for reducing the variance of the limiting distribution is discussed. Just as in the two-action case, it is shown that the set of absolutely expedient schemes and the set of schemes which possess ergodicity of the mean are mutually disjoint.
    0 references
    mean action probability
    0 references
    ergodic Markov chain
    0 references
    probability updating functions
    0 references
    linear dependence
    0 references

    Identifiers