On conditional optimality of a class of learning automata in random environments (Q800091)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On conditional optimality of a class of learning automata in random environments |
scientific article |
Statements
On conditional optimality of a class of learning automata in random environments (English)
0 references
1983
0 references
The author analyzes the asymptotic behavior of a new class of learning automata in Q-model environments (i.e., random environments with a finite number of responses). The reinforcement scheme adopted has just the same form as the Bayesian learning scheme, and is specialized to coincide with the \(\beta\) -model scheme introduced by R. D. Luce for an environment with binary responses (a P-model environment). A learning automaton is optimal, by definition, if it converges to its goal with probability one as time goes to infinity. Under some restrictions on the environments, the author proves the optimality of the defined automata. A method for constructing a special type of learning automata is presented, which is based on the use of the maximum-entropy principle. By applying this method to any Q-model environment and suitably choosing the output function, we can get an effective learning automaton with simple structure. The optimality of the resulting automaton is also investigated.
0 references
asymptotic behavior
0 references
learning automata
0 references
random environments
0 references
optimality
0 references
maximum-entropy principle
0 references
0 references