Asymptotic behavior of a hierarchical system of learning automata |
| |
Authors: | M.A.L. Thathachar K.M. Ramachandran |
| |
Affiliation: | Department of Electrical Engineering, Indian Institute of Science, Bangalore 560012, India;T.I.F.R. Centre, Post Box 1234, Indian Institute of Science Campus, Bangalore 560012, India |
| |
Abstract: | Learning automata arranged in a two-level hierarchy are considered. The automata operate in a stationary random environment and update their action probabilities according to the linear-reward-ε-penalty algorithm at each level. Unlike some hierarchical systems previously proposed, no information transfer exists from one level to another, and yet the hierarchy possesses good convergence properties. Using weak-convergence concepts it is shown that for large time and small values of parameters in the algorithm, the evolution of the optimal path probability can be represented by a diffusion whose parameters can be computed explicitly. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|