首页 | 本学科首页   官方微博 | 高级检索  
     


Decentralized Learning in Markov Games
Authors:Vrancx   P. Verbeeck   K. Nowe   A.
Affiliation:Comput. Modeling Lab., Vrije Univ. Brussel, Brussels;
Abstract: Learning automata (LA) were recently shown to be valuable tools for designing multiagent reinforcement learning algorithms. One of the principal contributions of the LA theory is that a set of decentralized independent LA is able to control a finite Markov chain with unknown transition probabilities and rewards. In this paper, we propose to extend this algorithm to Markov games—a straightforward extension of single-agent Markov decision problems to distributed multiagent decision problems. We show that under the same ergodic assumptions of the original theorem, the extended algorithm will converge to a pure equilibrium point between agent policies.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号