多智能体的增强学习及其在RoboCup中的应用 Reinforcement learning for Multi-Agents Systems and its application in RoboCup期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

多智能体的增强学习及其在RoboCup中的应用

引用本文：	刘国栋,杨宝庆.多智能体的增强学习及其在RoboCup中的应用[J].计算机工程与应用,2008,44(23):46-48.

作者姓名：	刘国栋杨宝庆

作者单位：	江南大学控制科学与工程研究中心，江苏无锡 214122

摘要：	针对非确定马尔可夫环境下的多智能体系统，提出了多智能体Q学习模型和算法。算法中通过对联合动作的统计来学习其它智能体的行为策略，并利用智能体策略向量的全概率分布保证了对联合最优动作的选择。在实验中，成功实现了智能体的决策，提高了AFU队的整体的对抗能力，证明了算法的有效性和可行性。
关键词：	多智能体增强学习机器人世界杯足球锦标赛
收稿时间：	2007-10-18
修稿时间：	2008-1-21
Reinforcement learning for Multi-Agents Systems and its application in RoboCup

LIU Guo-dong,YANG Bao-qing.Reinforcement learning for Multi-Agents Systems and its application in RoboCup[J].Computer Engineering and Applications,2008,44(23):46-48.

Authors:	LIU Guo-dong YANG Bao-qing

Affiliation:	School of Communication and Control Engineering，Jiangnan University，Wuxi，Jiangsu 214122，China

Abstract:	Due to the presence of other agents，the environment of Multi-Agent Systems（MAS） cannot be simply treated as Markov Decision Processes（MDPs）.The current reinforcement learning which are based on MDPs must be reformed before it can be applicable to MAS.Based on an agent’s independent learning ability，this paper proposes a novel Q-learning algorithm for MAS-an agent learning other agents action policies through observing the joint action.The politicies of other agents are expressed as action probability distribution matrixes.A concise and yet useful updating method for the matrixes is proposed.The full joint probability of distribution matrixes guarantees the learning agent to choose its optimal action.In experiment，the implemention of the agent and the enhancement of AFU shows that the approach is valid and efficient.

Keywords:	Multi-Agents Systems(MAS) reinforcement learning Robot World Cup(RoboCup)
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏