
A New Multi-Agent Reinforcement Learning Algorithm and Its Application to Multi-Robot Cooperation Tasks

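Cite this article:	GU Guo-chang, ZHONG Yu, ZHANG Ru-bo. A New Multi-Agent Reinforcement Learning Algorithm and Its Application to Multi-Robot Cooperation Tasks[J]. Robot (机器人), 2003, 25(4): 344-348.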

Authors:	GU Guo-chang (顾国昌), ZHONG Yu (仲宇), ZHANG Ru-bo (张汝波)

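Affiliations:	1. College of Computer Science and Technology, Harbin Engineering University, Harbin 150001; 2. College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, and Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016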

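Funding:	Supported by the Foundation of the Open Research Laboratory of Robotics, Chinese Academy of Sciences (RL200106), the Weapons and Equipment Pre-research Fund, and the National Defense Basic Research Fund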

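Abstract:	In multi-robot systems, judging how good one robot's behavior is often depends on the behaviors of the other robots, so joint actions must be used to achieve multi-robot cooperation. Reinforcement learning algorithms that use joint actions, however, converge extremely slowly because the learning space is enormous. The new method proposed in this paper reduces the dimensionality of the learning space by predicting the probability with which each robot executes each action, and it is applied to multi-robot cooperation tasks. Experimental results show that the prediction-based accelerated reinforcement learning algorithm obtains a multi-robot cooperation strategy faster than the original algorithm.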
Keywords:	distributed reinforcement learning; accelerating algorithm; multi-agent system
Article ID:	1002-0446(2003)04-0344-06
A NEW MULTI-AGENT REINFORCEMENT LEARNING ALGORITHM AND ITS APPLICATION TO MULTI-ROBOT COOPERATION TASKS

GU Guo-chang, ZHONG Yu, ZHANG Ru-bo. A NEW MULTI-AGENT REINFORCEMENT LEARNING ALGORITHM AND ITS APPLICATION TO MULTI-ROBOT COOPERATION TASKS[J]. Robot, 2003, 25(4): 344-348.

Authors:	GU Guo-chang, ZHONG Yu, ZHANG Ru-bo

Affiliation:	GU Guo-chang 1, ZHONG Yu 1, ZHANG Ru-bo 1,2

Abstract:	In multi-robot systems, joint actions must be employed to achieve cooperation because the evaluation of one robot's behavior often depends on the behaviors of the other robots. However, joint-action reinforcement learning algorithms converge slowly because of the enormous learning space that joint actions produce. This paper presents a prediction-based reinforcement learning algorithm for multi-robot cooperation tasks, which requires every robot to learn to predict the probabilities of the actions that the other robots may execute. A multi-robot cooperation experiment tests the efficacy of the new algorithm, and the results show that it obtains the cooperation strategy much faster than the original reinforcement learning algorithm.

Keywords:	distributed reinforcement learning; accelerating algorithm; multi-agent system
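
The abstract does not give the update rule, so the following Python sketch illustrates only the two ideas it names, under stated assumptions: how a joint-action table grows with the number of robots, and how predicted action probabilities can collapse a joint-action table to one value per own action. The empirical-frequency predictor is a common opponent-modelling device swapped in for illustration, not the authors' method; the names `ActionPredictor`, `expected_values`, the `push`/`wait` actions, and the payoff table are all hypothetical.

```python
from collections import Counter
from itertools import product


def joint_table_size(n_states, n_actions, n_robots):
    """Entries in a Q-table indexed by state and the joint action of all robots."""
    return n_states * n_actions ** n_robots


def per_robot_table_size(n_states, n_actions):
    """Entries in a Q-table indexed by state and one robot's own action only."""
    return n_states * n_actions


class ActionPredictor:
    """Empirical-frequency estimate of another robot's action probabilities.

    Assumption: the other robot's actions are observable after each step.
    """

    def __init__(self, actions):
        self.actions = list(actions)
        self.counts = Counter()

    def observe(self, action):
        self.counts[action] += 1

    def probabilities(self):
        total = sum(self.counts.values())
        if total == 0:
            # No observations yet: fall back to a uniform guess.
            return {a: 1.0 / len(self.actions) for a in self.actions}
        return {a: self.counts[a] / total for a in self.actions}


def expected_values(joint_q, own_actions, predictor):
    """Weight each joint-action entry by the predicted probability of the
    partner's action, leaving one expected value per own action."""
    probs = predictor.probabilities()
    return {
        own: sum(probs[other] * joint_q[(own, other)] for other in predictor.actions)
        for own in own_actions
    }


if __name__ == "__main__":
    actions = ["push", "wait"]
    # Hypothetical single-state payoff table: reward only when both robots push.
    joint_q = {(a, b): 0.0 for a, b in product(actions, actions)}
    joint_q[("push", "push")] = 1.0

    partner = ActionPredictor(actions)
    for seen in ["push", "push", "push", "wait"]:
        partner.observe(seen)

    print(joint_table_size(100, 5, 3), per_robot_table_size(100, 5))
    print(expected_values(joint_q, actions, partner))
```

With 100 states, 5 actions, and 3 robots, the joint-action table has 100 × 5³ = 12500 entries against 500 for a table over a single robot's own actions, which is the growth the abstract blames for slow convergence.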
This article is indexed in CNKI, VIP (维普), Wanfang Data, and other databases.