多智能体分层强化学习综述 A survey on multi-agent hierarchical reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

多智能体分层强化学习综述

引用本文：	殷昌盛,杨若鹏,朱巍,邹小飞,李峰.多智能体分层强化学习综述[J].智能系统学报,2020,15(4):646-655.

作者姓名：	殷昌盛杨若鹏朱巍邹小飞李峰

作者单位：	国防科技大学信息通信学院，湖北武汉 430010

摘要：	作为机器学习和人工智能领域的一个重要分支，多智能体分层强化学习以一种通用的形式将多智能体的协作能力与强化学习的决策能力相结合，并通过将复杂的强化学习问题分解成若干个子问题并分别解决，可以有效解决空间维数灾难问题。这也使得多智能体分层强化学习成为解决大规模复杂背景下智能决策问题的一种潜在途径。首先对多智能体分层强化学习中涉及的主要技术进行阐述，包括强化学习、半马尔可夫决策过程和多智能体强化学习；然后基于分层的角度，对基于选项、基于分层抽象机、基于值函数分解和基于端到端等4种多智能体分层强化学习方法的算法原理和研究现状进行了综述；最后介绍了多智能体分层强化学习在机器人控制、博弈决策以及任务规划等领域的应用现状。
关键词：	人工智能机器学习强化学习多智能体综述深度学习分层强化学习应用现状
A survey on multi-agent hierarchical reinforcement learning

YIN Changsheng,YANG Ruopeng,ZHU Wei,ZOU Xiaofei,LI Feng.A survey on multi-agent hierarchical reinforcement learning[J].CAAL Transactions on Intelligent Systems,2020,15(4):646-655.

Authors:	YIN Changsheng YANG Ruopeng ZHU Wei ZOU Xiaofei LI Feng

Affiliation:	School of Information and Communication, National University of Defense Technology, Wuhan 430010, China

Abstract:	As an important research area in the field of machine learning and artificial intelligence, multi-agent hierarchical reinforcement learning (MAHRL) integrates the advantages of the collaboration of multi-agent system (MAS) and the decision making of reinforcement learning (RL) in a general-purpose form, and decomposes the RL problem into sub-problems and solves each of them to overcome the so-called curse of dimensionality. So MAHRL offers a potential way to solve large-scale and complex decision problem. In this paper, we systematically describe three key technologies of MAHRL: reinforcement learning (RL), Semi Markov Decision Process (SMDP), multi-agent reinforcement learning (MARL). We then systematically describe four main categories of the MAHRL method from the angle of hierarchical learning, which includes Option, HAM, MAXQ and End-to-End. Finally, we end up with summarizing the application status of MAHRL in robot control, game decision making and mission planning.

Keywords:	artificial intelligence machine learning reinforcement learning multi-agent summary reinforcement learning hierarchical reinforcement learning application status

	点击此处可从《智能系统学报》浏览原始摘要信息
	点击此处可从《智能系统学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏