基于一般和随机对策论框架下的多智能体学习 MULTI-AGENT LEARNING BASED ON GENERAL-SUM STOCHASTIC GAMES期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于一般和随机对策论框架下的多智能体学习

引用本文：	欧海涛, 张卫东, 许晓鸣. 基于一般和随机对策论框架下的多智能体学习. 自动化学报, 2002, 28(3): 423-426.

作者姓名：	欧海涛张卫东许晓鸣

作者单位：	1.上海交通大学自动化系,上海

基金项目：	国家自然科学基金 ( 6 0 1 74 0 38)资助

摘要：	将Q-learning从单智能体框架上扩展到非合作的多智能体框架上,建立了在一般和随机对策框架下的多智能体理论框架和学习算法,提出了以Nash平衡点作为学习目标.给出了对策结构的约束条件,并证明了在此约束条件下算法的收敛性,对多智能体系统的研究与应用有重要意义.
关键词：	多智能体 Q-learning 随机对策 Nash平衡点
收稿时间：	2000-01-14
修稿时间：	2000-01-14
MULTI-AGENT LEARNING BASED ON GENERAL-SUM STOCHASTIC GAMES

OU Hai-Tao, ZHANG Wei-Dong, XU Xiao-Ming. Multi-Agent Learning Based on General-Sum Stochastic Games. ACTA AUTOMATICA SINICA, 2002, 28(3): 423-426.

Authors:	OU Hai-Tao ZHANG Wei-Dong XU Xiao-Ming

Affiliation:	1. Department of Automation,Shanghai Jiaotong University,Shanghai

Abstract:	Q -learning from original single-agent framework is extended to non-cooperative multi-agent framework, and the theoretic framework of multi-agent learning is proposed under general-sum stochastic games with Nash equilibrium point as learning objective. We introduce a multi-agent Q -learning algorithm and prove its convergence under certain restriction, which is very important for the study and application of multi-agent system.

Keywords:	Multi-agent Q -learning stochastic games Nash equilibrium point
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《自动化学报》浏览原始摘要信息
	点击此处可从《自动化学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏