首页 | 本学科首页   官方微博 | 高级检索  
     

基于一般和随机对策论框架下的多智能体学习
引用本文:欧海涛, 张卫东, 许晓鸣. 基于一般和随机对策论框架下的多智能体学习. 自动化学报, 2002, 28(3): 423-426.
作者姓名:欧海涛  张卫东  许晓鸣
作者单位:1.上海交通大学自动化系,上海
基金项目:国家自然科学基金 ( 6 0 1 74 0 38)资助
摘    要:将Q-learning从单智能体框架上扩展到非合作的多智能体框架上,建立了在一般和随机对策框架下的多智能体理论框架和学习算法,提出了以Nash平衡点作为学习目标.给出了对策结构的约束条件,并证明了在此约束条件下算法的收敛性,对多智能体系统的研究与应用有重要意义.

关 键 词:多智能体   Q-learning   随机对策   Nash平衡点
收稿时间:2000-01-14
修稿时间:2000-01-14

MULTI-AGENT LEARNING BASED ON GENERAL-SUM STOCHASTIC GAMES
OU Hai-Tao, ZHANG Wei-Dong, XU Xiao-Ming. Multi-Agent Learning Based on General-Sum Stochastic Games. ACTA AUTOMATICA SINICA, 2002, 28(3): 423-426.
Authors:OU Hai-Tao  ZHANG Wei-Dong  XU Xiao-Ming
Affiliation:1. Department of Automation,Shanghai Jiaotong University,Shanghai
Abstract:Q -learning from original single-agent framework is extended to non-cooperative multi-agent framework, and the theoretic framework of multi-agent learning is proposed under general-sum stochastic games with Nash equilibrium point as learning objective. We introduce a multi-agent Q -learning algorithm and prove its convergence under certain restriction, which is very important for the study and application of multi-agent system.
Keywords:Multi-agent   Q -learning   stochastic games   Nash equilibrium point  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号