基于Agent的递阶强化学习模型与体系结构 Model and Architecture of Hierarchical Reinforcement Learning Based on Agent期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Agent的递阶强化学习模型与体系结构

引用本文：	王文玺,肖世德,孟祥印,陈应松,张卫华. 基于Agent的递阶强化学习模型与体系结构[J]. 机械工程学报, 2010, 46(2). DOI: 10.3901/JME.2010.02.076

作者姓名：	王文玺肖世德孟祥印陈应松张卫华

作者单位：	1. 西南交通大学机械工程学院,成都,610031 2. 西南交通大学牵引动力国家重点实验室,成都,610031

基金项目：	国家重点基础研究发展计划资助项目(973计划,2007CB714701)

摘要：	通过引入频率最大Q值启发式学习算法,对一种递阶强化学习方法进行改进,解决在庞大状态空间和动态变化环境中对Agent进行最优行为策略学习的问题。引入属性维护算子以及承诺和规划意识属性,对经典信念、愿望、意图模型进行扩展,给出意识属性的理性维护过程,增强Agent的自适应性并使Agent具有在动态环境中进行在线学习的能力。根据意识模型提出一种具有主动性、适应性、反应性、社会性的Agent体系结构,并根据该体系结构开发出一种路径规划Agent。通过对行驶环境的组态设定,模拟车辆复杂的行驶状态,并通过对行驶状态的不断学习,最终获得最优路径,证明体系结构的可行性和有效性。
关键词：	Agent 强化学习体系结构意识模型
Model and Architecture of Hierarchical Reinforcement Learning Based on Agent

WANG Wenxi,XIAO Shide,MENG Xiangyin,CHEN Yingsong,ZHANG Weihua. Model and Architecture of Hierarchical Reinforcement Learning Based on Agent[J]. Chinese Journal of Mechanical Engineering, 2010, 46(2). DOI: 10.3901/JME.2010.02.076

Authors:	WANG Wenxi XIAO Shide MENG Xiangyin CHEN Yingsong ZHANG Weihua

Affiliation:	1. School of Mechanical Engineering/a>;Southwest Jiaotong University/a>;Chengdu 610031/a>;2. Traction Power State Key Laboratory/a>;Chengdu 610031

Abstract:	By introducing frequency maximum Q heuristic learning algorithm,a hierarchical reinforcement learning method is improved,this method solves the problem of agent optimal strategy learning in a large scale state space and dynamic changing environment. Bringing attribute maintenance operator,the attribute of promise and layout into the classical belief,desire,intention(BDI) model,which is modified to increase the adaptability and in-line learning ability of agent,the rational maintenance process of consciousne...

Keywords:	Agent Reinforcement learning Architecture Consciousness model
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏