首页 | 本学科首页   官方微博 | 高级检索  
     

基于强化学习的多策略选择遗传算法
引用本文:王晓燕,刘全,傅启明,张乐. 基于强化学习的多策略选择遗传算法[J]. 计算机工程, 2011, 37(8): 149-152. DOI: 10.3969/j.issn.1000-3428.2011.08.051
作者姓名:王晓燕  刘全  傅启明  张乐
作者单位:苏州大学计算机科学与技术学院,江苏,苏州,215006
基金项目:国家自然科学基金资助项目,江苏省自然科学基金资助项目,江苏省高校自然科学研究基金资助项目,江苏省现代企业信息化应用支撑软件工程技术研究开发中心基金资助项目
摘    要:为解决传统遗传算法早熟收敛和收敛速度慢的问题,提出一种基于强化学习的多策略选择遗传算法MPSGA。通过使用不同的选择策略将整个种群划分为3个子种群并分别进化,能提高种群的多样性,有效避免遗传算法的早熟收敛问题。将种群的多样性和算法的运行机制相结合,根据种群多样性的变化运用强化学习算法动态地优化各子种群间的比例参数,从而将种群多样性保持在合适的范围,一定程度上解决了收敛速度和全局收敛性之间的矛盾。实验结果表明,该算法在收敛精度和搜索效率上都表现出较好的性能。

关 键 词:遗传算法  多策略选择  强化学习  种群多样性  比例参数

Multiple Policy Selection Genetic Algorithm Based on Reinforcement Learning
WANG Xiao-yan,LIU Quan,FU Qi-ming,ZHANG Le. Multiple Policy Selection Genetic Algorithm Based on Reinforcement Learning[J]. Computer Engineering, 2011, 37(8): 149-152. DOI: 10.3969/j.issn.1000-3428.2011.08.051
Authors:WANG Xiao-yan  LIU Quan  FU Qi-ming  ZHANG Le
Affiliation:(School of Computer Science and Technology,Soochow University,Suzhou 215006,China)
Abstract:A new multiple policy selection Genetic Algorithm(GA) based on reinforcement learning is proposed to avoid the premature convergence and low speed of convergence.The whole population is divided into three sub-populations and each of them evolves respectively by using several different selection policies,which improves the diversity of population and avoids the premature convergence effectively.Population diversity is associated with the running mechanism of the algorithm,and the parameters of the sub-populations are optimized dynamically using reinforcement learning according to the variance diversity,which can maintain the population diversity in the appropriate range and it solves the contradiction between convergence speed and global convergence to a certain extent.Experimental results show that the algorithm has a high performance in precision of convergence and search efficiency.
Keywords:Genetic Algorithm(GA); multiple policy selection; reinforcement learning; population diversity; proportion parameter;
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号