首页 | 本学科首页   官方微博 | 高级检索  
     

风险调度中引入知识迁移的细菌觅食强化学习优化算法
引用本文:韩传家,张孝顺,余涛,瞿凯平. 风险调度中引入知识迁移的细菌觅食强化学习优化算法[J]. 电力系统自动化, 2017, 41(8): 69-77
作者姓名:韩传家  张孝顺  余涛  瞿凯平
作者单位:华南理工大学电力学院, 广东省广州市 510640,华南理工大学电力学院, 广东省广州市 510640,华南理工大学电力学院, 广东省广州市 510640,华南理工大学电力学院, 广东省广州市 510640
基金项目:国家重点基础研究发展计划(973计划)资助项目(2013CB228205);国家自然科学基金资助项目(51477055)
摘    要:针对电力系统运行过程中负荷及故障的不确定性,在经济调度中引入风险评估原理,并提出了一种全新的基于知识迁移的细菌觅食强化学习优化算法。该算法将细菌觅食算法的寻优模式与Q学习算法的试错迭代机制结合,利用多主体协同合作来更新共有的知识矩阵,并以基于知识延伸的维度缩减方式避免了"维数灾难"。在预学习获得最优知识矩阵后,利用知识迁移加速在线学习进程。IEEE RTS-79测试系统的仿真结果表明:所提算法在保证获得高质量最优解的同时,寻优速度可达经典智能算法的9~20倍,适合求解大规模复杂电网的风险调度快速优化。

关 键 词:知识迁移  细菌觅食  强化学习  风险调度
收稿时间:2016-06-19
修稿时间:2017-01-13

Optimization Algorithm of Reinforcement Learning Based Knowledge Transfer Bacteria Foraging for Risk Dispatch
HAN Chuanji,ZHANG Xiaoshun,YU Tao and QU Kaiping. Optimization Algorithm of Reinforcement Learning Based Knowledge Transfer Bacteria Foraging for Risk Dispatch[J]. Automation of Electric Power Systems, 2017, 41(8): 69-77
Authors:HAN Chuanji  ZHANG Xiaoshun  YU Tao  QU Kaiping
Affiliation:School of Electric Power, South China University of Technology, Guangzhou 510640, China,School of Electric Power, South China University of Technology, Guangzhou 510640, China,School of Electric Power, South China University of Technology, Guangzhou 510640, China and School of Electric Power, South China University of Technology, Guangzhou 510640, China
Abstract:Referring to the load and fault uncertainty during power system operation, the risk assessment theory is introduced into economic dispatch. Moreover, a new knowledge transfer bacteria foraging optimization(TBFO)algorithm is proposed for risk based economic dispatch, which is developed by combining bacteria foraging optimization(BFO)and the try-error mechanism of Q-learning. Besides, the knowledge matrix is updated by multiple agents with cooperative collaboration, in which the knowledge extension is adopted to handle the curve of dimension. After obtaining all the optimal knowledge matrices, the convergence of the online learning can be accelerated by knowledge transfer. The performance of TBFO has been fully tested for risk based economic dispatch on the IEEE RTS-79. The simulation demonstrates that the convergence rate of TBFO can be approximately 9 to 20 times faster than that of classical intelligent algorithm, while the quality of the obtained optimal solution can be guaranteed. Hence, it is suitable for fast risk based economic dispatch of large-scale and complex power grid. This work is supported by National Basic Research Program of China(973 Program)(No. 2013CB228205)and National Natural Science Foundation of China(No. 51477055).
Keywords:knowledge transfer   bacteria foraging   reinforcement learning   risk dispatch
本文献已被 CNKI 等数据库收录!
点击此处可从《电力系统自动化》浏览原始摘要信息
点击此处可从《电力系统自动化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号