首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度期望Q网络算法的微电网能量管理策略
引用本文:冯昌森,张瑜,文福拴,叶承晋,张有兵. 基于深度期望Q网络算法的微电网能量管理策略[J]. 电力系统自动化, 2022, 0(3): 14-22
作者姓名:冯昌森  张瑜  文福拴  叶承晋  张有兵
作者单位:1. 浙江工业大学信息工程学院;2. 浙江大学电气工程学院
基金项目:国家自然科学基金资助项目(51777193);
摘    要:随着光伏发电在微电网中的渗透率不断提高,其发电出力的不确定性和时变性为微电网的经济运行带来了挑战.在构建经济调度模型时,就需要适当模拟不确定变量并相应地发展高效求解算法.在此背景下,文中提出能够有效计及不确定性因素的深度强化学习算法,以实时求解微电网的优化运行问题.首先,采用马尔可夫决策过程对微电网优化运行问题进行建模...

关 键 词:光伏发电  不确定性建模  深度强化学习  贝叶斯神经网络  双深度期望Q网络

Energy Management Strategy for Microgrid Based on Deep Expected Q Network Algorithm
FENG Changsen,ZHANG Yu,WEN Fushuan,YE Chengjin,ZHANG Youbing. Energy Management Strategy for Microgrid Based on Deep Expected Q Network Algorithm[J]. Automation of Electric Power Systems, 2022, 0(3): 14-22
Authors:FENG Changsen  ZHANG Yu  WEN Fushuan  YE Chengjin  ZHANG Youbing
Affiliation:(College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023,China;College of Electrical Engineering,Zhejiang University,Hangzhou 310007,China)
Abstract:With the increasing penetration of photovoltaic units in microgrids,the uncertainty and time-variability of its power generation have brought challenges to the economic operation of microgrids.When constructing an economic dispatch model,it is necessary to properly simulate the uncertain variables and develop the efficient solving algorithms accordingly.In this context,this paper proposes a deep reinforcement learning algorithm that can effectively account for uncertain factors to solve the problem of optimal operation of microgrids in real time.Firstly,the Markov decision process(MDP)is used to model the optimization operation of microgrids,and the real-time reward function is used to replace the objective functions and constraint conditions,and the interaction with the environment is used to find the optimal strategy.Secondly,the uncertain learning environment is modeled with the help of Bayesian neural networks,and then the stochastic process of state transfer is effectively considered in the MDP.Therefore,a double depth expected Q network algorithm is proposed.By considering the randomness of the state transfer,the Q iteration rules of the general deep Q network algorithm are optimized to significantly improve the convergence speed of the algorithm.Finally,a case is used to verify the effectiveness of the proposed model and algorithm.
Keywords:photovoltaic generation  uncertainty modeling  deep reinforcement learning  Bayesian neural network  double deep expected Q network
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号