首页 | 本学科首页   官方微博 | 高级检索  
     

智能电网中基于增强学习的动态价格优化算法
引用本文:曹俊,孙莹莹,赵航.智能电网中基于增强学习的动态价格优化算法[J].太赫兹科学与电子信息学报,2023,21(1):112-118.
作者姓名:曹俊  孙莹莹  赵航
作者单位:国网河南省电力公司 驻马店供电公司,河南 驻马店 463000
摘    要:动态的电费价格是驱使消费者改变用电消费模式的有效手段,为此,提出基于增强学习的动态价格优化(RLODP)算法。RLODP算法结合电力服务商的利润和消费者的用电成本,对电网负载进行管理;利用增强学习算法,电力服务商自适应地决策零售价格,将动态价格问题转化为离散有限马尔可夫决策过程(MDP),再利用Q-学习算法解决该决策过程。实验结果表明,提出的RLODP算法减少了消费者的用电成本,实现了电网市场中电力供应与需求之间的平衡。

关 键 词:智能电网  动态需求  电价  增强学习  离散有限马尔可夫决策过程
收稿时间:2020/4/28 0:00:00
修稿时间:2020/10/1 0:00:00

Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid
CAO Jun,SUN Yingying,ZHAO Hang.Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid[J].Journal of Terahertz Science and Electronic Information Technology,2023,21(1):112-118.
Authors:CAO Jun  SUN Yingying  ZHAO Hang
Abstract:Dynamic pricing is one of the most effective ways to encourage customers to change their consumption pattern. Therefore, Reinforcement Learning-based Optimizing Dynamic Pricing(RLODP) algorithm is proposed for energy management in a hierarchical electricity market by considering both service provider''s profit and customers'' costs. Using Reinforcement Learning, the SP can adaptively determine the retail electricity price. Dynamic pricing problem is formulated as a discrete finite Markov Decision Process(MDP), and Q-learning is adopted to solve this decision-making problem. Simulation results show that the RLODP algorithm can reduce energy costs for customers, balance the energy supply and the demands in the electricity market.
Keywords:smart grid  demand response  electricity price  Reinforcement Learning  discrete finite Markov Decision Process
点击此处可从《太赫兹科学与电子信息学报》浏览原始摘要信息
点击此处可从《太赫兹科学与电子信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号