首页 | 本学科首页   官方微博 | 高级检索  
     

基于事件驱动深度强化学习的建筑热舒适控制
引用本文:李竹,傅启明,丁正凯,刘璐,张颖,陈建平. 基于事件驱动深度强化学习的建筑热舒适控制[J]. 计算机应用研究, 2024, 41(2)
作者姓名:李竹  傅启明  丁正凯  刘璐  张颖  陈建平
作者单位:苏州科技大学,苏州科技大学,苏州科技大学,苏州科技大学,苏州科技大学,苏州科技大学
基金项目:国家重点研发计划资助项目(2020YFC2006602);国家自然科学基金资助项目(62102278,62172324,61876217,61876121);江苏省高等学校自然科学研究项目(21KJA520005);江苏省重点研发计划资助项目(BE2020026);江苏省自然科学基金资助项目(BK20190942);江苏省研究生教育教学改革项目
摘    要:住宅暖通空调系统通常耗用大量能源,同时也极大地影响居住者的热舒适性。目前,强化学习广泛应用于优化暖通空调系统,然而这一方法需要投入大量时间和数据资源。为了解决该问题,提出了一个新的基于事件驱动的马尔可夫决策过程(event-driven Markov decision process,ED-MDP)框架,并在此基础上,提出了基于事件驱动的深度确定性策略梯度(event-driven deep deterministic policy gradient,ED-DDPG)方法,通过事件触发优化控制,结合强化学习算法求解最优控制策略。实验结果显示,与基准方法相比,ED-DDPG在提升学习速度和减少决策频率方面表现出色,并在节能和维持热舒适方面取得了显著成果。经过实验验证,该方法在优化住宅暖通空调控制方面展现出强大的鲁棒性和适应性。

关 键 词:强化学习   事件驱动   暖通空调   住宅建筑   热舒适
收稿时间:2023-06-14
修稿时间:2024-01-14

Event-driven reinforcement learning thermal comfort control for residential buildings
Li Zhu,Fu Qiming,Ding Zhengkai,Liu Lu,Zhang Ying and Chen Jianping. Event-driven reinforcement learning thermal comfort control for residential buildings[J]. Application Research of Computers, 2024, 41(2)
Authors:Li Zhu  Fu Qiming  Ding Zhengkai  Liu Lu  Zhang Ying  Chen Jianping
Affiliation:Suzhou University of Science and Technology,,,,,
Abstract:Residential HVAC systems typically constitute a substantial portion of energy consumption and exert a significant influence on occupants'' thermal comfort. At present, reinforcement learning is widely employed to optimize HVAC systems; however, this approach necessitates a substantial investment of time and data resources. To address this issue, this paper proposed a novel framework based on an event-driven Markov decision process(ED-MDP) and further introduce an event-driven deep deterministic policy gradient(ED-DDPG) method. This approach amalgamated reinforcement learning algorithms to deduce optimal control policies through event-triggered optimization. The experimental results demonstrate that ED-DDPG excels in enhancing learning speed and reducing decision frequency compared to the benchmark method. Furthermore, it attains notable accomplishments in energy conservation and sustaining thermal comfort. Following comprehensive testing and validation, the method showcases robustness and adaptability in optimizing residential HVAC control.
Keywords:reinforcement learning   event-driven   HAVC   residential buildings   thermal comfort
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号