首页 | 本学科首页   官方微博 | 高级检索  
     

动态电源管理的随机切换模型与在线优化
引用本文:江琦, 奚宏生, 殷保群. 动态电源管理的随机切换模型与在线优化. 自动化学报, 2007, 33(1): 66-71. doi: 10.1360/aas-007-0066
作者姓名:江琦  奚宏生  殷保群
作者单位:1.中国科学技术大学自动化系 合肥 230027
基金项目:国家自然科学基金;国家高技术研究发展计划(863计划);安徽省自然科学基金
摘    要:考虑系统参数未知情况下的动态电源管理问题,提出一种基于强化学习的在线策略优化算法. 通过建立事件驱动的随机切换分析模型,将动态电源管理问题转化为带约束的Markov 决策过程的策略优化问题. 利用此模型的动态结构特性,结合在线学习估计梯度与随机逼近改进策略,提出动态电源管理策略的在线优化算法.随机切换模型对电源管理系统的动态特性描述精确,在线优化算法自适应性强,运算量小,精度高,具有较高的实际应用价值.

关 键 词:动态电源管理   Markov决策过程   强化学习   梯度估计   随机逼近   在线优化
收稿时间:2005-10-20
修稿时间:2006-07-15

Stochastic Switching Model and Policy Optimization Online for Dynamic Power Management
JIANG Qi, XI Hong-Sheng, YIN Bao-Qun. Stochastic Switching Model and Policy Optimization Online for Dynamic Power Management. ACTA AUTOMATICA SINICA, 2007, 33(1): 66-71. doi: 10.1360/aas-007-0066
Authors:JIANG Qi  XI Hong-Sheng  YIN Bao-Qun
Affiliation:1. Department of Automation, University of Science and Technology of China, Hefei, 230027
Abstract:A reinforcement learning based online optimization algorithm is presented for dynamic power management with unknown system parameters. First an event-driven stochastic switching model is introduced to formulate dynamic power management problem as a constrained policy optimization problem. Then by utilizing the features of this model an online optimization algorithm that combines policy gr.,adient estimation and stochastic approximation is derived. The stochastic switching model captures the power-managed system behaves accurately. The optimization algorithm is adaptive, and can achieve global optimum with less computational cost. Simulation results demonstrate the effectiveness of the proposed approach.
Keywords:Dynamic power management   Markov decision processes   reinforcement learning   gradient estimation   stochastic approximation   on-line optimization
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号