首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度强化学习的资源受限条件下的DIDS任务调度优化方法
引用本文:赵旭,黄光球,江晋,李巾.基于深度强化学习的资源受限条件下的DIDS任务调度优化方法[J].控制与决策,2022,37(11):3052-3057.
作者姓名:赵旭  黄光球  江晋  李巾
作者单位:西安建筑科技大学 管理学院,西安 710055;西安工程大学 电子信息学院,西安 710048;西安工程大学 人文学院,西安 710048;陕西省社会科学院,西安 710061
基金项目:国家自然科学基金项目(71874134);西安市科技计划项目(21XJZZ0024);陕西省教育厅专项项目(20JX014).
摘    要:在节点性能有限的边缘计算环境下进行分布式入侵检测系统(distributed intrusion detection system,DIDS)的任务分配,是一种典型的资源受限任务调度问题.针对该问题,提出基于深度强化学习的DIDS低负载任务调度方案.该方案将任务调度过程描述为马尔科夫决策过程(Markov decision process,MDP)并建立模型的相关空间和价值函数,找到保持DIDS低负载状态的最优策略.针对状态和动作空间过大且高维连续的问题,提出通过深度循环神经网络进行函数拟合.实验表明,所提出方案可使DIDS在网络变化中动态调节调度策略,保持系统整体的低负载,而安全指标没有明显降低.

关 键 词:资源受限  任务调度  深度强化学习  深度循环神经网络  入侵检测  边缘计算

An optimization method for DIDS task scheduling under resource- constrained conditions based on deep reinforcement learning
ZHAO Xu,HUANG Guang-qiu,JIANG Jin,LI Jin.An optimization method for DIDS task scheduling under resource- constrained conditions based on deep reinforcement learning[J].Control and Decision,2022,37(11):3052-3057.
Authors:ZHAO Xu  HUANG Guang-qiu  JIANG Jin  LI Jin
Affiliation:College of Management,Xián University of Architecture and Technology,Xián 710055,China;School of Electronic and Information,Xián Polytechnic University,Xián 710048,China;College of Humanities,Xián Polytechnic University,Xián 710048,China; Shaanxi Academy of Social Sciences,Xián 710061,China
Abstract:The task assignment of distributed intrusion detection systems(DIDS) in the edge computing environment with limited node performance is a typical resource-constrained task scheduling problem. To solve this problem, a DIDS low-load task scheduling scheme based on deep reinforcement learning is proposed. The task scheduling process is first described as a Markov decision process and the relevant space and value function of the model are established to find the optimal strategy for maintaining the low-load state of the DIDS. To solve the problem of excessively large action space and high-dimensional continuity, a deep recurrent neural network is proposed to perform function fitting. The experimental results show that the proposed scheme enables the DIDS to dynamically adjust the scheduling strategy during network changes, keeping the overall system load low, and the safety indicators are not significantly reduced.
Keywords:
点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号