首页 | 本学科首页   官方微博 | 高级检索  
     

云工作流中基于多任务时序卷积网络的异常检测方法
引用本文:姚杰,程春玲,韩静,刘峥.云工作流中基于多任务时序卷积网络的异常检测方法[J].计算机应用,2021,41(6):1701-1708.
作者姓名:姚杰  程春玲  韩静  刘峥
作者单位:1. 南京邮电大学 计算机学院, 南京 210023;2. 中兴通讯股份有限公司 上海研发中心, 上海 201203
基金项目:国家重点研发计划项目(2018YFB1003702);中兴通讯股份有限公司产学研合作项目;南京邮电大学国家自然科学基金孵化项目(NY219084)。
摘    要:云计算数据中心在日常部署和运行过程中产生的大量日志可以帮助系统运维人员进行异常分析。路径异常和时延异常是云工作流中常见的异常。针对传统的异常检测方法分别对两种异常检测任务训练相应的学习模型,而忽略了两种异常检测任务之间的关联性,导致异常检测准确率下降的问题,提出了一种基于多任务时序卷积网络的日志异常检测方法。首先,基于日志流的事件模板,生成事件序列和时间序列;然后,训练基于多任务时序卷积网络的深度学习模型,该模型通过共享时序卷积网络中的浅层部分来从系统正常执行的流程中并行地学习事件和时间特征;最后,对云计算工作流中的异常进行分析,并设计了相关异常检测逻辑。在OpenStack数据集上的实验结果表明,与日志异常检测的领先算法DeepLog和基于主成分分析(PCA)的方法比较,所提方法的异常检测准确率至少提升了7.7个百分点。

关 键 词:异常检测  日志分析  时序卷积网络  多任务学习  云工作流  
收稿时间:2020-09-07
修稿时间:2020-12-17

Anomaly detection method based on multi-task temporal convolutional network in cloud workflow
YAO Jie,CHENG Chunling,HAN Jing,LIU Zheng.Anomaly detection method based on multi-task temporal convolutional network in cloud workflow[J].journal of Computer Applications,2021,41(6):1701-1708.
Authors:YAO Jie  CHENG Chunling  HAN Jing  LIU Zheng
Affiliation:1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210023, China;2. Shanghai Research and Development Center, Zhongxing Telecommunication Equipment Corporation, Shanghai 201203, China
Abstract:Numerous logs generated during the daily deployment and operation process in cloud computing platforms help system administrators perform anomaly detection. Common anomalies in cloud workflow include pathway anomalies and time delay anomalies. Traditional anomaly detection methods train the learning models corresponding to the two kinds of anomaly detection tasks respectively and ignore the correlation between these two tasks, which leads to the decline of the accuracy of anomaly detection. In order to solve the problems, an anomaly detection method based on multi-task temporal convolutional network was proposed. Firstly, the event sequence and time sequence were generated based on the event templates of log stream. Then, the deep learning model based on the multi-task temporal convolutional network was trained. In the model, the event and the time characteristics were learnt in parallel from the normal system execution processes by sharing the shallow layers of the temporal convolutional network. Finally, the anomalies in the cloud computing workflow were analyzed, and the related anomaly detection logic was designed. Experimental results on the OpenStack dataset demonstrate that, the proposed method improves the anomaly detection accuracy at least by 7.7 percentage points compared to the state-of-art log anomaly detection algorithm DeepLog and the method based on Principal Component Analysis (PCA).
Keywords:anomaly detection  log analysis  temporal convolutional network  multi-task learning  cloud workflow  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号