Sequential anomaly detection based on temporal-difference learning: Principles,models and case studies期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Sequential anomaly detection based on temporal-difference learning: Principles,models and case studies

Authors:	Xin Xu

Affiliation:	1. Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, Canada, T6G 2V4;2. Department of Electrical and Computer Engineering, King Abdulaziz University, Jeddah, 21589, Saudi Arabia;3. Systems Research Institute, Polish Academy of Sciences, Newelska 6, 01-447 Warsaw, Poland;4. AQL Management Consulting Inc., Edmonton, Alberta T6J 2R8, Canada;1. Department of Computer Science, Makerere University, PO Box 7062, Kampala, Uganda;2. Department of Computer Science, Tokyo Institute of Technology, Tokyo 152-8552, Japan;1. Institute of Information Science, Beijing Jiaotong University, Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing 100044, China;2. School of Software Engineering, Beijing Jiaotong University, Beijing 100044, China

Abstract:	Anomaly detection is an important problem that has been popularly researched within diverse research areas and application domains. One of the open problems in anomaly detection is the modeling and prediction of complex sequential data, which consist of a series of temporally related behavior patterns. In this paper, a novel sequential anomaly detection method based on temporal-difference (TD) learning is proposed, where the anomaly detection problem of multi-stage cyber attacks is considered as an application case. A Markov reward process model is presented for the anomaly detection and alarming process of sequential data and it is verified that when the reward function is properly defined, the anomaly probabilities of sequential behaviors are equivalent to the value functions of the Markov reward process. Therefore, TD learning algorithms in the reinforcement learning literature can be used to efficiently construct anomaly detection models of complex sequential behaviors by estimating the value functions of the Markov reward process. Compared with other machine learning methods for anomaly detection, the proposed approach has the advantage of simplified labeling process using delayed evaluative signals and the prediction accuracy can be improved even if labeled training data are limited. Based on the experimental results on intrusion detection of host computers using system call data, it was shown that the proposed anomaly detection method can achieve higher or at least comparable detection accuracies than other approaches including SVMs, and HMMs.

Keywords:
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏