DetectA: abrupt concept drift detection in non-stationary environments
Affiliation: 1. School of Control Science and Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China; 2. School of Innovation Experiment, Dalian University of Technology, Dalian 116024, China; 3. School of Software, Dalian University of Technology, Dalian 116620, China; 4. Department of Internet of Things Engineering, Hohai University, Changzhou 213022, China; 1. Centro de Informática, Universidade Federal de Pernambuco, Cidade Universitária, 50.740-560, Recife-PE, Brazil; 2. Instituto Federal de Educação, Ciência e Tecnologia de Pernambuco, Cidade Universitária, 50.740-540, Recife-PE, Brazil
Abstract: Almost all drift detection mechanisms designed for classification problems work reactively: after receiving the complete data set (input patterns and class labels), they apply a sequence of procedures to identify a change in the class-conditional distribution, that is, a concept drift. However, detecting a change only after it has occurred can, in some situations, be harmful to the process under analysis. This paper proposes a proactive approach to abrupt drift detection, called DetectA (Detect Abrupt Drift). Briefly, the method is composed of three steps: (i) label the patterns of the test set (an unlabelled data block) using an unsupervised method; (ii) compute statistics from the training and test sets, conditioned on the class labels given for the training set; and (iii) compare the training and test statistics using a multivariate hypothesis test. Based on the results of the hypothesis tests, the method attempts to detect drift in the test set before the real labels are obtained. A procedure for creating datasets with abrupt drift is also proposed and used to perform a sensitivity analysis of the DetectA model. The results of the sensitivity analysis suggest that the detector is efficient and suitable for high-dimensional datasets, for blocks with any proportion of drift, and for datasets with class imbalance. The performance of DetectA under different configurations was also evaluated on real and artificial datasets, using an MLP as the classifier. The best results were obtained using one of the detection methods, with the proactive mode a top contender for improving the accuracy of the underlying base classifier.
Keywords: Concept drift; Drift detection; Proactive approach
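The three steps listed in the abstract can be illustrated with a minimal sketch. The abstract does not say which unsupervised method, which statistics, or which multivariate test DetectA uses, so everything concrete below is an assumption made for illustration: k-means seeded with the training class centroids stands in for step (i), per-class mean vectors stand in for the statistics of step (ii), and a permutation test on the distance between mean vectors (with a Bonferroni correction across classes) stands in for the multivariate hypothesis test of step (iii). The function names `pseudo_label`, `permutation_pvalue`, and `detect_drift` are hypothetical and are not taken from the paper.

```python
import random
from statistics import fmean


def centroid(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    return [fmean(col) for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def pseudo_label(test_X, seed_centroids, n_iter=10):
    """Step (i), assumed method: k-means seeded with the training class centroids."""
    cents = dict(seed_centroids)
    labels = []
    for _ in range(n_iter):
        # Assign every test pattern to the nearest current centroid.
        labels = [min(cents, key=lambda c: sq_dist(x, cents[c])) for x in test_X]
        # Move each centroid to the mean of the patterns assigned to it.
        for c in cents:
            members = [x for x, lab in zip(test_X, labels) if lab == c]
            if members:
                cents[c] = centroid(members)
    return labels


def permutation_pvalue(a, b, n_perm=500, seed=0):
    """Step (iii), stand-in test: permutation test for equal mean vectors."""
    rng = random.Random(seed)
    observed = sq_dist(centroid(a), centroid(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        stat = sq_dist(centroid(pooled[:len(a)]), centroid(pooled[len(a):]))
        if stat >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


def detect_drift(train_X, train_y, test_X, alpha=0.05):
    """Flag drift in an unlabelled test block before its true labels arrive."""
    classes = sorted(set(train_y))
    by_class = {c: [x for x, y in zip(train_X, train_y) if y == c] for c in classes}
    # Step (ii), assumed statistic: one mean vector per training class.
    train_centroids = {c: centroid(rows) for c, rows in by_class.items()}
    labels = pseudo_label(test_X, train_centroids)
    pvalues = {}
    for c in classes:
        block = [x for x, lab in zip(test_X, labels) if lab == c]
        if len(block) >= 2:  # skip classes with too few pseudo-labelled patterns
            pvalues[c] = permutation_pvalue(by_class[c], block)
    # Bonferroni correction across the per-class tests (an assumed choice).
    threshold = alpha / max(len(pvalues), 1)
    return any(p < threshold for p in pvalues.values()), pvalues
```

Calling `detect_drift(train_X, train_y, test_X)` returns a drift flag together with the per-class p-values, so a caller could retrain the base classifier as soon as the flag is raised rather than waiting for the true labels. Comparing only mean vectors keeps the sketch short; a test that also accounts for covariance would be closer to what "multivariate hypothesis test" usually implies.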
This article is indexed in ScienceDirect and other databases.