首页 | 本学科首页   官方微博 | 高级检索  
     

基于最大局部密度间隔的特征选择方法
引用本文:娄睿,蒋烈辉,王奕森.基于最大局部密度间隔的特征选择方法[J].计算机工程与设计,2019,40(3):699-705.
作者姓名:娄睿  蒋烈辉  王奕森
作者单位:信息工程大学 四院,河南 郑州,450002;信息工程大学 四院,河南 郑州 450002;信息工程大学 数学工程与先进计算国家重点实验室,河南 郑州 450002
基金项目:国家自然科学基金;国家自然科学基金;河南省科技攻关计划
摘    要:针对虚拟机数据特点及特征筛选问题,借鉴局部异常因子算法中的"局部"思想,提出基于最大局部密度间隔的特征评估准则,通过最大化正常数据和异常数据的局部密度差异选出有效的特征子集;结合顺序后退搜索策略与提出的特征评估准则设计相应的特征选择算法,筛选出有利于分类的虚拟机特征。实验结果表明,所设计的特征选择算法能够有效处理虚拟机的类不平衡数据,筛选出重要的虚拟机数据特征,使数据的检测率和可理解性得到有效提升,相比现有算法具有更好分类效果与更强适用性,在相同条件下具有更小的计算开销。

关 键 词:类不平衡数据  特征选择  局部密度间隔  局部异常因子  评估准则

Feature selection method based on maximum local density margin
LOU Rui,JIANG Lie-hui,WANG Yi-sen.Feature selection method based on maximum local density margin[J].Computer Engineering and Design,2019,40(3):699-705.
Authors:LOU Rui  JIANG Lie-hui  WANG Yi-sen
Affiliation:(Fourth Department,Information Engineering University,Zhengzhou 450002,China;State Key Laboratory of Mathematical Engineering and Advanced Computing,Information Engineering University,Zhengzhou 450002,China)
Abstract:For the characteristics of virtual machine data and the problem of its feature selection, using the local method of local outlier factor, the feature evaluation criterion based on maximum local density margin was proposed. The effective feature subsets were picked out by maximizing the density between normal and abnormal data. Combining the proposed criterion with sequential backward search algorithm, the corresponding feature selection algorithm was designed that might single out the virtual machine features which were beneficial to classification. The test results show that the proposed feature selection algorithm can effectively deal with the class-imbalanced data and single out the important features of virtual machine, with improvement of detection rate and understandability of virtual machine data. Compared with existing algorithms of feature selection, the proposed algorithm has better classification effects and stronger applicability with less computational cost under the same conditions.
Keywords:class-imbalanced data  feature selection  local density margin  local outlier factor  evaluation criterion
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号