首页 | 本学科首页   官方微博 | 高级检索  
     

基于k-近邻域中心偏移的鲁棒性异常检测算法
引用本文:赵建龙,曲桦,赵季红.基于k-近邻域中心偏移的鲁棒性异常检测算法[J].北京邮电大学学报,2017,40(4):54-59.
作者姓名:赵建龙  曲桦  赵季红
作者单位:西安交通大学软件学院,西安,710049;西安交通大学软件学院,西安710049;西安交通大学电子与信息工程学院,西安710049;西安交通大学电子与信息工程学院,西安710049;西安邮电大学通信与信息工程学院,西安710061
摘    要:针对大多数基于距离和密度的异常检测算法敏感于近邻参数k的问题,提出了一种鲁棒性异常检测标准——k-近邻域中心偏移异常因子(COOF).数据结点的k-近邻域中心位置会随着近邻参数k的变化而发生迁移,鉴于异常结点要比正常结点对k-近邻域中心位置偏移量的影响更大,通过累加因递增k而产生的偏移量来表征数据结点的异常程度,并在COOF基础上实现了鲁棒性的异常检测算法.通过综合数据和真实数据的实验仿真可知,COOF不仅对近邻参数k具有鲁棒性,而且相比基于距离的k最近邻算法、基于局部距离的异常因子和基于密度的局部异常因子具有更稳定且更准确的异常检测性能.

关 键 词:异常检测  k最近邻  局部异常因子  中心偏移异常因子

Robust Outlier Detection Algorithm Based on k-Nearest Neighbor Region Center Migration
Abstract:Considering the distance-and density-based outlier detection algorithms are often sensitive to a nearest neighbor parameter k,termed k-center offset outlier factor (COOF),a robust outlier detection criterion for the characterization of abnormal degree of each data object was proposed.Each data object is included in a region within its k nearest neighbors,and the center of region will migrate with the change of nearest neighbor parameter k.In general,the variation of center offset of k nearest neighbor region is greater for an outlier than a normal object.According to this observation,for each data object,COOF is defined as the accumulation of this kind of offset when increasing the nearest neighbor parameter from one to k.Finally,the outlier detection algorithm based on COOF was also presented.Through artificial data and real data experimental simulations show that COOF is insensitive to parameter k,and has more stable and accurate outlier detection performance compared to k nearest neighbor,local distance-based outlier factor and local outlier factor,which are the distance-based method and density-based method respectively.
Keywords:outlier detection  k nearest neighbor  local outlier factor  center offset outlier factor
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号