首页 | 本学科首页   官方微博 | 高级检索  
     

基于网格查询的局部离群点检测算法
引用本文:牛少章,欧毓毅,凌捷,顾国生.基于网格查询的局部离群点检测算法[J].计算机工程与应用,2019,55(17):89-94.
作者姓名:牛少章  欧毓毅  凌捷  顾国生
作者单位:广东工业大学 计算机学院,广州,510006;广东工业大学 计算机学院,广州,510006;广东工业大学 计算机学院,广州,510006;广东工业大学 计算机学院,广州,510006
基金项目:广东省科技计划基金;广州市科技计划项目
摘    要:针对基于密度的局部离群因子算法(LOF),需要计算距离矩阵来进行k近邻查寻,算法时间复杂度高,不适合大规模数据集检测的问题,提出基于网格查询的局部离群点检测算法。算法利用距离目标网格中的数据点最近的k个其他数据点,一定在该目标网格或在该目标网格的最近邻接网格中这一特性,来改进LOF算法的邻域查询操作,以此减少LOF算法在邻域查询时的计算量。实验结果证明,提出的LOGD算法在与原LOF算法具有基本相同的检测准确率的情况下,能够有效地降低离群点检测的时间。

关 键 词:局部离群因子  K近邻  距离矩阵  网格  记忆性  邻域查询

Local Outlier Detection Algorithm Based on Grid Query
NIU Shaozhang,OU Yuyi,LING Jie,GU Guosheng.Local Outlier Detection Algorithm Based on Grid Query[J].Computer Engineering and Applications,2019,55(17):89-94.
Authors:NIU Shaozhang  OU Yuyi  LING Jie  GU Guosheng
Affiliation:Faculty of Computer, Guangdong University of Technology, Guangzhou 510006, China
Abstract:In view of the density based Local Outlier Factor(LOF) algorithm, it is necessary to calculate the distance matrix for the k] nearest neighbor search, but the time complexity of this algorithm is high, and it is not suitable for the large-scale data set detection. The local outlier detection algorithm based on the grid query(LOGD) is proposed. This algorithm uses the characteristic that the nearest k] data points from the data points in the target grid, which must be memorable in the target grid or the nearest adjacent grid of the target grid, to improve the neighborhood query operation of the LOF algorithm, and reduce the distance calculation from the neighborhood query. Experimental results show that the algorithm has the same detection accuracy as the original LOF algorithm, and effectively reduces the time of outlier detection.
Keywords:local outlier factor  [k]-nearest neighbors  distance matrix  grid  memory  neighborhood query  
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号