首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于层次聚类的全局孤立点识别方法*
引用本文:梁斌梅,韦琳娜,宋庆祯.一种基于层次聚类的全局孤立点识别方法*[J].计算机应用研究,2011,28(5):1731-1733.
作者姓名:梁斌梅  韦琳娜  宋庆祯
作者单位:1. 广西大学,数学与信息科学学院,南宁,530004;四川大学,计算机学院,成都,610065
2. 广西大学,数学与信息科学学院,南宁,530004
基金项目:广西大学科研基金资助项目(XJZ100258)
摘    要:现有的孤立点检测算法在通用性、有效性、用户友好性及处理高维大数据集的性能还不完善,为此提出一种快速有效的基于层次聚类的全局孤立点检测方法。该方法基于层次聚类的结果,根据聚类树和距离矩阵可视化判断数据孤立程度,并确定孤立点数目。从聚类树自顶向下,无监督地去除孤立点。仿真实验验证了方法能快速有效识别全局孤立点,具有用户友好性,适用于不同形状的数据集,可用于大型高维数据集的孤立点检测。

关 键 词:孤立点检测    层次聚类    数据挖掘    全局孤立点
收稿时间:2010/10/11 0:00:00
修稿时间:2011/4/14 0:00:00

Global outlier detection based on hierarchical clustering
LIANG Bin-mei,WEI Lin-n,SONG Qing-zhen.Global outlier detection based on hierarchical clustering[J].Application Research of Computers,2011,28(5):1731-1733.
Authors:LIANG Bin-mei  WEI Lin-n  SONG Qing-zhen
Affiliation:(1. College of Mathematics & Information Science, Guangxi University, Nanning 530004, China;2. College of Computer Science, Sichuan University, Chengdu 610065, China)
Abstract:The existing outlier detection algorithms should be improved due to their versatility,effectiveness,user-friendliness,and the performance in processing high-dimensional and large databases.A fast and effective hierarchical clustering based global outlier detection approch is proposed in this paper. Agglomerative hierarchical clustering is performed firstly,and then the isolated degree of the data can be visually judged and the number of the outliers can be determined based on the clustering tree and the distance matrix.After that, the outliers is identified unsupervisedly from the top to down of the clustering tree.Experimental results show that,this approch can identify global outliers fastly and effectively,and is user-friendly and capable at datasets of various shapes.Experiments also illustrate that this approach is suitable for use on high-dimensional and large databases.
Keywords:outlier detection  hierarchical clustering  data mining  global outlier
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号