
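Outlier detection method based on hierarchical clustering
Cite this article: LIANG Bin-mei. Outlier detection method based on hierarchical clustering[J]. Computer Engineering and Applications, 2009, 45(32): 117-119.
Author: LIANG Bin-mei
Affiliation: College of Mathematics and Information Science, Guangxi University, Nanning 530004
Abstract: Outlier detection is an important part of the data mining process. This paper proposes an Outlier Detection method based on Hierarchical Clustering (ODHC). ODHC analyzes the result of a hierarchical clustering: it scans the distance matrix in descending order of inter-cluster distance and reports outliers of a specified degree of isolation, continuing until the user's requirement on data concentration is met. The method applies to multi-dimensional data sets, its principle is intuitive, it is user-friendly, and its detection accuracy is relatively high. Simulation experiments on data sets such as iris and balloon show that ODHC identifies outliers effectively and is a simple, practical outlier detection method.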

Keywords: outlier detection  hierarchical clustering  data preprocessing  data mining
Received: 2009-8-14
Revised: 2009-9-18

Outlier detection method based on hierarchical clustering
LIANG Bin-mei. Outlier detection method based on hierarchical clustering[J]. Computer Engineering and Applications, 2009, 45(32): 117-119.
Authors: LIANG Bin-mei
Affiliation: College of Mathematics and Information Science, Guangxi University, Nanning 530004, China
Abstract: Outlier detection is an important step in data mining. This paper proposes a new Outlier Detection method based on Hierarchical Clustering (ODHC). ODHC analyzes the result of hierarchical clustering and examines the distance matrix in descending order of inter-cluster distance, detecting outliers of a specified degree of isolation until the user's requirement on data concentration is met. The method is applicable to multi-dimensional data sets; its principle is intuitive, it is user-friendly, and it detects outliers with high accuracy. Experimental results on the iris and balloon data sets show that ODHC identifies outliers effectively and is a simple, practical outlier detection method.
Keywords: outlier detection  hierarchical clustering  data preprocessing  data mining
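
The abstract describes the idea only in outline, so the following is a minimal sketch of that general idea rather than the paper's ODHC algorithm. It assumes single-linkage agglomerative clustering and Euclidean distance, and it flags small clusters that join the rest of the data only at a large merge distance. The function names and the parameters `min_distance` and `max_size` are hypothetical stand-ins for the "degree of isolation" the abstract mentions.

```python
import math
from itertools import combinations


def single_linkage_merges(points):
    """Agglomerative clustering with single linkage (assumed linkage).

    Returns one (distance, left_members, right_members) tuple per merge,
    in the order the merges happen. Members are indices into `points`.
    """
    clusters = {i: [i] for i in range(len(points))}
    merges = []
    while len(clusters) > 1:
        # Find the pair of clusters with the smallest inter-cluster distance.
        best = None
        for a, b in combinations(sorted(clusters), 2):
            d = min(math.dist(points[i], points[j])
                    for i in clusters[a] for j in clusters[b])
            if best is None or d < best[0]:
                best = (d, a, b)
        d, a, b = best
        merges.append((d, list(clusters[a]), list(clusters[b])))
        clusters[a] = clusters[a] + clusters.pop(b)
    return merges


def detect_outliers(points, min_distance, max_size=1):
    """Flag small clusters that merge only at a large distance.

    min_distance and max_size are hypothetical knobs, not taken from the paper.
    """
    outliers = set()
    merges = sorted(single_linkage_merges(points),
                    key=lambda m: m[0], reverse=True)
    # Walk the merges from the largest inter-cluster distance downwards.
    for d, left, right in merges:
        if d < min_distance:
            break  # every remaining merge is tighter than the threshold
        smaller = left if len(left) <= len(right) else right
        if len(smaller) <= max_size:
            outliers.update(smaller)
    return sorted(outliers)
```

For example, with `points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (10.0, 10.0)]`, calling `detect_outliers(points, min_distance=5.0)` returns `[4]`, the index of the isolated point. The pairwise search is cubic or worse in the number of points, which is acceptable for small data sets such as iris but not for large ones.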
This article is indexed in VIP (维普), Wanfang Data (万方数据), and other databases.