High Dimensional Clustering Algorithm Based on Local Significant Units
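Cite this article: Zong Yu, Li Ming-Chu, Xu Guan-Dong, Zhang Yan-Chun. High Dimensional Clustering Algorithm Based on Local Significant Units[J]. Journal of Electronics & Information Technology, 2010, 32(11): 2707-2712. doi: 10.3724/SP.J.1146.2009.01589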
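Authors: Zong Yu, Li Ming-chu, Xu Guan-dong, Zhang Yan-chun
Affiliations: 1. School of Software, Dalian University of Technology, Dalian 116621, China; Center of Applied Information, Victoria University, Melbourne VIC3011, Australia
2. School of Software, Dalian University of Technology, Dalian 116621, China
3. Center of Applied Information, Victoria University, Melbourne VIC3011, Australia
Funding: Key Program of the National Natural Science Foundation of China; National 973 Program; Australian ARC project; Key Project of the Anhui Provincial Department of Education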
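Abstract: High-dimensional clustering algorithms built on grid density units of equal or random width cannot guarantee the quality of clustering results on complex data sets. Drawing on kernel density estimation and spatial statistical theory, this paper presents a high-dimensional clustering algorithm based on local significant units for clustering complex high-dimensional data. The method defines the local significant unit structure, grounded in local kernel density estimation and spatial statistical theory, to capture the local data distribution; it designs a greedy algorithm that quickly discovers the local significant regions covering the data distribution; and it runs the Single-linkage algorithm on local significant units that share the same attribute subset to find the clusters among them. Experimental results show that the algorithm can discover high-quality clustering results hidden in complex data sets.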
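The abstract does not give the formal definition of a local significant unit, so the following is only a minimal sketch of the underlying idea on a single attribute: estimate the density with a Gaussian kernel and keep the intervals where it clearly exceeds what a uniform distribution over the attribute's range would give. The function names, the `bandwidth`, `grid_size` and `factor` parameters, and the simple threshold rule (a stand-in for the paper's spatial statistical test) are all assumptions made for illustration.

```python
import math


def gaussian_kde_1d(samples, x, bandwidth):
    """Gaussian kernel density estimate of `samples` evaluated at point x."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))
    return norm * sum(
        math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
    )


def dense_intervals(samples, bandwidth=0.5, grid_size=50, factor=1.5):
    """Return (start, end) intervals whose estimated density exceeds
    `factor` times the uniform density over [min, max].

    The fixed threshold is an illustrative assumption, not the
    significance test used in the paper.
    """
    lo, hi = min(samples), max(samples)
    uniform = 1.0 / (hi - lo)
    step = (hi - lo) / (grid_size - 1)
    grid = [lo + i * step for i in range(grid_size)]

    intervals, start, prev = [], None, grid[0]
    for g in grid:
        dense = gaussian_kde_1d(samples, g, bandwidth) > factor * uniform
        if dense and start is None:
            start = g                        # a dense run begins here
        elif not dense and start is not None:
            intervals.append((start, prev))  # the run ended at the previous point
            start = None
        prev = g
    if start is not None:
        intervals.append((start, grid[-1]))
    return intervals
```

On data with two tight groups of values separated by a sparse gap, this returns one interval per group, which is the kind of data-driven region that a fixed equal-width grid can split or dilute.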

Keywords: clustering analysis; high-dimensional clustering algorithm; kernel density estimation; local significant unit
Received: 2009-12-11
Revised: 2010-05-20

High Dimensional Clustering Algorithm Based on Local Significant Units
Zong Yu, Li Ming-Chu, Xu Guan-Dong, Zhang Yan-Chun. High Dimensional Clustering Algorithm Based on Local Significant Units[J]. Journal of Electronics & Information Technology, 2010, 32(11): 2707-2712. doi: 10.3724/SP.J.1146.2009.01589
Authors:Zong Yu  Li Ming-chu  Xu Guan-dong  Zhang Yan-chun
Affiliation:(School of Software, Dalian University of Technology, Dalian 116621, China)
(Center of Applied Information, Victoria University, Melbourne VIC3011, Australia)
Abstract:High-dimensional clustering algorithms built on equal-width or random-width density grids cannot guarantee high-quality clustering results on complex data sets. This paper proposes a High dimensional Clustering algorithm based on Local Significant Units (HC_LSU) to address this problem, drawing on kernel density estimation and spatial statistical theory. First, a structure called the Local Significant Unit (LSU) is defined through local kernel density estimation and a spatial statistical test. Second, a greedy algorithm, the Greedy Algorithm for LSU (GA_LSU), quickly finds the local significant units in the data set. Finally, the single-linkage algorithm is run on the local significant units that share the same attribute subset to generate the clustering results. Experiments on 4 synthetic and 6 real-world data sets show that HC_LSU effectively finds high-quality clustering results in highly complex data sets.
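The final step described above, single-linkage over units that share an attribute subset, can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: each unit is reduced to a hypothetical `(attribute_subset, center)` pair, distance is Euclidean between centers, and the hierarchy is cut at a hypothetical threshold `cut`. Cutting a single-linkage hierarchy at a distance threshold is equivalent to taking connected components of the graph that links every pair within that distance, which is what the union-find loop computes.

```python
import math
from collections import defaultdict


def single_linkage(points, cut):
    """Single-linkage clusters at distance `cut`, as lists of point indices."""
    parent = list(range(len(points)))

    def find(i):
        # Follow parent links to the root, halving the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) <= cut:
                parent[find(i)] = find(j)  # merge the two components

    groups = defaultdict(list)
    for i in range(len(points)):
        groups[find(i)].append(i)
    return sorted(groups.values())


def cluster_units(units, cut):
    """Group units by attribute subset, then cluster each group separately.

    `units` is a list of (attribute_subset, center) pairs; this layout is
    an assumption made for the example.
    """
    by_subspace = defaultdict(list)
    for idx, (attrs, center) in enumerate(units):
        by_subspace[frozenset(attrs)].append((idx, center))

    result = {}
    for attrs, members in by_subspace.items():
        ids = [i for i, _ in members]
        centers = [c for _, c in members]
        result[attrs] = [
            [ids[k] for k in group] for group in single_linkage(centers, cut)
        ]
    return result
```

Units defined on different attribute subsets are never compared, so clusters found in one subspace stay separate from those found in another.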
Keywords:Clustering analysis  High dimensional Clustering (HC) algorithm  Kernel density estimation  Local Significant Unit (LSU)
This article is indexed in Wanfang Data and other databases.