
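An Optimized Grid-Based Clustering Algorithm
Cite this article: LIU Jun-ling, SUN Huan-liang, WANG Da-ling, NIU Zhi-cheng. An Optimized Grid-Based Clustering Algorithm [J]. Mini-micro Systems (小型微型计算机系统), 2006, 27(10): 1927-1930.
Authors: LIU Jun-ling  SUN Huan-liang  WANG Da-ling  NIU Zhi-cheng
Affiliations: 1. Computer Center, Shenyang Jianzhu University, Shenyang, Liaoning 110168
2. School of Information and Control Engineering, Shenyang Jianzhu University, Shenyang, Liaoning 110168
3. School of Information Science and Engineering, Northeastern University, Shenyang, Liaoning 110004
Funding: National Natural Science Foundation of China; Natural Science Foundation of Liaoning Province; project funded by the Education Department of Liaoning Province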
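Abstract (translated from the Chinese): Clustering is an important research topic in data mining. Compared with other algorithms, grid-based clustering algorithms can process massive low-dimensional data efficiently. However, because the number of cells produced by the partitioning grows exponentially with the dimensionality of the data, higher-dimensional datasets generate too many cells, which makes the algorithm inefficient. This paper designs a new grid-based clustering algorithm built on the CD-Tree, whose efficiency is far higher than that of traditional grid-based clustering algorithms. In addition, a pruning optimization strategy is designed to further improve efficiency. Experiments show that, compared with traditional clustering algorithms, the CD-Tree-based clustering algorithm is significantly more scalable with respect to both dataset size and dimensionality.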

Keywords: data mining; cluster analysis; grid-based algorithms
Article ID: 1000-1220(2006)10-1927-04
Received: 08 10 2005
Revised: 2005-08-10

Optimized Cell-based Clustering Algorithm
LIU Jun-ling,SUN Huan-liang,WANG Da-ling,NIU Zhi-cheng.Optimized Cell-based Clustering Algorithm[J].Mini-micro Systems,2006,27(10):1927-1930.
Authors:LIU Jun-ling  SUN Huan-liang  WANG Da-ling  NIU Zhi-cheng
Affiliation:1.Computer Center, Shenyang Jianzhu University, Shenyang 110168, China;2.School of Information and Control Engineering, Shenyang Jianzhu University, Shenyang 110168, China;3.School of Information Science and Engineering, Northeastern University, Shenyang 110004, China
Abstract: Clustering is an important topic in data mining. Compared with other algorithms, cell-based clustering algorithms can efficiently handle massive low-dimensional data. However, in cell-based algorithms the number of cells increases exponentially with the dimensionality, so they become inefficient on high-dimensional data because of the large number of cells. This paper proposes a new clustering algorithm based on the CD-Tree, which greatly improves the efficiency of the cell-based approach. In addition, to improve efficiency further, we design a pruning strategy that prunes the non-dense cells before the clustering procedure. Extensive experiments on real and synthetic datasets show that the algorithm scales better than other cell-based clustering algorithms.
Keywords:CD-Tree
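
To make the idea in the abstract concrete, here is a minimal Python sketch of generic cell-based clustering with a pruning step. It is not the paper's algorithm and does not implement the CD-Tree, whose structure the abstract does not describe. As a stand-in, it keeps only the non-empty cells in a hash map, which is one common way to avoid materializing the exponentially many cells of the full grid. The function name `grid_cluster` and the parameters `cell_width` and `min_points` are assumptions made for illustration.

```python
from collections import defaultdict, deque
from itertools import product


def grid_cluster(points, cell_width, min_points):
    """Label points by linking adjacent dense grid cells (illustrative sketch).

    cell_width and min_points are hypothetical parameters, not values
    taken from the paper.
    """
    # Store only non-empty cells, keyed by integer grid coordinates.
    cells = defaultdict(list)
    for idx, p in enumerate(points):
        key = tuple(int(x // cell_width) for x in p)
        cells[key].append(idx)

    # Pruning step: discard non-dense cells before clustering starts.
    dense = {k: v for k, v in cells.items() if len(v) >= min_points}

    labels = [-1] * len(points)  # -1 marks points that fell in pruned cells
    seen = set()
    cluster_id = 0
    for start in dense:
        if start in seen:
            continue
        # Breadth-first search over neighbouring dense cells.
        seen.add(start)
        queue = deque([start])
        while queue:
            cell = queue.popleft()
            for idx in dense[cell]:
                labels[idx] = cluster_id
            # The all-zero offset is the cell itself; it is already in `seen`.
            for offset in product((-1, 0, 1), repeat=len(cell)):
                nb = tuple(c + o for c, o in zip(cell, offset))
                if nb in dense and nb not in seen:
                    seen.add(nb)
                    queue.append(nb)
        cluster_id += 1
    return labels
```

Note that this sketch enumerates 3^d neighbour offsets per dense cell, so it still degrades as the dimensionality d grows. That is the kind of cost the abstract says the CD-Tree-based algorithm is designed to reduce.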
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.