首页 | 本学科首页   官方微博 | 高级检索  
     

一类数据空间网格化聚类算法的均值近似方法
引用本文:李存华,孙志挥.一类数据空间网格化聚类算法的均值近似方法[J].软件学报,2003,14(7):1267-1274.
作者姓名:李存华  孙志挥
作者单位:东南大学,计算机科学与工程系,江苏,南京,210096
基金项目:(Supported by the National Natural Science Foundation of China under Grant No.79970092 (国家自然科学基金); the Natural Science Foundation of the Education Board of Jiangsu Province of China under Grant No.02KJB520012 (江苏省教育厅自然科学基金)
摘    要:随着聚类分析对象数据集规模的急剧增大,改进已有的算法以获得满意的效率受到越来越多的重视.讨论了一类采用数据空间网格划分的基于密度的聚类算法的均值近似方法.该方法过滤并释放位于稠密超方格中的数据项,并利用其重心点近似计算其对周围数据元素的影响因子.给出均值近似在聚类算法中的实现策略及其误差估计.均值近似方法在有效减少内存需求、大幅度降低计算复杂度的同时对聚类精确度影响甚微.实验结果验证了该方法能够取得令人满意的效果.

关 键 词:聚类  网格  基于密度的  均值近似  误差估计
收稿时间:2002/4/23 0:00:00
修稿时间:2002/11/4 0:00:00

A Mean Approximation Approach to a Class of Grid-Based Clustering Algorithms
LI Cun-Hua and SUN Zhi-Hui.A Mean Approximation Approach to a Class of Grid-Based Clustering Algorithms[J].Journal of Software,2003,14(7):1267-1274.
Authors:LI Cun-Hua and SUN Zhi-Hui
Abstract:In recent years, the explosively growing amount of data in numerous clustering tasks has attracted considerable interest in boosting the existing clustering algorithms to large datasets. In this paper, the mean approximation approach is discussed to improve a spectrum of partition-oriented density-based algorithms. This approach filters out the data objects in the crowded grids and approximates their influence to the rest by their gravity centers. Strategies on implementation issues as well as the error bound of the mean approximation are presented. Mean approximation leads to less memory usage and simplifies computational complexity with minor lose of the clustering accuracy. Results of exhaustive experiments reveal the promising performance of this approach.
Keywords:clustering  grid  density-based  mean approximation  error evaluation
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号