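FDBSCAN: A Fast DBSCAN Algorithm
Cite this article: ZHOU Shui-geng, ZHOU Ao-ying, JIN Wen, FAN Ye, QIAN Wei-ning. FDBSCAN: A Fast DBSCAN Algorithm[J]. Journal of Software, 2000, 11(6): 735-744.
Authors: ZHOU Shui-geng  ZHOU Ao-ying  JIN Wen  FAN Ye  QIAN Wei-ning
Affiliation: Department of Computer Science, Fudan University, Shanghai 200433 (all five authors)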
Funding: This research is supported by the National 973 Fundamental Research of China (国家重点基础研究计划, No. G1998030414), the National Natural Science Foundation of China (国家自然科学基金, No. 6974300), and the National Doctoral Subject Foundation of China (国家博士后项目基金, No. 1999024621).
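Abstract: Cluster analysis is an important technique with broad application prospects in data mining, statistical data analysis, pattern matching, image processing, and other fields. Many clustering algorithms have been proposed to date. Among them, DBSCAN is a high-performance, density-based spatial clustering algorithm. Building on a density-based notion of clusters, DBSCAN needs only one input parameter from the user, can discover clusters of arbitrary shape, and handles noise effectively. This paper proposes a method for speeding up DBSCAN. The new algorithm expands a cluster using, as seed objects, representatives of all the objects in a core object's neighborhood, which reduces the number of region queries and lowers the I/O cost. Experimental results show that FDBSCAN can effectively …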

Keywords: large-scale database; data mining; clustering; fast DBSCAN algorithm; representative point
Received: 1999/3/19
Revised: 1999/6/25

FDBSCAN: A Fast DBSCAN Algorithm
ZHOU Shui-geng, ZHOU Ao-ying, JIN Wen, FAN Ye, QIAN Wei-ning. FDBSCAN: A Fast DBSCAN Algorithm[J]. Journal of Software, 2000, 11(6): 735-744.
Authors: ZHOU Shui-geng  ZHOU Ao-ying  JIN Wen  FAN Ye  QIAN Wei-ning
Abstract: Clustering is an important technique in many fields, including data mining, statistical data analysis, pattern recognition, image processing, and other business applications. Many clustering algorithms have been developed to date. DBSCAN, contributed by the database research community, is an outstanding representative of these algorithms because of its good performance in clustering spatial data. Relying on a density-based notion of clusters, DBSCAN is designed to discover clusters of arbitrary shape. It requires only one input parameter and supports the user in determining an appropriate value for it. This paper develops a fast DBSCAN algorithm (FDBSCAN) that considerably speeds up the original DBSCAN algorithm. Unlike DBSCAN, FDBSCAN uses only a small number of representative points in a core point's neighborhood as seeds to expand the cluster, so that the number of region queries, and consequently the I/O cost, is reduced. Experimental results show that FDBSCAN is effective and efficient in clustering large-scale databases and is several times faster than the original DBSCAN algorithm.
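
The seeding idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation. The abstract does not say how representative points are chosen, so `pick_representatives` uses an assumed rule: for each axis direction, take the neighbor closest to the point on the neighborhood boundary in that direction. The names `cluster`, `region_query`, `pick_representatives`, and `demo_points` are hypothetical, the region query is brute force rather than index-backed, and the sketch makes no attempt to recover objects that are missed or clusters that are split because fewer seeds are expanded.

```python
import math

UNSEEN, NOISE = None, -1


def region_query(points, i, eps):
    """Indices of all points within eps of points[i] (brute-force scan)."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]


def pick_representatives(points, core, neighbors, eps):
    """Assumed rule, not taken from the paper: for each of the 2n axis
    directions, keep the neighbor nearest to the boundary position
    core +/- eps along that axis."""
    reps = set()
    for d in range(len(points[core])):
        for sign in (-1.0, 1.0):
            target = list(points[core])
            target[d] += sign * eps
            reps.add(min(neighbors, key=lambda j: math.dist(points[j], target)))
    reps.discard(core)
    return reps


def cluster(points, eps, min_pts, use_representatives=False):
    """Density-based clustering. Returns (labels, number_of_region_queries).

    With use_representatives=False every cluster member is used as a seed
    (plain DBSCAN-style expansion); with True only the representatives of
    each core point's neighborhood are used as seeds."""
    labels = [UNSEEN] * len(points)
    queried = set()  # each point is region-queried at most once

    def query(i):
        queried.add(i)
        return region_query(points, i, eps)

    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not UNSEEN:
            continue
        neighbors = query(i)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may be relabeled later as a border point
            continue
        pending = [(i, neighbors)]
        while pending:
            core, neighbors = pending.pop()
            # Every object in a core point's neighborhood joins the cluster.
            for j in neighbors:
                if labels[j] in (UNSEEN, NOISE):
                    labels[j] = cluster_id
            seeds = (pick_representatives(points, core, neighbors, eps)
                     if use_representatives else neighbors)
            for j in seeds:
                if j not in queried and labels[j] == cluster_id:
                    nb = query(j)
                    if len(nb) >= min_pts:
                        pending.append((j, nb))
        cluster_id += 1
    return labels, len(queried)


def demo_points():
    """Two well-separated 9x9 unit grids (synthetic data for illustration)."""
    grid = [(float(x), float(y)) for x in range(9) for y in range(9)]
    return grid + [(x + 100.0, y + 100.0) for x, y in grid]


if __name__ == "__main__":
    pts = demo_points()
    print(cluster(pts, eps=2.5, min_pts=4)[1])
    print(cluster(pts, eps=2.5, min_pts=4, use_representatives=True)[1])
```

On this synthetic data both modes assign the same labels while the representative-seeded run issues fewer region queries. On irregular data the simplified variant can diverge from plain DBSCAN, which is why any real use would need the additional handling that the abstract does not describe.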
Keywords: large-scale database; data mining; clustering; fast DBSCAN algorithm; representative point
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).