首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于划分的不同参数值的DBSCAN算法
引用本文:熊忠阳,孙思,张玉芳,王秀琼.一种基于划分的不同参数值的DBSCAN算法[J].计算机工程与设计,2005,26(9):2319-2321.
作者姓名:熊忠阳  孙思  张玉芳  王秀琼
作者单位:重庆大学计算机学院,重庆,400044
基金项目:重庆市科委应用基础研究基金项目(20037986).
摘    要:聚类是数据挖掘领域中一个重要的研究方向,DBSCAN是一种基于密度的聚类算法.该算法将具有足够高密度的区域划分成簇,并可以在带有“噪声”的空间数据库中发现任意形状的簇.分析DBSCAN算法发现存在如下问题:当数据分布不均匀时,由于使用统一的全局变量,使得聚类的效果差.针对这一缺陷,提出了一种基于数据划分的思想,并对各个局部数据集采取不同的参数值分别进行聚类,最后合并各局部聚类结果.实验结果表明,改进后的算法有效并可行.

关 键 词:聚类  DBSCAN  初始聚类中心  数据划分
文章编号:1000-7024(2005)09-2319-03
收稿时间:2004-09-26
修稿时间:2004-09-26

Partition-based DBSCAN algorithm with different parameter
XIONG Zhong-yang,SUN Si,ZHANG Yu-fang,WANG Xiu-qiong.Partition-based DBSCAN algorithm with different parameter[J].Computer Engineering and Design,2005,26(9):2319-2321.
Authors:XIONG Zhong-yang  SUN Si  ZHANG Yu-fang  WANG Xiu-qiong
Affiliation:Department of Computer Science, Chongqing University, Chongqing 400044, China
Abstract:Clustering is one ofthe most important research fields in data mining. DBSCAN is a density based clustering algorithm, This algorithm is capable of clustering high density areas and finding arbitrary clusters in spatial database with noise. However, when DBSCAN is analyized, it is found that when data distribution is not even, clustering quality degrades for using the same global variable. In this paper, aimming at this weakness, a data partition based algorithm is proposed, For each local dataset, different variables are adopted, and clustering is done separately. At last local clustering results are merged. The experimental result demonstrates that the improved algorithm is effective and feasible.
Keywords:clustering  DBSCAN  initial clustering center  data partition
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号