首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于局部密度的分布式聚类挖掘算法
引用本文:倪巍伟,陈 耿,吴英杰,孙志挥.一种基于局部密度的分布式聚类挖掘算法[J].软件学报,2008,19(9):2339-2348.
作者姓名:倪巍伟  陈 耿  吴英杰  孙志挥
作者单位:1. 东南大学,计算机科学与工程学院,江苏,南京,210096
2. 南京审计学院,审计信息工程实验室,江苏,南京,210029
基金项目:国家教育部高等学校博士学科点科研基金,江苏省自然科学基金
摘    要:分布式聚类挖掘技术是解决数据集分布环境下聚类挖掘问题的有效方法.针对数据水平分布情况,在已有分布式密度聚类算法DBDC(density based distributed clustering)的基础上,引入局部密度聚类和密度吸引子等概念,提出一种基于局部密度的分布式聚类算法——LDBDC(local density based distributed clustering).算法适用于含噪声数据和数据分布异常情况,对高雏数据有着良好的适应性.理论分析和实验结果表明,LDBDC算法在聚类质量和算法效率方面优于已有的DBDC算法和SDBDC(scalable dellsity-based distributed clustering)算法.算法是有效、可行的.

关 键 词:分布式聚类  局部密度聚类  局部聚类模型  密度吸引子  高维数据
收稿时间:6/7/2007 12:00:00 AM
修稿时间:2007/11/5 0:00:00

Local Density Based Distributed Clustering Algorithm
NI Wei-Wei,CHEN Geng,WU Ying-Jie and SUN Zhi-Hui.Local Density Based Distributed Clustering Algorithm[J].Journal of Software,2008,19(9):2339-2348.
Authors:NI Wei-Wei  CHEN Geng  WU Ying-Jie and SUN Zhi-Hui
Abstract:Distributed clustering is an effect method for solving the problem of clustering data located at different sites.Considering the circumstance that data is horizontally distributed,algorithm LDBDC(local density based distributed clustering)is presented based on the existeding algorithm DBDC(density based distributed clustering), which can easily fit datasets of high dimension and abnormal distribution by adopting ideas such as local density-based clustering and density attractor.Theoretical analysis and experimental results show that algorithm LDBDC outperforms DBDC and SDBDC(scalable density-based distributed clustering)in both clustering quality and efficiency.
Keywords:distributed clustering  local density based clustering  local clustering model  density attractor  high dimension data
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号