
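Analyzing Popular Clustering Algorithms from Different Viewpoints
Cite this article: QIAN Wei-ning, ZHOU Ao-ying. Analyzing Popular Clustering Algorithms from Different Viewpoints[J]. Journal of Software (软件学报), 2002, 13(8): 1382-1394.
Authors: QIAN Wei-ning, ZHOU Ao-ying
Affiliations: Open Laboratory of Intelligent Information Processing, Fudan University, Shanghai 200433; Department of Computer Science, Fudan University, Shanghai 200433
Funding: Supported by the National Grand Fundamental Research 973 Program of China under Grant No.G1998030414 (国家重点基础研究发展规划973项目); the National Research Foundation for the Doctoral Program of Higher Education of China under Grant No.99038 (国家教育部博士点基金)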
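Abstract: Clustering is one of the important problems studied in data mining. Cluster analysis partitions a data set into clusters so that data within a cluster are as similar as possible and data in different clusters are as different as possible. Different clustering methods use different similarity measures and techniques. Popular existing clustering algorithms are analyzed from three viewpoints: (1) clustering criteria; (2) algorithm framework; (3) cluster representation. On this basis, several algorithms that combine or generalize other methods are analyzed. Because the analysis proceeds from three viewpoints, the proposed approach can cover and distinguish the vast majority of existing clustering algorithms. This work forms the basis for research on self-tuning clustering methods and clustering benchmarks.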

Keywords: data mining; clustering analysis; algorithm
Received: 9/3/2001
Revised: 2002/2/25

Analyzing Popular Clustering Algorithms from Different Viewpoints
QIAN Wei-ning and ZHOU Ao-ying. Analyzing Popular Clustering Algorithms from Different Viewpoints[J]. Journal of Software, 2002, 13(8): 1382-1394.
Authors: QIAN Wei-ning and ZHOU Ao-ying
Abstract: Clustering is widely studied in the data mining community. It is used to partition a data set into clusters so that intra-cluster data are similar and inter-cluster data are dissimilar. Different clustering methods use different similarity definitions and techniques. Several popular clustering algorithms are analyzed from three different viewpoints: (1) clustering criteria, (2) cluster representation, and (3) algorithm framework. Furthermore, some newly built algorithms, which mix or generalize other algorithms, are introduced. Since the analysis is made from several viewpoints, it can cover and distinguish most of the existing algorithms. It is the basis of research on self-tuning algorithms and clustering benchmarks.
Keywords: data mining; clustering; algorithm
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.