首页 | 本学科首页   官方微博 | 高级检索  
     

层次聚类的簇集成方法研究
引用本文:李凯,王兰.层次聚类的簇集成方法研究[J].计算机工程与应用,2010,46(27):120-123.
作者姓名:李凯  王兰
作者单位:河北大学 数学与计算机学院,河北省机器学习与计算智能实验室,河北 保定 071002
基金项目:国家自然科学基金,河北省自然科学基金 
摘    要:聚类集成比单个聚类方法具有更高的鲁棒性和精确性,它主要由两部分组成,即个体成员的产生和结果的融合。针对聚类集成,首先用k-means聚类算法得到个体成员,然后使用层次聚类中的单连接法、全连接法与平均连接法进行融合。为了评价聚类集成方法的性能,实验中使用了ARI(Adjusted Rand Index)。实验结果表明,平均连接法的聚类集成性能优于单连接法和全连接法。研究并讨论了融合方法的聚类正确率和集成规模的关系。

关 键 词:聚类集成  融合函数  聚类  ARI
收稿时间:2009-3-3
修稿时间:2009-5-11  

Research on cluster ensembles methods based on hierarchical clustering
LI Kai,WANG Lan.Research on cluster ensembles methods based on hierarchical clustering[J].Computer Engineering and Applications,2010,46(27):120-123.
Authors:LI Kai  WANG Lan
Affiliation:School of Mathematic and Computer,HeBei University,Key Lab in Machine Learning and Computational Intelligence of Hebei Province,Baoding,Hebei 071002,China
Abstract:Cluster ensembles method is considered as a robust and accurate alternative to single clustering runs.It mainly consists of both generation of individual member and fusion methods.In this paper,the cluster ensembles are studied where individual members are obtained based on k-means clustering algorithm and fusion method of hierarchical clustering is used. Three consensus functions, which are single linkage, complete linkage and average linkage, respectively, is studied and discussed in hierarchical clustering fusion.For evaluating performance of cluster ensembles,Adjusted Rand Index is considered. Experimental results show that performance of cluster ensembles with the average linkage is superior to one with single linkage and complete linkage.Moreover, the relationship between accuracy and ensemble size of the three fusion methods is also studied and discussed.
Keywords:ARI
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号