首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于层次聚类的k均值算法研究
引用本文:张红云,李萍萍.一种基于层次聚类的k均值算法研究[J].微计算机信息,2010(12).
作者姓名:张红云  李萍萍
作者单位:张家口教育学院宣化分校;装甲兵工程学院基础部;
摘    要:依据信息论的思想,对基于层次的K-均值聚类算法(HKMA)过程进行了分析,该算法首先采用层次方法对文档进行初始聚类,得到的聚类总数作为k均值算法中的k值,在此基础上,通过k均值聚类对聚类结果进行修正。实验结果表明,HKMA执行时间整体上优于k-means算法,而且随着数据量的增大执行时间的增长幅度也较小。

关 键 词:聚簇  k-means  层次方法  文本挖掘  

A K-means Clustering Algorithm based on Hierarchy
ZHANG Hong-yun LI Ping-ping.A K-means Clustering Algorithm based on Hierarchy[J].Control & Automation,2010(12).
Authors:ZHANG Hong-yun LI Ping-ping
Affiliation:ZHANG Hong-yun(Xuanhua Campus of Zhangjiakou Educational College,Xuanhua Hebei,075100,China) LI Ping-ping(Department of Fundamental Course,The Academy of Armored Forces Engineering,Beijing,10072,China)
Abstract:Probabilistic hierarchical clustering based on document information quantity.From an information theory angle,we study a K-means clustering algorithm based on hierarchy in this paper.Firstly,this algorithm classifies documents into one or more predefined categories using hierarchical methods,the total classified number is taken for the number of clusters.Secondly,it uses k-means to modify the clustering results.Experimental results showed that these algorithms have higher mining efficiency in execution time...
Keywords:Cluster  k-means  hierarchical methods  text mining  
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号