首页 | 官方网站   微博 | 高级检索  
     

半监督的改进K-均值聚类算法
引用本文:汪军,王传玉,周鸣争.半监督的改进K-均值聚类算法[J].计算机工程与应用,2009,45(28):137-139.
作者姓名:汪军  王传玉  周鸣争
作者单位:1.安徽工程科技学院 计算机科学与工程系,安徽 芜湖 241000 ;2.安徽工程科技学院 应用数理系,安徽 芜湖 241000
基金项目:国家自然科学基金专项基金 
摘    要:K-均值聚类算法必须事先获取聚类数目,并且随机地选取聚类初始中心会造成聚类结果不稳定,容易在获得一个局部最优值时终止。提出了一种基于半监督学习理论的改进K-均值聚类算法,利用少量标签数据建立图的最小生成树并迭代分裂获取K-均值聚类算法所需要的聚类数和初始聚类中心。在IRIS数据集上的实验表明,尽管随机样本构造的生成树不同,聚类中心也不同,但聚类是一致且稳定的,迭代的次数较少,验证了该文算法的有效性。

关 键 词:半监督学习  K-均值聚类  标签样本  最小生成树
收稿时间:2009-4-8
修稿时间:2009-6-11  

Semi-supervised improved K-means clustering algorithm
WANG Jun,WANG Chuan-yu,ZHOU Ming-zheng.Semi-supervised improved K-means clustering algorithm[J].Computer Engineering and Applications,2009,45(28):137-139.
Authors:WANG Jun  WANG Chuan-yu  ZHOU Ming-zheng
Affiliation:1.Department of Computer Science &; Engineering,Anhui University of Technology and Science,Wuhu,Anhui 241000,China 2.Department of Math &; Physics,Anhui University of Technology and Science,Wuhu,Anhui 241000,China
Abstract:K-means clustering algorithm acquires the number of clusters in advance.The random selection of the initial cluster centers will result in the instability and K-means clustering algorithm will be terminated in access to a local optimum value.In order to solve the problem,the improved K-means clustering algorithm based on semi-supervised learning theory obtains the number of clustering and initial clustering centers after building minimum spanning tree used by few label samples and splitting it iteratively.Although minimum spanning tree making up of random samples and initial clustering centers are different,the clustering is consistent and stable;the iteration is less than traditional K-means algorithm.It proves that the semi-supervised improved K-means algorithm is effective.
Keywords:semi-supervised learning  K-means clustering  labeled sample  minimum spanning tree
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号