半监督的改进K-均值聚类算法 Semi-supervised improved K-means clustering algorithm

Cite this article:	汪军, 王传玉, 周鸣争. 半监督的改进K-均值聚类算法[J]. 计算机工程与应用, 2009, 45(28): 137-139.

Authors:	汪军 (WANG Jun), 王传玉 (WANG Chuan-yu), 周鸣争 (ZHOU Ming-zheng)

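Affiliations:	1. Department of Computer Science and Engineering, Anhui University of Technology and Science (安徽工程科技学院), Wuhu, Anhui 241000; 2. Department of Applied Mathematics and Physics, Anhui University of Technology and Science, Wuhu, Anhui 241000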

Funding:	Special Fund of the National Natural Science Foundation of China

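Abstract (translated from the Chinese):	The K-means clustering algorithm requires the number of clusters to be known in advance, and randomly selecting the initial cluster centers makes the clustering result unstable and prone to terminating at a local optimum. This paper proposes an improved K-means clustering algorithm based on semi-supervised learning theory. It uses a small amount of labeled data to build the minimum spanning tree of a graph, then splits the tree iteratively to obtain the number of clusters and the initial cluster centers that K-means needs. Experiments on the IRIS dataset show that, although different random samples produce different spanning trees and different cluster centers, the clustering is consistent and stable and needs few iterations, which verifies the effectiveness of the proposed algorithm.
Keywords:	semi-supervised learning; K-means clustering; labeled samples; minimum spanning tree
Received:	2009-4-8
Revised:	2009-6-11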
Semi-supervised improved K-means clustering algorithm

WANG Jun, WANG Chuan-yu, ZHOU Ming-zheng. Semi-supervised improved K-means clustering algorithm[J]. Computer Engineering and Applications, 2009, 45(28): 137-139.

Authors:	WANG Jun, WANG Chuan-yu, ZHOU Ming-zheng

Affiliation:	1. Department of Computer Science & Engineering, Anhui University of Technology and Science, Wuhu, Anhui 241000, China; 2. Department of Math & Physics, Anhui University of Technology and Science, Wuhu, Anhui 241000, China

Abstract:	The K-means clustering algorithm requires the number of clusters to be specified in advance. Random selection of the initial cluster centers makes the result unstable, and the algorithm tends to terminate at a local optimum. To address these problems, an improved K-means clustering algorithm based on semi-supervised learning theory is proposed. It builds a minimum spanning tree from a small number of labeled samples and splits the tree iteratively to obtain the number of clusters and the initial cluster centers. Experiments on the IRIS dataset show that, although the minimum spanning trees built from random samples and the resulting initial cluster centers differ, the clustering is consistent and stable, and fewer iterations are needed than with the traditional K-means algorithm. This indicates that the semi-supervised improved K-means algorithm is effective.

Keywords:	semi-supervised learning; K-means clustering; labeled samples; minimum spanning tree
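
The abstract describes the pipeline only in outline, so the following is a minimal sketch under stated assumptions rather than the authors' implementation. It builds a minimum spanning tree over the labeled samples with Prim's algorithm, splits the tree, takes the number of resulting subtrees as the cluster count and their means as initial centers, and then runs standard K-means. The splitting rule (cut the longest edge of any subtree that still mixes labels, until every subtree is label-pure), the Euclidean distance, and all function names are assumptions introduced for illustration; the abstract does not specify them.

```python
import math
from collections import defaultdict

def mean(vectors):
    """Component-wise mean of a non-empty collection of equal-length tuples."""
    vectors = list(vectors)
    return tuple(sum(col) / len(vectors) for col in zip(*vectors))

def prim_mst(points):
    """MST edges (weight, i, j) over all points, Euclidean distance, Prim's algorithm."""
    n = len(points)
    in_tree = [False] * n
    best = [(math.inf, -1)] * n          # (distance to the tree, parent index)
    best[0] = (0.0, -1)
    edges = []
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i][0])
        in_tree[u] = True
        if best[u][1] >= 0:
            edges.append((best[u][0], best[u][1], u))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best[v][0]:
                    best[v] = (d, u)
    return edges

def components(n, edges):
    """Connected components (lists of node indices) of the forest given by edges."""
    adj = defaultdict(list)
    for _, i, j in edges:
        adj[i].append(j)
        adj[j].append(i)
    seen, comps = set(), []
    for start in range(n):
        if start in seen:
            continue
        seen.add(start)
        stack, comp = [start], []
        while stack:
            u = stack.pop()
            comp.append(u)
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    stack.append(v)
        comps.append(comp)
    return comps

def seeds_from_labeled(points, labels):
    """Assumed split rule: cut the longest edge of any mixed-label subtree until all are pure."""
    edges = prim_mst(points)
    while True:
        comps = components(len(points), edges)
        mixed = [set(c) for c in comps if len({labels[i] for i in c}) > 1]
        if not mixed:
            break
        candidates = [e for e in edges if any(e[1] in m for m in mixed)]
        edges.remove(max(candidates))    # longest edge inside a mixed subtree
    return [mean(points[i] for i in c) for c in comps]

def kmeans(data, centers, max_iter=100):
    """Standard Lloyd iterations from the given initial centers."""
    centers = list(centers)
    nearest = lambda x: min(range(len(centers)), key=lambda c: math.dist(x, centers[c]))
    for iterations in range(1, max_iter + 1):
        groups = [[] for _ in centers]
        for x in data:
            groups[nearest(x)].append(x)
        new = [mean(g) if g else centers[k] for k, g in enumerate(groups)]
        if new == centers:               # assignments stopped changing
            break
        centers = new
    return centers, [nearest(x) for x in data], iterations

# Toy usage: four labeled samples (two per class) plus two unlabeled points.
labeled = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
labels = ["a", "a", "b", "b"]
seeds = seeds_from_labeled(labeled, labels)
centers, assignment, iterations = kmeans(labeled + [(1.0, 0.0), (9.0, 10.0)], seeds)
print(len(seeds), assignment, iterations)
```

Because the seeds come from the labeled subtrees instead of random draws, repeated runs on the same labeled subset start K-means from the same centers, which is the stability property the abstract reports.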
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).