半监督的改进K-均值聚类算法

doi:10.3778/j.issn.1002-8331.2009.28.041

计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (28): 137-139.DOI: 10.3778/j.issn.1002-8331.2009.28.041

• 数据库、信号与信息处理 • 上一篇下一篇

半监督的改进K-均值聚类算法

汪军^1，2，王传玉²，周鸣争¹

1.安徽工程科技学院计算机科学与工程系，安徽芜湖 241000
2.安徽工程科技学院应用数理系，安徽芜湖 241000

收稿日期:2009-04-08 修回日期:2009-06-11 出版日期:2009-10-01 发布日期:2009-10-01
通讯作者: 汪军

Semi-supervised improved K-means clustering algorithm

WANG Jun^1，2，WANG Chuan-yu²，ZHOU Ming-zheng¹

1.Department of Computer Science & Engineering，Anhui University of Technology and Science，Wuhu，Anhui 241000，China
2.Department of Math & Physics，Anhui University of Technology and Science，Wuhu，Anhui 241000，China

Received:2009-04-08 Revised:2009-06-11 Online:2009-10-01 Published:2009-10-01
Contact: WANG Jun

摘要/Abstract

摘要： K-均值聚类算法必须事先获取聚类数目，并且随机地选取聚类初始中心会造成聚类结果不稳定，容易在获得一个局部最优值时终止。提出了一种基于半监督学习理论的改进K-均值聚类算法，利用少量标签数据建立图的最小生成树并迭代分裂获取K-均值聚类算法所需要的聚类数和初始聚类中心。在IRIS数据集上的实验表明，尽管随机样本构造的生成树不同，聚类中心也不同，但聚类是一致且稳定的，迭代的次数较少，验证了该文算法的有效性。

关键词: 半监督学习, K-均值聚类, 标签样本, 最小生成树

Abstract: K-means clustering algorithm acquires the number of clusters in advance.The random selection of the initial cluster centers will result in the instability and K-means clustering algorithm will be terminated in access to a local optimum value.In order to solve the problem，the improved K-means clustering algorithm based on semi-supervised learning theory obtains the number of clustering and initial clustering centers after building minimum spanning tree used by few label samples and splitting it iteratively.Although minimum spanning tree making up of random samples and initial clustering centers are different，the clustering is consistent and stable；the iteration is less than traditional K-means algorithm.It proves that the semi-supervised improved K-means algorithm is effective.

Key words: semi-supervised learning, K-means clustering, labeled sample, minimum spanning tree

中图分类号:

TP391.4

汪军^1，2，王传玉²，周鸣争¹. 半监督的改进K-均值聚类算法[J]. 计算机工程与应用, 2009, 45(28): 137-139.

WANG Jun^1，2，WANG Chuan-yu²，ZHOU Ming-zheng¹. Semi-supervised improved K-means clustering algorithm[J]. Computer Engineering and Applications, 2009, 45(28): 137-139.

[1]	邹承明，胡佑璞. 引入生成对抗网络的室外场景单目深度估计[J]. 计算机工程与应用, 2021, 57(6): 176-183.
[2]	米源，唐恒亮. 基于图卷积网络的谣言鉴别研究[J]. 计算机工程与应用, 2021, 57(13): 161-167.
[3]	唐焕玲，刘艳红，郑涵，窦全胜，鲁明羽. 融合SLDA主题模型的不均衡文本分类方法[J]. 计算机工程与应用, 2021, 57(12): 144-154.
[4]	宋丽丽，李彬，赵俊雅，刘国峰. 正态重采样的改进行人再识别度量学习算法[J]. 计算机工程与应用, 2020, 56(8): 158-165.
[5]	韩嵩，韩秋弘. 半监督学习研究的述评[J]. 计算机工程与应用, 2020, 56(6): 19-27.
[6]	邓宇，谌贵辉，李忠兵，张军豪，亢宇欣，夏旭洪. 基于颜色与边缘融合的非局部立体匹配算法[J]. 计算机工程与应用, 2020, 56(10): 199-204.
[7]	宋森森1，贾振红1，杨杰2，Nikola KASABOV3. 结合Ostu阈值法的最小生成树图像分割算法[J]. 计算机工程与应用, 2019, 55(9): 178-183.
[8]	杨烁，刘兵，周勇. 基于稀疏编码的半监督低秩核学习算法[J]. 计算机工程与应用, 2019, 55(7): 175-181.
[9]	张璞1，柴变芳1，张静1，李文斌2. 半监督属性网络表示学习方法[J]. 计算机工程与应用, 2019, 55(12): 117-123.
[10]	王玉业，陈健美. 安全的半监督方法的协同过滤推荐算法[J]. 计算机工程与应用, 2018, 54(8): 107-111.
[11]	吴明胜，邓晓刚. 基于Tri-DE-ELM的半监督模式分类方法研究[J]. 计算机工程与应用, 2018, 54(3): 109-114.
[12]	卢月明1，王亮1，仇阿根1，张用川1，2，赵阳阳1. 基于半监督学习的克里金插值方法[J]. 计算机工程与应用, 2018, 54(22): 265-270.
[13]	陈玉琦1，雷刚1，姚明海2，易玉根1. 基于局部约束的自适应图标签传递方法[J]. 计算机工程与应用, 2018, 54(20): 14-19.
[14]	李村合，朱红波. 基于半监督学习的多示例多标记E-MIMLSVM+算法[J]. 计算机工程与应用, 2018, 54(2): 149-154.
[15]	曹戴，陈丽芳. 基于杰卡德度量的智能拼图改进算法[J]. 计算机工程与应用, 2018, 54(2): 188-192.

半监督的改进K-均值聚类算法

Semi-supervised improved K-means clustering algorithm

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics