首页 | 本学科首页   官方微博 | 高级检索  
     

改进灰狼优化算法的K-Means文本聚类
引用本文:潘成胜,张斌,吕亚娜,杜秀丽,邱少明.改进灰狼优化算法的K-Means文本聚类[J].计算机工程与应用,2021,57(1):188-193.
作者姓名:潘成胜  张斌  吕亚娜  杜秀丽  邱少明
作者单位:大连大学 通信与网络重点实验室,辽宁 大连 116622
基金项目:中央军委装备发展部领域基金
摘    要:针对K-Means算法在文本聚类过程中易陷入局部最优,造成文本聚类结果不准确的问题,提出了一种基于改进灰狼优化算法的K-Means文本聚类方法。在对文本数据进行分词、去停用词、特征提取以及文本向量化后,通过免疫克隆选择选出精英个体,并对精英个体进行深度探索以增加灰狼种群的多样性,避免早熟收敛现象的发生;将粒子群位置更新思想与灰狼位置更新结合,降低灰狼优化算法陷入局部极值的风险;与K-Means算法结合进行文本聚类。所提算法与K-Means算法、GWO-KMeans以及IPSK-Means算法相比,其准确率、召回率和F值平均都有明显提高,文本聚类结果更可靠。

关 键 词:K-Means算法  文本聚类  灰狼优化算法  免疫克隆  粒子群  

K-Means Text Clustering Based on Improved Gray Wolf Optimization Algorithm
PAN Chengsheng,ZHANG Bin,LYU Yana,DU Xiuli,QIU Shaoming.K-Means Text Clustering Based on Improved Gray Wolf Optimization Algorithm[J].Computer Engineering and Applications,2021,57(1):188-193.
Authors:PAN Chengsheng  ZHANG Bin  LYU Yana  DU Xiuli  QIU Shaoming
Affiliation:Key Laboratory of Communication and Network, Dalian University, Dalian, Liaoning 116622, China
Abstract:Focusing the issue of K-Means algorithm is easy to fall into the local optimum during the text clustering process,which results in inaccurate text clustering results.The K-Means text clustering method based on the improved gray wolf optimization algorithm is proposed.After word segmentation,de-stopping,feature extraction,and text vectorization of text data,the elite individuals are selected through immune cloning,and the elite individuals are explored in depth to increase the diversity of the gray wolf population and avoid premature convergence.It combines the particle swarm location update idea with the gray wolf location update to reduce the risk of the gray wolf optimization algorithm falling into local extremes.Finally,improved gray wolf optimization algorithm is combined with the K-Means algorithm for text clustering.Compared with the K-Means algorithm,GWO-KMeans and IPSK-Means algorithm,the proposed algorithm has significantly improved accuracy,recall and F-value average,respectively,the text clustering result is more reliable.
Keywords:K-Means algorithm  text clustering  gray wolf optimization  immune clone  particle swarm
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号