首页 | 本学科首页   官方微博 | 高级检索  
     

基于增强蜂群优化与k-means的文本聚类算法
引用本文:柯钢,杨俊.基于增强蜂群优化与k-means的文本聚类算法[J].计算机应用研究,2016,33(8).
作者姓名:柯钢  杨俊
作者单位:东莞职业技术学院 计算机工程系,中山大学 信息科学与技术学院
基金项目:国家自然科学基金(61106019);东莞市社会科技发展项目(2013108101045);
摘    要:针对文本数据维度较高、空间分布稀疏及其聚类效果不佳的问题,提出一种基于增强蜂群优化搜索与k-means的高效文本聚类算法。首先为蜂群算法引入公平操作与克隆操作来提高全局搜索的能力,公平操作提高了样本多样性并增强了蜂群搜索能力,克隆操作则增强了各代之间的信息交流,提高了求解质量。最终引入k-means进行局部质心的提炼,提高聚类质量。基于文本数据集的试验结果证明,相较于其他聚类算法,本算法具有更高的聚类质量。

关 键 词:蜂群算法    公平操作  克隆操作  多样性  局部提炼  文本聚类  
收稿时间:2015/3/17 0:00:00
修稿时间:2016/6/23 0:00:00

Enhanced bee colony optimal and k-means based document clustering algorithm
KE Gang and Yang Jun.Enhanced bee colony optimal and k-means based document clustering algorithm[J].Application Research of Computers,2016,33(8).
Authors:KE Gang and Yang Jun
Affiliation:Deptof Computer Engineering,Dongguan Polytechnic,Dongguan Guangdong,School of Information Science and Technology,Sun Yat-Sen University
Abstract:Aimed at the problem that the document data has the characteristics such as high-dimensionality and sparseness, a Enhanced bee colony optimal and k-means based document clustering algorithm is proposed. Firstly, the fairness and clone operation are introduced to bee colony to improve the global search power, the individuals diversity and the search power are enhanced by fairness operation, the information communication between different iterations is enhanced by clone operation, and the solution quality is improved. At last, the clustering quality is improved by k-means which is good at local refining. Experiments results based on the documents show that the proposed algorithm has better clustering quality than the other clustering algorithm.
Keywords:Bee colony algorithm  fairness operation  diversity  local refine  document clustering
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号