首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进人工蜂群算法与MapReduce的大数据聚类算法
引用本文:孙倩,陈昊,李超.基于改进人工蜂群算法与MapReduce的大数据聚类算法[J].计算机应用研究,2020,37(6):1707-1710,1764.
作者姓名:孙倩  陈昊  李超
作者单位:湖北大学信息化建设与管理处,武汉430062;湖北大学 计算机与信息工程学院,武汉430062
基金项目:湖北省教育厅科学技术研究重点项目
摘    要:针对大数据聚类算法计算效率与聚类性能较低的问题,提出了一种基于改进人工蜂群算法与MapReduce的大数据聚类算法。将灰狼优化算法与人工蜂群算法结合,同时提高人工蜂群算法的搜索能力与开发能力,该策略能够有效地提高聚类处理的性能;采用混沌映射与反向学习作为ABC种群的初始化策略,提高搜索的解质量;将聚类算法基于Hadoop的MapReduce编程模型实现,通过最小化类内距离的平方和实现对大数据的聚类处理。实验结果表明,该算法有效地提高了大数据集的聚类质量,同时加快了聚类速度。

关 键 词:数据分析  聚类算法  人工蜂群算法  灰狼优化算法  云计算  分布式计算
收稿时间:2018/11/12 0:00:00
修稿时间:2019/1/4 0:00:00

Clustering algorithm of big data based on improved artificial bee colony algorithm and MapReduce
Sun Qian,Chen Hao and Li Chao.Clustering algorithm of big data based on improved artificial bee colony algorithm and MapReduce[J].Application Research of Computers,2020,37(6):1707-1710,1764.
Authors:Sun Qian  Chen Hao and Li Chao
Affiliation:Informationization Management Department,Hubei University,,
Abstract:Aiming at the problems of low computational efficiency and low clustering performance of clustering algorithms for big data, this paper presented a clustering algorithm of big data based on the improved artificial bee colony(ABC) algorithm and MapReduce. This algorithm combined the grey wolf optimizer algorithm and ABC algorithm, and improved the exploration and exploitation of the ABC algorithm simultaneously, it could help to improve the clustering performance effectively. The algorithm utilized the chaotic map and backward learning as the initial strategy of ABC colony to improve the solution quality of search procedure. It realized the clustering algorithm based on MapReduce programming model, and realized the clustering process for big data by minimizing the quadratic sum of inner class distances. Experimental results demonstrate that the proposed algorithm improves the clustering quality of big data, and speedups the clustering procedure.
Keywords:data analysis  clustering algorithm  artificial bee colony algorithm  grey wolf algorithm  cloud computing  distributed computing
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号