首页 | 本学科首页   官方微博 | 高级检索  
     

大数据下不完备信息系统近似空间的并行算法
引用本文:姜麟,米允龙,王添. 大数据下不完备信息系统近似空间的并行算法[J]. 计算机工程与应用, 2014, 50(15): 101-106
作者姓名:姜麟  米允龙  王添
作者单位:昆明理工大学 理学院,昆明 650500
摘    要:上、下近似空间是粗糙理论的重要概念,解决上、下近似问题是海量数据挖掘的基础。经典的近似空间算法不适合处理海量数据,更不适合处理带缺失信息的海量数据问题。为此,通过深度分析带缺失信息的海量数据特征,结合MapReduce编程模型,提出了基于MapReduce框架下近似空间的并行算法,以处理带缺失信息的海量数据,实验结果表明了该并行算法的有效性。

关 键 词:MapReduce  数据挖掘  海量数据  粗糙集  不完备信息系统  近似空间  

Parallel algorithm for computing incomplete information systems under big data
JIANG Lin,MI Yunlong,WANG Tian. Parallel algorithm for computing incomplete information systems under big data[J]. Computer Engineering and Applications, 2014, 50(15): 101-106
Authors:JIANG Lin  MI Yunlong  WANG Tian
Affiliation:Faculty of Science, Kunming University of Science and Technology, Kunming 650500, China
Abstract:The lower and upper approximations are important concepts in rough set theory. Therefore, the computation of approximations is the basic for improving the massive data mining performance. Classical approximation space algorithm is infeasible for massive data, much less for massive data with missing information. To this end, through deep analysis of the characteristics of massive data with missing information, combining with the MapReduce programming model, a parallel algorithm for computing incomplete information systems using MapReduce is put forward to deal with the massive data with missing information. The experimental results demonstrate that the proposed parallel algorithm is effective.
Keywords:MapReduce  data mining  massive data  rough set  incomplete information system  approximations  
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号