首页 | 本学科首页   官方微博 | 高级检索  
     

一种高效用项集并行挖掘算法
引用本文:宋威,吉红蕾,李晋宏.一种高效用项集并行挖掘算法[J].计算机工程与科学,2015,37(3):422-428.
作者姓名:宋威  吉红蕾  李晋宏
作者单位:北方工业大学计算机学院,北京,100144
基金项目:国家自然科学基金资助项目(61105045,51075423);北京市属市管高等学校人才强教计划资助项目(PHR201108057);北方工业大学科研人才提升计划资助项目(CCXZ201303)
摘    要:由于能反映用户的偏好,可以弥补传统频繁项集挖掘仅由支持度来衡量项集重要性的不足,高效用项集正在成为当前数据挖掘研究的热点。为使高效用项集挖掘更好地适应数据规模不断增大的实际需求,提出了一种高效用项集的并行挖掘算法PHUI-Mine。提出了记录挖掘高效用项集信息的DHUI-树结构,描述了DHUI-树的构造方法,论证了DHUI-树的动态剪枝策略。在此基础上,给出了高效用项集挖掘的并行算法描述。实验结果表明,PHUI-Mine算法具有较高的挖掘效率及较低的存储开销。

关 键 词:数据挖掘  高效用项集  并行算法  动态高效用项集树
收稿时间:2013-10-10
修稿时间:2014-01-09

A parallel algorithm for mining high utility itemsets
SONG Wei , JI Hong-lei , LI Jin-hong.A parallel algorithm for mining high utility itemsets[J].Computer Engineering & Science,2015,37(3):422-428.
Authors:SONG Wei  JI Hong-lei  LI Jin-hong
Affiliation:(College of Computer,North China University of Technology,Beijing 100144,China)
Abstract:Mining high utility itemsets is becoming a hot research topic in data mining owing to its ability to reflect users’preferences and make up for the shortcoming of measuring itemsets only by support degree.To meet the needs of larger data size,a parallel algorithm,called Parallel High Utility Itemset Mine (PHUI Mine ),for mining high utility itemsets is proposed.Firstly,a tree structure,called DHUI tree, is introduced to capture the information of high utility itemsets. Meanwhile, the DHUI tree construction method is described,and the dynamic pruning strategy of DHUI tree is discussed.Then, the parallel algorithm is presented. Experimental results show that PHUI Mine algorithm is efficient and has low storage cost.
Keywords:data mining  high utility itemset  parallel algorithm  DHUI-tree
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号