首页 | 本学科首页   官方微博 | 高级检索  
     

含负项top-k高效用项集挖掘算法
引用本文:孙蕊,韩萌,张春砚,申明尧,杜诗语. 含负项top-k高效用项集挖掘算法[J]. 计算机应用, 2021, 41(8): 2386-2395. DOI: 10.11772/j.issn.1001-9081.2020101561
作者姓名:孙蕊  韩萌  张春砚  申明尧  杜诗语
作者单位:北方民族大学 计算机科学与工程学院, 银川 750021
基金项目:国家自然科学基金资助项目(62062004);宁夏自然科学基金资助项目(2020AAC03216);计算机应用技术自治区重点学科项目(PY1902);宁夏高等学校一流学科建设项目(电子科学与技术学科)(NXYKXY2017A07)。
摘    要:含负项高效用项集(HUI)挖掘是新兴的数据挖掘任务之一.为了挖掘满足用户需求的含负项HUI结果集,提出了含负项top-k高效用项集(THN)挖掘算法.为了提升THN算法的时空性能,提出了自动提升最小效用阈值的策略,并采用模式增长方法进行深度优先搜索;使用重新定义的子树效用和重新定义的本地效用修剪搜索空间;使用事务合并技...

关 键 词:项集挖掘  高效用项集  top-k项集  负项  正项
收稿时间:2020-10-10
修稿时间:2020-12-07

Algorithm for mining top-k high utility itemsets with negative items
SUN Rui,HAN Meng,ZHANG Chunyan,SHEN Mingyao,DU Shiyu. Algorithm for mining top-k high utility itemsets with negative items[J]. Journal of Computer Applications, 2021, 41(8): 2386-2395. DOI: 10.11772/j.issn.1001-9081.2020101561
Authors:SUN Rui  HAN Meng  ZHANG Chunyan  SHEN Mingyao  DU Shiyu
Affiliation:School of Computer Science and Engineering, North Minzu University, Yinchuan Ningxia 750021, China
Abstract:Mininng High Utility Itemsets (HUI) with negative items is one of the emerging itemsets mining tasks. In order to mine the result set of HUI with negative items meeting the user needs, a Top-k High utility itemsets with Negative items (THN) mining algorithm was proposed. In order to improve the temporal and spatial performance of the THN algorithm, a strategy to automatically increase the minimum utility threshold was proposed, and the pattern growth method was used for depth-first search; the search space was pruned by using the redefined subtree utility and the redefined local utility; the transaction merging technology and dataset projection technology were employed to solve the problem of scanning the database for multiple times; in order to increase the utility counting speed, the utility array counting technology was used to calculate the utility of the itemset. Experimental results show that the memory usage of THN algorithm is about 1/60 of that of the HUINIV (High Utility Itemsets with Negative Item Values)-Mine algorithm, and is about 1/2 of that of the FHN (Faster High utility itemset miner with Negative unit profits) algorithm; the THN algorithm takes 1/10 runtime of that of the FHN algorithm; and the THN algorithm achieves better performance on dense datasets.
Keywords:itemset mining  high utility itemset  top-k itemset  negative item  positive item  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号