首页 | 本学科首页   官方微博 | 高级检索  
     

NIA2: A fast indirect association mining algorithm
引用本文:倪旻,徐晓飞,邓胜春,问晓先.NIA2: A fast indirect association mining algorithm[J].哈尔滨工业大学学报(英文版),2005,12(5):511-516.
作者姓名:倪旻  徐晓飞  邓胜春  问晓先
作者单位:Dept. of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
摘    要:Indirect association is a high level relationship between items and frequent item sets in data. There are many potential applications for indirect associations, such as database marketing, intelligent data analysis, web -log analysis, recommended system, etc. Existing indirect association mining algorithms are mostly based on the notion of post - processing of discovery of frequent item sets. In the mining process, all frequent item sets need to be generated first, and then they are fihered and joined to form indirect associations. We have presented an indirect association mining algorithm (NIA) based on anti -monotonicity of indirect associations whereas k candidate indirect associations can be generated directly from k - 1 candidate indirect associations, without all frequent item sets generated. We also use the frequent itempair support matrix to reduce the time and memory space needed by the algorithm. In this paper, a novel algorithm (NIA2) is introduced based on the generation of indirect association patterns between itempairs through one item mediator sets from frequent itempair support matrix. A notion of mediator set support threshold is also presented. NIA2 mines indirect association patterns directly from the dataset, without generating all frequent item sets. The frequent itempair support matrix and the notion of using tm as the support threshold for mediator sets can significantly reduce the cost of joint operations and the search process compared with existing algorithms. Results of experiments on a real - word web log dataset have proved NIA2 one order of magnitude faster than existing algorithms.

关 键 词:数据采集  联合采集算法  间接采集  频率支撑矩阵
文章编号:1005-9113(2005)05-0511-06
收稿时间:2003-06-18

NIA2: A fast indirect association mining algorithm
Ni Min;Xu XiaoFei;Deng ShengChun;Wen XiaoXian.NIA2: A fast indirect association mining algorithm[J].Journal of Harbin Institute of Technology,2005,12(5):511-516.
Authors:Ni Min;Xu XiaoFei;Deng ShengChun;Wen XiaoXian
Abstract:Indirect association is a high level relationship between items and frequent item sets in data. There are many potential applications for indirect associations, such as database marketing, intelligent data analysis, web-log analysis, recommended system, etc. Existing indirect association mining algorithms are mostly based on the notion of post-processing of discovery of frequent item sets. In the mining process, all frequent item sets need to be generated first, and then they are filtered and joined to form indirect associations. We have presented an indirect association mining algorithm (NIA) based on anti-monotonicity of indirect associations whereas k candidate indirect associations can be generated directly from k-1 candidate indirect associations, without all frequent item sets generated. We also use the frequent itempair support matrix to reduce the time and memory space needed by the algorithm. In this paper, a novel algorithm (NIA2) is introduced based on the generation of indirect association patterns between itempairs through one item mediator sets from frequent itempair support matrix. A notion of mediator set support threshold is also presented. NIA2 mines indirect association patterns directly from the dataset, without generating all frequent item sets. The frequent itempair support matrix and the notion of using t_m as the support threshold for mediator sets can significantly reduce the cost of joint operations and the search process compared with existing algorithms. Results of experiments on a real-word web log dataset have proved NIA2 one order of magnitude faster than existing algorithms.
Keywords:data mining  association rule mining  indirect association  frequent itempair support matrix  mediator set support threshold
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号