首页 | 本学科首页   官方微博 | 高级检索  
     


An improved frequent pattern growth method for mining association rules
Authors:Ke-Chung Lin  I-En Liao  Zhi-Sheng Chen
Affiliation:1. Universitá di Perugia, Dipartimento di Matematica e Informatica, Via Vanvitelli n.1, Perugia, 06123, Italy;2. Politecnico di Bari, Dipartimento di Ingegneria Elettrica e dell’Informazione, Via E. Orabona n. 4, Bari, 70125, Italy;1. Department of Computer Engineering, Sejong University, Seoul, Republic of Korea;2. Faculty of Software and Information Science, Iwate Prefectural University (IPU), Iwate, Japan
Abstract:Many algorithms have been proposed to efficiently mine association rules. One of the most important approaches is FP-growth. Without candidate generation, FP-growth proposes an algorithm to compress information needed for mining frequent itemsets in FP-tree and recursively constructs FP-trees to find all frequent itemsets. Performance results have demonstrated that the FP-growth method performs extremely well. In this paper, we propose the IFP-growth (improved FP-growth) algorithm to improve the performance of FP-growth. There are three major features of IFP-growth. First, it employs an address-table structure to lower the complexity of forming the entire FP-tree. Second, it uses a new structure called FP-tree+ to reduce the need for building conditional FP-trees recursively. Third, by using address-table and FP-tree+ the proposed algorithm has less memory requirement and better performance in comparison with FP-tree based algorithms. The experimental results show that the IFP-growth requires relatively little memory space during the mining process. Even when the minimum support is low, the space needed by IFP-growth is about one half of that of FP-growth and about one fourth of that of nonordfp algorithm. As to the execution time, our method outperforms FP-growth by one to 300 times under different minimum supports. The proposed algorithm also outperforms nonordfp algorithm in most cases. As a result, IFP-growth is very suitable for high performance applications.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号