Mining top-<em>k</em> regular-frequent itemsets using database partitioning and support estimation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Mining top-k regular-frequent itemsets using database partitioning and support estimation

Authors:	Komate Amphawan Philippe Lenca Athasit Surarerks

Affiliation:	^a Chulalongkorn University, ELITE Laboratory, 10330 Bangkok, Thailand ^b Institut Telecom, Telecom Bretagne, UMR CNRS 3192 Lab-STICC, France ^c Université européenne de Bretagne, France

Abstract:	Temporal regularity of itemset appearance can be regarded as an important criterion for measuring the interestingness of itemsets in several applications. A frequent itemset can be said to be regular-frequent in a database if it appears at a regular period. Therefore, the problem of mining a complete set of regular-frequent itemsets requires the specification of a support and a regularity threshold. However, in practice, it is often difficult for users to provide an appropriate support threshold. In addition, the use of a support threshold tends to produce a large number of regular-frequent itemsets and it might be better to ask for the number of desired results. We thus propose an efficient algorithm for mining top-k regular-frequent itemsets without setting a support threshold. Based on database partitioning and support estimation techniques, the proposed algorithm also uses a best-first search strategy with only one database scan. We then compare our algorithm with the state-of-the-art algorithms for mining top-k regular-frequent itemsets. Our experimental studies on both synthetic and real data show that our proposal achieves high performance for small and large values of k.

Keywords:	Data mining Association rule Frequent itemset Top-k itemset mining Regular-frequent itemset
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏