首页 | 本学科首页   官方微博 | 高级检索  
     


Discovering knowledge from large databases using prestored information
Affiliation:1. School of Communication, Journalism and Marketing, Massey University Albany Campus, Private Bag 102 904, North Shore, Auckland 0745, New Zealand;2. Department of Marketing, City University of Hong Kong, Kowloon Tong, Hong Kong;3. National Sun-Yat Sen University, College of Management, 70 Lien-Hai Road, Kaohsiung 804, Taiwan, ROC;1. Hunter Retrieval Service, Hunter New England Local Health District, New Lambton Heights, New South Wales, Australia;2. New South Wales Ambulance, Marks Point, New South Wales, Australia;3. Hunter Aero Trim, Tighes Hill, New South Wales, Australia
Abstract:In this paper, we examine the two issues of mining association rules and mining sequential patterns in a large database of sales transactions. The problems of mining association rules and mining sequential patterns focus on discovering large itemsets and large sequences, respectively. We present PSI and PSI_seq for efficient large itemsets generation and large sequences generation, respectively. The main ideas of these two algorithms are using prestored information to minimize the numbers of candidate itemsets and candidate sequences counted in each database scan. The prestored informations for PSI and PSI_seq include the itemsets and the sequences along with their support counts found in the last mining, respectively. Typically a user may require to tune the value of the minimum support many times before a set of useful association rules can be obtained from the transaction database. Using prestored information, the total computation time will be reduced effectively. Empirical results show that our approaches outperform previous methods by an order of magnitude, using little storage space for the prestored information.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号