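Improved Graph-Based Association Rule Algorithm
Cite this article: Huang Hongxing. Improved Graph-Based Association Rule Algorithm [J]. Computer and Digital Engineering, 2009, 37(12): 38-41, 162.
Author: Huang Hongxing
Affiliation: College of Computer and Information, Fujian Agriculture and Forestry University, Fuzhou 350002
Abstract: Association rule mining is one of the most important topics in data mining research. DLG, a graph-based association rule mining algorithm, builds an association graph in a single scan of the database and then traverses that graph to generate frequent itemsets, which effectively improves the performance of association rule mining. Based on an analysis of the algorithm's basic principles, an improved algorithm, DLG#, is proposed. The improved algorithm builds an itemset association matrix while it constructs the association graph, and during candidate generation it combines the association graph with the Apriori property to prune redundant itemsets, which reduces the number of candidate itemsets and simplifies their verification. Comparative experiments show that the improved algorithm finds frequent itemsets faster across different datasets and support thresholds, with a marked gain when the average length of the frequent itemsets is large.

Keywords: data mining; association rules; frequent itemsets; association graph; association matrix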

Revised Algorithm of Mining Association Rules Based on Graph
Huang Hongxing.Revised Algorithm of Mining Association Rules Based on Graph[J].Computer and Digital Engineering,2009,37(12):38-41,162.
Authors:Huang Hongxing
Affiliation:College of Computer and Information, Fujian Agriculture and Forestry University, Fuzhou 350002
Abstract:Mining association rules is one of the most important research topics in data mining. DLG, a graph-based algorithm for mining association rules, scans the database once to construct an association graph and then traverses the graph to generate frequent itemsets, which effectively improves mining performance. After analyzing the basic principle of DLG, this paper proposes a revised algorithm named DLG#. The revised algorithm constructs an association matrix together with the association graph, and when generating candidate itemsets it applies the Apriori property to the association graph to prune redundant candidates, so that fewer candidates are produced and validating them is simpler. Comparative experiments show that the revised algorithm discovers frequent itemsets faster across different datasets and support thresholds, and that performance improves significantly when the average length of the frequent itemsets is large.
Keywords:data mining  association rule  frequent itemsets  association graph  association matrix
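
The abstract describes the method only at a high level, so the following is a minimal Python sketch of the general idea rather than the paper's implementation. It assumes one bit vector per item (filled in a single scan), an association graph whose edges join items that form a frequent pair, and a pairwise lookup standing in for the association matrix. The function name `dlg_sketch`, the variable names, and the exact form of the pruning test are all hypothetical; the pruning shown is one plausible reading of "Apriori property on the association graph", namely that a candidate is skipped unless every item already in the itemset is linked to the new item.

```python
from itertools import combinations


def dlg_sketch(transactions, min_count):
    """Return {frozenset(itemset): support count} for all frequent itemsets."""
    # Single scan: one bit vector per item; bit t is set when
    # transaction number t contains the item.
    bits = {}
    for t, items in enumerate(transactions):
        for item in set(items):
            bits[item] = bits.get(item, 0) | (1 << t)

    def support(vec):
        return bin(vec).count("1")

    # Frequent single items, in a fixed order so edges always point forward.
    order = sorted(i for i, vec in bits.items() if support(vec) >= min_count)
    frequent = {frozenset([i]): support(bits[i]) for i in order}

    # Build the association graph and the pairwise "matrix" together
    # (assumed layout): edge a -> b exists when {a, b} is frequent.
    edges = {i: [] for i in order}
    linked = set()  # stands in for the association matrix
    level = []
    for a, b in combinations(order, 2):
        vec = bits[a] & bits[b]
        if support(vec) >= min_count:
            edges[a].append(b)
            linked.add((a, b))
            frequent[frozenset((a, b))] = support(vec)
            level.append(((a, b), vec))

    # Extend each frequent k-itemset along edges leaving its last item.
    while level:
        next_level = []
        for itemset, vec in level:
            for u in edges[itemset[-1]]:
                # Hypothetical pruning step: skip the candidate unless
                # every earlier item is also linked to u.
                if not all((i, u) in linked for i in itemset[:-1]):
                    continue
                new_vec = vec & bits[u]
                if support(new_vec) >= min_count:
                    new_set = itemset + (u,)
                    frequent[frozenset(new_set)] = support(new_vec)
                    next_level.append((new_set, new_vec))
        level = next_level
    return frequent
```

Calling `dlg_sketch(transactions, min_count)` with a list of item lists and an absolute support count returns every frequent itemset with its count. Because the pairwise check only consults `linked`, a pruned candidate never needs its bit vectors intersected, which is where a saving in candidate verification would come from under these assumptions.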
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).