首页 | 本学科首页   官方微博 | 高级检索  
     

关联规则的冗余删除与聚类
引用本文:韦素云,吉根林,曲维光.关联规则的冗余删除与聚类[J].小型微型计算机系统,2006,27(1):110-113.
作者姓名:韦素云  吉根林  曲维光
作者单位:1. 南京师范大学,计算机系,江苏,南京,210097
2. 苏州大学,省计算机信息处理重点实验室,江苏,苏州,215006
摘    要:关联规则挖掘常常会产生大量的规则,这使得用户分析和利用这些规则变得十分困难,尤其是数据库中属性高度相关时,问题更为突出.为了帮助用户做探索式分析,可以采用各种技术来有效地减少规则数量,如约束性关联规则挖掘、对规则进行聚类或泛化等技术.本文提出一种关联规则冗余删除算法ADRR和一种关联规则聚类算法ACAR.根据集合具有的性质,证明在挖掘到的关联规则中存在大量可以删除的冗余规则,从而提出了算法ADRR;算法ACAR采用一种新的用项目间的相关性来定义规则间距离的方法,结合DBSCAN算法的思想对关联规则进行聚类.最后将本文提出的算法加以实现,实验结果表明该算法暑有数可行的.且具较高的效率。

关 键 词:关联规则  相关性  聚类
文章编号:i000-1220(2006)01-0110-04
收稿时间:2005-02-08
修稿时间:2005-02-08

Pruning and Clustering Discovered Association Rules
WEI Su-yun,JI Gen-lin,QU Wei-guang.Pruning and Clustering Discovered Association Rules[J].Mini-micro Systems,2006,27(1):110-113.
Authors:WEI Su-yun  JI Gen-lin  QU Wei-guang
Affiliation:1.Department of Computer Science, Nanjing Normal University, Nanjing 210097, China; 2.Jiangsu Province Key Laboratory of Information Processing, Suzhou University, Suzhou 215016, China
Abstract:A common problem in association rule mining is that a large number of rules are often generated from the databases, which makes it difficult for users to analyze and makes use of these rules. This is particularly true for data sets whose attributes are highly correlated. To facilitate exploratory analysis, the number of rules can be reduced significantly by techniques such as mining association rules with constraint items, post-pruning or clustering and summarizing rules. This paper proposed algorithms ADRR and ACAR to overcome this problem. Firstly, algorithm ADRR prunes the discovered associations by removing those redundant associations according to the property of the set, and then algorithm ACAR makes use of the correlation information of the items to measure the distances between rules, Therefore, clustering algorithm DBSCAN is applied to generate the clustering structure suitable for exploratory analysis. Finally, an experiment is conducted on a real-life database and the experimental result shows that the method is practical and effective.
Keywords:association rules  correlation  clustering
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号