首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 281 毫秒
1.
关联规则挖掘是数据挖掘领域中的重要研究内容之一。然而,传统的基于支持度-可信度框架的挖掘方法可能会产生大量不相关、甚至是误导的关联规则。针对现有关联规则挖掘的评价标准存在的问题,提出在评价标准中增加兴趣度,并给出了兴趣度的定义和基于兴趣度的关联规则挖掘算法。利用兴趣度将关联规则分为正关联规则和负关联规则,从而可以用算法挖掘带有负项的关联规则。实验结果分析表明,在传统挖掘方法的基础上引入兴趣度,可以有效地减少正关联规则的规模,产生有意义的负关联规则。  相似文献   

2.
关联规则挖掘是数据挖掘领域中的重要研究内容之一。然而,传统的基于支持度-可信度框架的挖掘方法可能会产生大量不相关、甚至是误导的关联规则。针对现有关联规则挖掘的评价标准存在的问题,提出在评价标准中增加兴趣度,并给出了兴趣度的定义和基于兴趣度的关联规则挖掘算法。利用兴趣度将关联规则分为正关联规则和负关联规则,从而可以用算法挖掘带有负项的关联规则。实验结果分析表明,在传统挖掘方法的基础上引入兴趣度,可以有效地减少正关联规则的规模,产生有意义的负关联规则。  相似文献   

3.
关系数据库中数量属性的关联规则挖掘问题是关联规则挖掘中经常要遇到的问题。该文利用遗传算法解决FCM模糊聚类问题主要是为了避免FCM算法的局部极小问题。利用聚类的结果可以使数量型属性关联规则转换成类别型属性,类别型属性再转化为布尔型属性,这样,即可以使用许多已有关联规则挖掘方法挖掘出有意义的规则。  相似文献   

4.
影响关联规则挖掘效率的主要因素是如何快速地求出频繁项目集,文章在分析关联规则挖掘基本原理及算法的基础上,研究一种从最大频繁项集生成所有强关联规则的优化方法,对快速生成关联规则具有一定意义。  相似文献   

5.
广义关联规则及算法研究   总被引:2,自引:0,他引:2  
挖掘广义关联规则是数据挖掘研究的一个重要方面,数据挖掘领域的研究者在挖掘广义关联规则上作了大量的工作,使之成为一个具有普遍和实用意义的数据挖掘方法。文章就挖掘广义关联规则的算法进行了深入的研究。  相似文献   

6.
OLAP关联规则挖掘   总被引:17,自引:1,他引:17  
该文提出一种新的关联规则挖掘方法,OLAP关联规则挖掘。OLAP关联规则挖掘是OLAP技术和一些高效的关联规则挖掘算法的结合。OLAP关联规则挖掘方法是一种灵活的、多维的、多层次的高性能方法。该文首先介绍了O-LAP关联规则挖掘的结构,最后详述了OLAP关联规则挖掘的具体实现。  相似文献   

7.
讨论了区间值关系数据库上模糊关联规则的挖掘算法与预测方法。采用一种比RFCM算法省时的FCMdd算法将记录在属性的取值划分成若干个模糊集,并提出区间值关系数据库上模糊关联规则的挖掘算法。仿真实例说明挖掘算法能够通过挖掘有意义的模糊关联规则来发现区间值关系数据库中蕴涵的关联性。区间值关系数据库上模糊关联规则的预测方法改进了标准可加性模型,并通过遗传算法调整模糊关联规则中三角模糊数的参数来提高预测的精度。  相似文献   

8.
关联规则技术是数据挖掘的最重要的组成部分之一,它用于发现大量数据中项集之间的有意义的关联和相关联系。本文介绍了使用关联规则挖掘算法FP-growth分析学生选课数据的方法。  相似文献   

9.
一个最优分类关联规则算法   总被引:1,自引:0,他引:1  
分类和关联规则发现是数据挖掘中的两个重要领域。使用关联规则算法挖掘分类规则被叫做分类关联规则算法,是一个有较好前景的方法。本文提出了一个最优分类关联规则算法——OCARA。该算法使用最优关联规则挖掘算法挖掘分类规则,并对最优规则集排序,从而获得一个分类精度较高的分类器。将OCARA与传统分类算法C4.5和一般分类关联规则算法CBA、RMR在8个UCI数据集上进行实验比较,结果显示OCARA具有更好的性能,证明OCARA是一个有效的分类关联规则挖掘算法。  相似文献   

10.
模糊聚类在数量型关联规则提取中的应用   总被引:1,自引:0,他引:1  
王越  曹长修 《计算机仿真》2003,20(11):64-66,69
关系数据库中数量属性的关联规则挖掘问题是经常要遇到的问题。该文利用改进的FCM进行模糊聚类,主要是解决FCM算法的局部极小问题。利用聚类的结果可以使数量型属性关联规则向类别型属性转换,类别型属性再转化为布尔型属性,这样,便可以从许多关联规则的挖掘方法中找出有意义的规则。  相似文献   

11.
In Association rule mining, the quantitative attribute values are converted into Boolean values using fixed intervals. Conventional association rule mining algorithms are then applied to find relations among the attribute values. These intervals may not be concise and meaningful enough for human users to easily obtain non trivial knowledge from those rules discovered. Clustering techniques can be used for segmenting quantitative values into meaningful groups instead of fixed intervals. But the conventional clustering techniques like k-means and c-means require the user to specify the number of clusters and initial cluster centres. This initialization is one of the major challenges of clustering. A novel fuzzy based unsupervised clustering algorithm proposed by the authors is extended to segment quantitative values into fuzzy clusters in this paper. Membership values of quantitative items in the partitioning fuzzy clusters are used with weighted fuzzy rule mining techniques to find natural association rules. This fuzzy based method for handling quantitative attributes is compared with that of fixed intervals and segmenting using conventional k-means clustering method along with Apriori algorithm.  相似文献   

12.
数据挖掘是从数据库中发现潜在有用知识或者感兴趣模式的过程。在数据挖掘领域中主要集中于单一支持度下的关联规则挖掘,在事务数据库中发现项目之间的关联性,而在实际应用中,项目可以有不同的最小支持度,不同的项目可能具有不同的标准去判断其重要性,因此提出一个在最大值支持度约束下,发现有用的模糊关联规则挖掘算法,在该约束下,利用逐层搜索的迭代方法发现频繁项目集,通过实例证明了该挖掘算法是易于理解和有意义的,具有很好的效率。  相似文献   

13.

The volume of published linked open datasets in RDF format is growing exponentially in the last decades. With this continuous proliferation of this growth, demands for managing, accessing, and compressing the RDF dataset have become increasingly important. Most approaches are focused on the structured compression technique while a very few researches have been done for compact representation of the RDF dataset. In this paper, we have proposed an efficient rule mining and compression approach for RDF datasets through various meaningful semantic association rules determined from the RDF graph. We have introduced grammar-based pattern system, clustering of rules, rules pruning, and Top-k scheme to improve the expressiveness of rule patterns, identify the similarity within the random pair of rules, extract the most delicate rules, find the accurate mining threshold, and efficiently learn the rules during the rule mining process from RDF Knowledge Base. Our proposed system uses Horn rules to achieve better compression through storing the triples matched with the precedent part while deleting the triples matched with the head part of the rules. For decreasing the mining time, we have introduced the ranking of the rules. The experimental result on the benchmark dataset asserts that our proposed rule mining and compression scheme has achieved approximately 22.10%, 40.5%, and 44% better compression than the exiting AMIE+, Rule-based compression, and TripleBit approaches, respectively. Our system also has achieved better performance both in terms of compression time and rule mining cost.

  相似文献   

14.
Two parameters, namely support and confidence, in association rule mining, are used to arrange association rules in either increasing or decreasing order. These two parameters are assigned values by counting the number of transactions satisfying the rule without considering user perspective. Hence, an association rule, with low values of support and confidence, but meaningful to the user, does not receive the same importance as is perceived by the user. Reflecting user perspective is of paramount importance in light of improving user satisfaction for a given recommendation system. In this paper, we propose a model and an algorithm to extract association rules, meaningful to a user, with an ad-hoc support and confidence by allowing the user to specify the importance of each transaction. In addition, we apply the characteristics of a concept lattice, a core data structure of Formal Concept Analysis (FCA) to reflect subsumption relation of association rules when assigning the priority to each rule. Finally, we describe experiment results to verify the potential and efficiency of the proposed method.  相似文献   

15.
一种改进的Apriori挖掘关联规则算法   总被引:2,自引:0,他引:2  
关联规则挖掘可以发现大量数据中项集之间有趣的联系,并已在许多领域得到了广泛的应用。但传统关联规则挖掘很少考虑数据项的重要程度,这些算法认为每个数据对规则的重要性相同,实际挖掘的结果不是很理想。为了挖掘出更具有价值的规则,文中提出了一种加权的关联规则算法,即用频度和利润来标识该项的重要性,然后对经典Apriori算法进行改进。最后用实例对改进后算法进行验证,结果证明改进后算法是合理有效的,能够挖掘出更具价值的信息。  相似文献   

16.
王妍  王丽君  方芸 《微机发展》2012,(1):137-139,156
为了解决商品进货无关联的现状,找到商品间的关联规则,更好地进行商品的搭配进货,从而提高进货效率,文中引入了关联规则的思想,并利用规则进行了商品关联规则的挖掘。在分析了关联规则挖掘的算法后,将其应用到超市商品数据库中,利用关联规则挖掘出大量数据中项集即商品之间的相互关联,并抽取出有价值的商品关联规则,利用支持度和平衡度这两个度量概念,优化出强规则集,并用这一思想成功设计了PLM即产品全生命周期管理中的搭配进货系统。  相似文献   

17.
数据挖掘和专家系统同属人工智能领域。关联规则是数据挖掘的一种方法,它的最典型的应用是超市的购物篮分析。专家系统主要解决的是智能推理问题而关联规则侧重于各个数据项之间有价值的联系。通过对关联规则的Apriori算法及规则的产生方法进行改动,挖掘出可应用于专家系统的知识库中的决策规则,从而找出了利用关联规则挖掘出用于决策的规则的方法。  相似文献   

18.
The association rule mining is one of the most popular data mining techniques, however, the users often experience difficulties in interpreting and exploiting the association rules extracted from large transaction data with high dimensionality. The primary reasons for such difficulties are two-folds. Firstly, too many association rules can be produced by the conventional association rule mining algorithms, and secondly, some association rules can be partly overlapped. This problem can be addressed if the user can select the relevant items to be used in association rule mining, however, there are often quite complex relations among the items in large transaction data. In this context, this paper aims to propose a novel visual exploration tool, structured association map (SAM), which enables the users to find the group of the relevant items in a visual way. The appearance of SAM is similar with the well-known cluster heat map, however, the items in SAM are sorted in more intelligent way so that the users can easily find the interesting area formed by a set of associated items, which are likely to constitute interesting many-to-many association rules. Moreover, this paper introduces an index called S2C, designed to evaluate the quality of SAM, and explains the SAM based association analysis procedure in a comprehensive manner. For illustration, this procedure is applied to a mass health examination result data set, and the experiment results demonstrate that SAM with high S2C value helps to reduce the complexities of association analysis significantly and it enables to focus on the specific region of the search space of association rule mining while avoiding the irrelevant association rules.  相似文献   

19.
针对单一层次结构实现规则提取具有规则提取准确性不高、算法运行时间长、难以满足用户使用需求的问题,提出一种基于改进多层次模糊关联规则的定量数据挖掘算法。采用高频项目集合,通过不断深化迭代的方法形成自顶向下的挖掘过程,整合模糊集合理论、数据挖掘算法以及多层次分类技术,从事务数据集中寻找模糊关联规则,挖掘出储存在多层次结构事务数据库中定量值信息的隐含知识,实现用户的定制化信息挖掘需求。实验结果表明,提出的数据挖掘算法在挖掘精度和运算时间方面相较于其他算法具有突出优势,可为多层次关联规则提取方法的实际应用带来新的发展空间。  相似文献   

20.
锌钡白回转窑煅烧过程的数据是非交易型、不变的事务模式,基于支持-置信度关联规则挖掘算法求得的关联规则,不能用于锌钡白回转窑煅烧现场.论文提出基于实数编码的克隆选择算法,采用多点随机搜索策略进行搜索,对锌钡白回转窑煅烧现场数据进行分析,得到现场数据中的关联规则,并用两组数据仿真对比,实验结果表明,算法可快速有效地求出生产数据中的关联规则,且与工人现场操作经验基本吻合,可用于类似的化工过程控制。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号