首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到17条相似文献,搜索用时 138 毫秒
1.
关系数据库中数量属性的关联规则挖掘问题是关联规则挖掘中经常要遇到的问题。该文利用遗传算法解决FCM模糊聚类问题主要是为了避免FCM算法的局部极小问题。利用聚类的结果可以使数量型属性关联规则转换成类别型属性,类别型属性再转化为布尔型属性,这样,即可以使用许多已有关联规则挖掘方法挖掘出有意义的规则。  相似文献   

2.
基于模糊关联规则的交通事故分析应用研究   总被引:3,自引:0,他引:3  
研究发现交通事故潜在规律,预测交通事故的发生,针对关联规则方法用于交通事故分析,对交通的数值型属性无法给出有效地划分,为提高安全管理决策,提出引入模糊聚类,用改进FCM( Fuzzy c-Means)方法对数值属性进行聚类,可用取值的范围对分类属性进行聚类,采用模糊关联规则挖掘导致交通事故的原因和规律.模糊关联规则首先对FCM算法进行了改进,包括隶属度、权值和中心点的计算和修正方法,利用模糊关联规则方法进行挖掘,最后对算法进行了仿真和可视化显示,结果表明模糊关联规则方法挖掘出的规则符合现实情况,为交通管理提供有效的方法.  相似文献   

3.
针对消费者对商品的偏好存在时序变化特征,而传统关联规则方法未考虑时间因素的影响,且对海量数据集进行关联挖掘时存在效率低下的问题,提出了基于艾宾浩斯遗忘曲线的模糊关联规则算法。该方法通过FCM聚类算法对商品进行聚类,并用艾宾浩斯遗忘曲线来修正聚类的距离度量方法,从而得到商品类及各类的代表点商品;然后将各代表点商品作为属性,消费记录小票作为样本,利用模糊关联规则算法得到代表点商品间的规则;最后将某大型超市一个月的销售记录作为关联规则的事务数据来挖掘潜在规律,结果显示所提算法先对商品模糊关联分析,与传统直接对商品进行关联分析相比,该算法可以显著提高关联挖掘的效率和规则的正确率。  相似文献   

4.
关联规则的冗余删除与聚类   总被引:9,自引:0,他引:9  
关联规则挖掘常常会产生大量的规则,这使得用户分析和利用这些规则变得十分困难,尤其是数据库中属性高度相关时,问题更为突出.为了帮助用户做探索式分析,可以采用各种技术来有效地减少规则数量,如约束性关联规则挖掘、对规则进行聚类或泛化等技术.本文提出一种关联规则冗余删除算法ADRR和一种关联规则聚类算法ACAR.根据集合具有的性质,证明在挖掘到的关联规则中存在大量可以删除的冗余规则,从而提出了算法ADRR;算法ACAR采用一种新的用项目间的相关性来定义规则间距离的方法,结合DBSCAN算法的思想对关联规则进行聚类.最后将本文提出的算法加以实现,实验结果表明该算法暑有数可行的.且具较高的效率。  相似文献   

5.
讨论了在多值属性关系中进行关联规则挖掘的应用特点,提出利用数据整理和数值编码的方式对关联 规则挖掘算法进行优化。将目标数据属性按其在算法中的作用划分,并分别进行转换和编码;然后对数据先进 行聚类,再在聚类结果中发掘频繁项目集;最后利用聚类后关联规则快速更新算法获取关联规则。算法分析和 实验结果表明,该算法比传统的关联规则挖掘算法更有效率。  相似文献   

6.
针对就业信息数据中存在着大量的量化属性和分类属性等现象,提出了一种基于k-means的量化关联规则挖掘方法。该方法利用聚类算法k-means对量化属性进行合理分区,将量化属性转化为布尔型;利用改进的布尔关联规则方法对此进行关联规则挖掘,找出学生的受教育属性和就业属性之间的关联性;对挖掘出的规则进行分析和运用。就业信息数据实验证明,文中所提方法对就业信息进行挖掘是有效的、可行的。  相似文献   

7.
基于聚类和关联规则的挖掘算法   总被引:4,自引:0,他引:4       下载免费PDF全文
数据挖掘技术中关联规则可以很好地发现数据项之间存在的相互关系,同时有大量的挖掘算法可供选择。聚类分析就是通过分析数据库中的记录数据,根据一定的分类规则合理地划分记录集合,确定每个记录所在类别。它所采用的分类规则是由聚类分析工具决定的。本文研究 聚类和关联规则的挖掘算法。  相似文献   

8.
关联规则在肿瘤诊断中的应用   总被引:6,自引:0,他引:6  
挖掘肿瘤诊断数据库中的关联规则,能为肿瘤诊断提供有用的信息。肿瘤诊断数据库中的属性常为数量型属性,因此如何将数量型属性离散化是挖掘关联规则的难点。竞争聚集算法综合了分层聚类与划分聚类的优点,它能够有效地体现数据的实际分布情况并得到优化的聚类个数,因此能将数量型属性离散化成若干个优化的区间。  相似文献   

9.
以超市的量化属性为研究对象,提出一种基于模糊聚类和减类聚类的量化关联规则算法.该算法基本思想是把模糊聚类技术融入到离散化过程中,使数据离散到合理的区间,再利用经典的布尔关联规则挖掘算法Apriori进行挖掘.实验证明,这种方法能够有效挖掘量化关联规则,提高交叉销售的可能性.  相似文献   

10.
针对就业信息数据中存在着大量的量化属性和分类属性等现象,提出了一种基于k-means的量化关联规则挖掘方法,该方法首先利用聚类算法k-means对量化属性进行合理分区,将量化属性转化为布尔型,其次利用改进的布尔关联规则方法对此进行关联规则挖掘,找出学生的受教育属性和就业属性之间的关联性,最后对挖掘出的规则进行分析和运用。就业信息数据实验证明,本文所提方法对就业信息进行挖掘是有效的、可行的,它为高校教育提供了指导和参考依据,在高校教育领域具有较好的应用前景。  相似文献   

11.
Mining optimized association rules with categorical and numericattributes   总被引:1,自引:0,他引:1  
Mining association rules on large data sets has received considerable attention in recent years. Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial, and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support or confidence of the rule is maximized. In this paper, we generalize the optimized association rules problem in three ways: (1) association rules are allowed to contain disjunctions over uninstantiated attributes, (2) association rules are permitted to contain an arbitrary number of uninstantiated attributes, and (3) uninstantiated attributes can be either categorical or numeric. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving multiple attributes. We present effective techniques for pruning the search space when computing optimized association rules for both categorical and numeric attributes. Finally, we report the results of our experiments that indicate that our pruning algorithms are efficient for a large number of uninstantiated attributes, disjunctions, and values in the domain of the attributes  相似文献   

12.
关系表中数量联系规则的挖掘技术   总被引:8,自引:0,他引:8  
文章介绍了含有数量属性与类型属的 关系中联系规则的挖掘问题。采用将数量属性取值域区间划分办法,必须时合并相邻区间,引入部分寒带性概念,用来量化因划分而造成的信息遗失。  相似文献   

13.
A Statistical Theory for Quantitative Association Rules   总被引:7,自引:0,他引:7  
Association rules are a key data-mining tool and as such have been well researched. So far, this research has focused predominantly on databases containing categorical data only. However, many real-world databases contain quantitative attributes and current solutions for this case are so far inadequate. In this paper we introduce a new definition of quantitative association rules based on statistical inference theory. Our definition reflects the intuition that the goal of association rules is to find extraordinary and therefore interesting phenomena in databases. We also introduce the concept of sub-rules which can be applied to any type of association rule. Rigorous experimental evaluation on real-world datasets is presented, demonstrating the usefulness and characteristics of rules mined according to our definition.  相似文献   

14.
Association rules are one of the most frequently used tools for finding relationships between different attributes in a database. There are various techniques for obtaining these rules, the most common of which are those which give categorical association rules. However, when we need to relate attributes which are numeric and discrete, we turn to methods which generate quantitative association rules, a far less studied method than the above. In addition, when the database is extremely large, many of these tools cannot be used. In this paper, we present an evolutionary tool for finding association rules in databases (both small and large) comprising quantitative and categorical attributes without the need for an a priori discretization of the domain of the numeric attributes. Finally, we evaluate the tool using both real and synthetic databases.  相似文献   

15.
A novel multi-objective genetic algorithm (GA)-based rule-mining method for affective product design is proposed to discover a set of rules relating design attributes with customer evaluation based on survey data. The proposed method can generate approximate rules to consider the ambiguity of customer assessments. The generated rules can be used to determine the lower and upper limits of the affective effect of design patterns. For a rule-mining problem, the proposed multi-objective GA approach could simultaneously consider the accuracy, comprehensibility, and definability of approximate rules. In addition, the proposed approach can deal with categorical attributes and quantitative attributes, and determine the interval of quantitative attributes. Categorical and quantitative attributes in affective product design should be considered because they are commonly used to define the design profile of a product. In this paper, a two-stage rule-mining approach is proposed to generate rules with a simple chromosome design in the first stage of rule mining. In the second stage of rule mining, entire rule sets are refined to determine solutions considering rule interaction. A case study on mobile phones is used to demonstrate and validate the performance of the proposed rule-mining method. The method can discover rule sets with good support and coverage rates from the survey data.  相似文献   

16.
刘均  李人厚  郑庆华 《计算机工程》2005,31(9):17-18,33
针对基于区间划分的数值型关联规则分析方法存在的问题,提出了一种基于进制转换的分析方法.此方法将十进制的属性值转换为二进制,并将二进制数值的每一位映射为二个布尔型属性,在此基础上,再采用布尔型关联规则分析方法.相对于基于区间划分的方法,此方法的优点是转化过程不存在失真,且能有效地降低后续分析过程的时间复杂度.  相似文献   

17.
发掘多值属性的关联规则   总被引:45,自引:1,他引:45  
张朝晖  陆玉昌  张钹 《软件学报》1998,9(11):801-805
属性值可以取布尔量或多值量.从以布尔量描述的数据中发掘关联规则已经有比较成熟的系统和方法,而对于多值量则不然.将多值量的数据转化为布尔型的数据是一条方便、有效的途径.提出一种算法,根据数据本身的情况决定多值量的划分,进而将划分后的区段映射为布尔量,在此基础上可发掘容易理解且具有概括性的、有效的关联规则.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号