首页 | 本学科首页   官方微博 | 高级检索  
     

基于概率分布及维度编码的关联规则挖掘
引用本文:王盛,董黎刚,李群. 基于概率分布及维度编码的关联规则挖掘[J]. 计算机工程, 2011, 37(5): 65-67,70
作者姓名:王盛  董黎刚  李群
作者单位:浙江工商大学信息与电子工程学院,杭州,310018
摘    要:设计一种基于二进制数及项目的支持度分布的Apriori改进算法BF-Apriori。该算法通过分析项目的概率分布并对项目集中的项目按概率从大到小进行排序,经维度编码为二进制数后,降低事务数据库的读取开销和存储开销,同时采用切片运算和剪枝技术降低规则挖掘运算的时间复杂度。实验结果表明,BF-Apriori算法降低了50%左右的存储开销及400%以上的执行时间,能提高数据挖掘的存储效率和运算速度。

关 键 词:项目支持度分布  行向量逆序转换  列向量的转换  切片运算  逆序编码

Association Rules Mining Based on Probability Distribution and Dimensions Coding
WANG Sheng,DONG Li-gang,LI Qun. Association Rules Mining Based on Probability Distribution and Dimensions Coding[J]. Computer Engineering, 2011, 37(5): 65-67,70
Authors:WANG Sheng  DONG Li-gang  LI Qun
Affiliation:(College of Information & Electronic Engineering,Zhejiang Gongshang University,Hangzhou 310018,China)
Abstract:This paper designs an improved algorithm named BF-Apriori based on Binary and item support distribution.The algorithm analyses the probability distribution of the items,sorts them in descending order of the probability,and applies dimensions coding to reduce the cost of the database transactions to read and store overhead.While the slice operation and effective pruning scheme are used to reduce the time complexity of rule mining computing.Experimental results show BF-Apriori algorithm reduces about 50% of the storage and more than 400% of the execution time,it can improve the storage efficiency and computational speed in data mining.
Keywords:item support distribution  Reverse Transform on Row(RTR)  Transform on Column(TC)  slice operation  reverse coding
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号