首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进倒排表和集合的最频繁项集挖掘算法
引用本文:陈小玉,杨艳燕,刘克成,朱颢东.基于改进倒排表和集合的最频繁项集挖掘算法[J].计算机应用研究,2012,29(6):2135-2137.
作者姓名:陈小玉  杨艳燕  刘克成  朱颢东
作者单位:1. 南阳理工学院 计算机科学与技术系,河南 南阳,473004
2. 郑州轻工业学院 计算机与通信工程学院,郑州,450002
基金项目:河南省教育厅自然科学研究指导计划项目(2010C520007)
摘    要:最频繁项集挖掘是文本关联规则挖掘中研究的重点和难点,它决定了文本关联规则挖掘算法的性能。针对当前在最频繁项集挖掘方面的不足,将集合论引入倒排表以对其进行改进,然后以此为基础提出了几个命题和推论,并结合最小支持度阈值动态调整策略,提出了一个基于改进的倒排表和集合理论的最频繁项集挖掘算法,最后对所提算法进行验证。实验结果表明,所提算法的规则有效率和时间性能比常用的两个最频繁项集挖掘算法,即NApriori和IntvMatrix算法都好。

关 键 词:最频繁项集  文本关联规则  倒排表  集合理论

Most frequent itemset mining algorithm based on improved inverted list and set theory
CHEN Xiao-yu,YANG Yan-yan,LIU Ke-cheng,ZHU Hao-dong.Most frequent itemset mining algorithm based on improved inverted list and set theory[J].Application Research of Computers,2012,29(6):2135-2137.
Authors:CHEN Xiao-yu  YANG Yan-yan  LIU Ke-cheng  ZHU Hao-dong
Affiliation:1. Dept. of Computer Science & Technology, Nanyang Institute of Technology, Nanyang Henan 473004, China; 2. School of Computer Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Abstract:Most frequent item sets mining is the focus and the difficulty of text association rules mining, and directly determines the performance of text association rules mining algorithms. Aiming at shortcomings existing in most frequent item sets mining algorithms, this paper improved traditional inverted list, it combined minimum support threshold dynamic adjustment strategy and presented a new most frequent itemset mining algorithm based on improved inverted list and set theory. In addition, it also offered several propositions and deductions which were used to improve the performance of the provided algorithm. Finally, through experiment testing, the provided algorithm is better in effective rate of rules and time performance than NApriori and IntvMatrix which are two frequently-used most frequent itemsets mining algorithms.
Keywords:most frequent itemsets  text association rules  inverted list  set theory
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号