首页 | 本学科首页   官方微博 | 高级检索  
     

基于iceberg概念格并置集成的闭频繁项集挖掘算法
引用本文:王黎明,张卓.基于iceberg概念格并置集成的闭频繁项集挖掘算法[J].计算机研究与发展,2007,44(7):1184-1190.
作者姓名:王黎明  张卓
作者单位:郑州大学信息工程学院,郑州,450052
摘    要:由于概念格的完备性,在基于概念格的数据挖掘过程中,构造概念格的时间复杂度和空间复杂度一直是影响其应用的主要因素.结合iceberg概念格的半格特性和概念格的集成思想,首先在理论上分析并置集成后的iceberg概念格与由完备概念格裁剪得到的iceberg格同构;然后分析了iceberg概念格集成过程中的映射关系;最终提出一个新颖的基于iceberg概念格并置的闭频繁项集挖掘算法(Icegalamera).此算法避免了完备概念格的计算,并且在构造过程中采用集成和剪枝策略,从而显著提高了挖掘效率.实验证明其产生的闭频繁项集的完备性.使用稠密和稀疏数据集在单站点模式下进行了性能测试,结果表明稀疏数据集上性能优势明显.

关 键 词:iceberg概念格  集成  闭频繁项集  分布式数据挖掘  形式概念分析  iceberg  概念格  集成过程  频繁项集挖掘算法  Lattices  Concept  Assembly  Based  Closed  Frequent  Itemsets  Mining  性能优势  结果  性能测试  点模式  单站  稀疏数据集  使用  闭频繁项集  验证  挖掘效率
修稿时间:2007-03-14

An Algorithm for Mining Closed Frequent Itemsets Based on Apposition Assembly of Iceberg Concept Lattices
Wang Liming,Zhang Zhuo.An Algorithm for Mining Closed Frequent Itemsets Based on Apposition Assembly of Iceberg Concept Lattices[J].Journal of Computer Research and Development,2007,44(7):1184-1190.
Authors:Wang Liming  Zhang Zhuo
Affiliation:School of Information Engineering, Zhengzhou University, Zhengzhou 450052
Abstract:Formal concept analysis which is an unsupervised learning method for conceptual clustering constitutes an appropriate framework for data mining.However,due to the completeness of concept lattice,the task of constructing the lattice is known to be computationally expensive.The iceberg lattice of context,a substructure of the complete concept lattice,served as a condensed representation of frequent itemsets.And it is well suited for analyzing very large database.And building concept lattice by merging factor lattices drawn from data fragments may be adapted to distributed data mining environment.Inspired by those ideas,a novel algorithm called Icegalamera for iceberg concept lattice assembly from heterogeneous relational tables is presented and is utilized for closed frequent itemsets mining.The completeness of closed frequent itemsets produced by Icegalamera is proved both in theory and in empirical way,and then the merge mapping process is analyzed and implemented from partial iceberg concept lattices to global one.This algorithm avoids computation of structuring the complete concept lattice.Furthermore the merge and pruning strategies are adopted,which makes the algorithm's efficiency outperforms that of the apriori algorithm on generating frequent itemsets under condense and sparse data set.
Keywords:iceberg concept lattice  merge  closed frequent itemset  distributed data mining  formal concept analysis
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号