首页 | 本学科首页   官方微博 | 高级检索  
     

基于图论的频繁模式挖掘
引用本文:汪卫,周皓峰,袁晴晴,楼宇波,施伯乐.基于图论的频繁模式挖掘[J].计算机研究与发展,2005,42(2):230-235.
作者姓名:汪卫  周皓峰  袁晴晴  楼宇波  施伯乐
作者单位:复旦大学计算机与信息技术系,上海,200433
基金项目:国家自然科学基金项目 (6993 3 0 10 ,60 3 0 3 0 0 8),国家“八六三”高技术研究发展计划基金项目 (2 0 0 2AA4Z3 43 0 )
摘    要:对图数据频繁模式的挖掘是近年的研究热点.选择了惟一标号图进行分析,结合图论和频集生成的算法,提出了基于Aproiri思想、运用矩阵乘法的AMGM算法和基于SFP树的SFP算法.它们可有效地挖掘简单图中连通频繁子图.实验表明,这两个算法是十分有效的,其中SFP算法的性能优于AMGM.该算法还被运用于发现Web上的权威页面和社团,具有良好的效果.

关 键 词:SFP树  频繁连通图  数据挖掘

Mining Frequent Patterns Based on Graph Theory
Wang Wei,Zhou Haofeng,Yuan Qingqing,Lou Yubo,Sui Baile.Mining Frequent Patterns Based on Graph Theory[J].Journal of Computer Research and Development,2005,42(2):230-235.
Authors:Wang Wei  Zhou Haofeng  Yuan Qingqing  Lou Yubo  Sui Baile
Abstract:Mining the frequent pattern from data set is one of the key success stories of data mining research. Currently, most of the efforts are focused on the independent data such as the items in the marketing basket. However, the objects in the real world often have close relationship with each other. How to gain the frequent pattern from these relations is the objective of this paper. Graphs are used to model the relations, and a simple type is selected for analysis. Combining the graph-theory and algorithms to generate frequent patterns, two new algorithms are proposed. The first algorithm, named AMGM, is based on the Aproiri idea and makes use of matrix. For the second algorithm, a new structure SFP-tree and an algorithm, which can mine these simple graphs more efficiently, have been proposed. The performance of the algorithms is evaluated by experiments with synthetic datasets. The empirical results show that they both can do the job well, while SFP performs better than AMGM. Such algorithms are also applied in mining of the authoritative pages and communities on Web, which is useful for Web mining. At the end of the paper, the potential improvement is mentioned.
Keywords:SFP tree  connected frequent graph  data mining  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号