首页 | 本学科首页   官方微博 | 高级检索  
     

一种挖掘带时间约束序列模式的改进算法
引用本文:胡学钢,张圆圆.一种挖掘带时间约束序列模式的改进算法[J].智能系统学报,2007,2(2):89-93.
作者姓名:胡学钢  张圆圆
作者单位:合肥工业大学,计算机与信息学院,安徽,合肥,230009
基金项目:安徽省自然科学基金资助项目(050420207).
摘    要:针对带时间约束的序列模式,提出了一种改进的挖掘算法TSPM,克服了传统的序列模式挖掘方法时空开销大,结果数量巨大且缺少针对性的缺陷.算法引入图结构表示频繁2序列,仅需扫描一次数据库,即可将与挖掘任务相关的信息映射到图中,图结构的表示使得挖掘过程可以充分利用项目之间的次序关系,提高了频繁序列的生成效率.另外算法利用序列的位置信息计算支持度,降低了处理时间约束的复杂性,避免了反复测试序列包含的过程.实验证明,该算法较传统的序列模式发现算法在时间和空间性能上具有优越性。

关 键 词:数据挖掘  序列模式  时间约束
文章编号:1673-4785(2007)02-0089-05
修稿时间:2006-06-20

An improved algorithm for mining sequential patterns with time constraints
HU Xue-gang,ZHANG Yuan-yuan.An improved algorithm for mining sequential patterns with time constraints[J].CAAL Transactions on Intelligent Systems,2007,2(2):89-93.
Authors:HU Xue-gang  ZHANG Yuan-yuan
Affiliation:School of Computer and Information, Hefei University of Technology, Hefei 230009, China
Abstract:An improved time constrained sequential pattern mining algorithm (TSPM) is proposed, overcoming the problem of traditional sequential mining algorithm whose performance is poor, and result is numerous and short of pertinence. Graph is introduced to express the frequent 2-sequence. It need scan the transaction database only once, then mapping information related to the mining task into graph. The graph representation can fully utilize the property of item order in the mining process, thus improving the generating efficiency of frequent sequences. Besides it makes use of the positional information of sequence to count support, therefore reducing the complexity of time constraints processing, and avoiding the process of testing whether a candidate sequence is contained in a data sequence. Experimental results prove the superiority of the algorithm in time and space performance.
Keywords:data mining  sequential pattern  time constrain
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号