首页 | 本学科首页   官方微博 | 高级检索  
     

Analyzing Sequential Patterns in Retail Databases
作者姓名:Unil  Yun
作者单位:Electronics and Telecommunications Research Institute, Telematics & USN Research Division LBS/Telematics Convergence Research Team, 161 Gajeong-dong, Yuseong-gu, Daejeon, 305-700, Korea
摘    要:Finding correlated sequential patterns in large sequence databases is one of the essential tasks in data mining since a huge number of sequential patterns are usually mined, but it is hard to find sequential patterns with the correlation. According to the requirement of real applications, the needed data analysis should be different. In previous mining approaches, after mining the sequential patterns, sequential patterns with the weak affinity are found even with a high minimum support. In this paper, a new framework is suggested for mining weighted support affinity patterns in which an objective measure, sequential ws-confidence is developed to detect correlated sequential patterns with weighted support affinity patterns. To efficiently prune the weak affinity patterns, it is proved that ws-confidence measure satisfies the anti-monotone and cross weighted support properties which can be applied to eliminate sequential patterns with dissimilar weighted support levels. Based on the framework, a weighted support affinity pattern mining algorithm (WSMiner) is suggested. The performance study shows that WSMiner is efficient and scalable for mining weighted support affinity patterns.

关 键 词:数据库  数据挖掘  连续模式挖掘算法  数据分析
收稿时间:3 September 2006
修稿时间:2006-09-022006-11-07

Analyzing Sequential Patterns in Retail Databases
Unil Yun.Analyzing Sequential Patterns in Retail Databases[J].Journal of Computer Science and Technology,2007,22(2):287-296.
Authors:Unil Yun
Affiliation:(1) Electronics and Telecommunications Research Institute, Telematics & USN Research Division, LBS/Telematics Convergence Research Team, 161 Gajeong-dong, Yuseong-gu, Daejeon, 305-700, Korea
Abstract:Finding correlated sequential patterns in large sequence databases is one of the essential tasks in data mining since a huge number of sequential patterns are usually mined, but it is hard to find sequential patterns with the correlation. According to the requirement of real applications, the needed data analysis should be different. In previous mining approaches, after mining the sequential patterns, sequential patterns with the weak affinity are found even with a high minimum support. In this paper, a new framework is suggested for mining weighted support affinity patterns in which an objective measure, sequential ws-confidence is developed to detect correlated sequential patterns with weighted support affinity patterns. To efficiently prune the weak affinity patterns, it is proved that ws-confidence measure satisfies the anti-monotone and cross weighted support properties which can be applied to eliminate sequential patterns with dissimilar weighted support levels. Based on the framework, a weighted support affinity pattern mining algorithm (WSMiner) is suggested. The performance study shows that WSMiner is efficient and scalable for mining weighted support affinity patterns.
Keywords:data mining  sequential pattern mining  sequential ws-confidence  weighted support affinity
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
点击此处可从《计算机科学技术学报》浏览原始摘要信息
点击此处可从《计算机科学技术学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号