首页 | 本学科首页   官方微博 | 高级检索  
     


Mining top-k co-occurrence items with sequential pattern
Affiliation:1. Faculty of Information Technology, University of Science, VNU-HCM, Viet Nam;2. Faculty of Information Technology, Ho Chi Minh City University of Technology, Viet Nam;3. College of Electronics and Information Engineering, Sejong University, Seoul, Republic of Korea;4. Division of Data Science, Ton Duc Thang University, Ho Chi Minh City, Viet Nam;5. Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Viet Nam;6. Key Laboratory of Machine Perception (Ministry of Education), School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China;1. Reseach Scholar, Department of EEE, SV University, Tirupathi, Andhra Pradesh, India;2. Professor, Department of EEE, SV University, Tirupathi, Andhra Pradesh, India;2. Department of Computer and Information Science, Annamalai University, TamilNadu, India
Abstract:Frequent sequential pattern mining has become one of the most important tasks in data mining. It has many applications, such as sequential analysis, classification, and prediction. How to generate candidates and how to control the combinatorically explosive number of intermediate subsequences are the most difficult problems. Intelligent systems such as recommender systems, expert systems, and business intelligence systems use only a few patterns, namely those that satisfy a number of defined conditions. Challenges include the mining of top-k patterns, top-rank-k patterns, closed patterns, and maximal patterns. In many cases, end users need to find itemsets that occur with a sequential pattern. Therefore, this paper proposes approaches for mining top-k co-occurrence items usually found with a sequential pattern. The Naive Approach Mining (NAM) algorithm discovers top-k co-occurrence items by directly scanning the sequence database to determine the frequency of items. The Vertical Approach Mining (VAM) algorithm is based on vertical database scanning. The Vertical with Index Approach Mining (VIAM) algorithm is based on a vertical database with index scanning. VAM and VIAM use pruning strategies to reduce the search space, thus improving performance. VAM and VIAM are especially effective in mining the co-occurrence items of a long input pattern. The three algorithms were evaluated using real-world databases. The experimental results show that these algorithms perform well, especially VAM and VIAM.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号