首页 | 本学科首页   官方微博 | 高级检索  
     


Linguistic frequent pattern mining using a compressed structure
Authors:Lin  Jerry Chun-Wei  Ahmed  Usman  Srivastava  Gautam  Wu  Jimmy Ming-Tai  Hong  Tzung-Pei  Djenouri  Youcef
Affiliation:1.Department of Computer Science, Electrical Engineering and Mathematical Sciences, Western Norway University of Applied Sciences, Bergen, Norway
;2.Department of Mathematics & Computer Science, Brandon University, Brandon, Canada
;3.Research Centre for Interneural Computing, China Medical University, Taichung, Taiwan
;4.College of Computer Science and Engineering, Shandong University of Science & Technology, Shandong, China
;5.Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan
;6.SINTEF Digital, Mathematics and Cybernetics, Oslo, Norway
;
Abstract:

Traditional association-rule mining (ARM) considers only the frequency of items in a binary database, which provides insufficient knowledge for making efficient decisions and strategies. The mining of useful information from quantitative databases is not a trivial task compared to conventional algorithms in ARM. Fuzzy-set theory was invented to represent a more valuable form of knowledge for human reasoning, which can also be applied and utilized for quantitative databases. Many approaches have adopted fuzzy-set theory to transform the quantitative value into linguistic terms with its corresponding degree based on defined membership functions for the discovery of FFIs, also known as fuzzy frequent itemsets. Only linguistic terms with maximal scalar cardinality are considered in traditional fuzzy frequent itemset mining, but the uncertainty factor is not involved in past approaches. In this paper, an efficient fuzzy mining (EFM) algorithm is presented to quickly discover multiple FFIs from quantitative databases under type-2 fuzzy-set theory. A compressed fuzzy-list (CFL)-structure is developed to maintain complete information for rule generation. Two pruning techniques are developed for reducing the search space and speeding up the mining process. Several experiments are carried out to verify the efficiency and effectiveness of the designed approach in terms of runtime, the number of examined nodes, memory usage, and scalability under different minimum support thresholds and different linguistic terms used in the membership functions.

Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号