Linguistic frequent pattern mining using a compressed structure期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Linguistic frequent pattern mining using a compressed structure

Authors:	Lin Jerry Chun-Wei Ahmed Usman Srivastava Gautam Wu Jimmy Ming-Tai Hong Tzung-Pei Djenouri Youcef

Affiliation:	1.Department of Computer Science, Electrical Engineering and Mathematical Sciences, Western Norway University of Applied Sciences, Bergen, Norway ;2.Department of Mathematics & Computer Science, Brandon University, Brandon, Canada ;3.Research Centre for Interneural Computing, China Medical University, Taichung, Taiwan ;4.College of Computer Science and Engineering, Shandong University of Science & Technology, Shandong, China ;5.Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan ;6.SINTEF Digital, Mathematics and Cybernetics, Oslo, Norway ;

Abstract:	Traditional association-rule mining (ARM) considers only the frequency of items in a binary database, which provides insufficient knowledge for making efficient decisions and strategies. The mining of useful information from quantitative databases is not a trivial task compared to conventional algorithms in ARM. Fuzzy-set theory was invented to represent a more valuable form of knowledge for human reasoning, which can also be applied and utilized for quantitative databases. Many approaches have adopted fuzzy-set theory to transform the quantitative value into linguistic terms with its corresponding degree based on defined membership functions for the discovery of FFIs, also known as fuzzy frequent itemsets. Only linguistic terms with maximal scalar cardinality are considered in traditional fuzzy frequent itemset mining, but the uncertainty factor is not involved in past approaches. In this paper, an efficient fuzzy mining (EFM) algorithm is presented to quickly discover multiple FFIs from quantitative databases under type-2 fuzzy-set theory. A compressed fuzzy-list (CFL)-structure is developed to maintain complete information for rule generation. Two pruning techniques are developed for reducing the search space and speeding up the mining process. Several experiments are carried out to verify the efficiency and effectiveness of the designed approach in terms of runtime, the number of examined nodes, memory usage, and scalability under different minimum support thresholds and different linguistic terms used in the membership functions.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏