An effective parallel approach for genetic-fuzzy data mining

Affiliation:	1. Dept. of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan; 2. Dept. of Information Management, Cheng Shiu University, Kaohsiung, Taiwan; 3. Dept. of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan

Abstract:	Data mining is most commonly used to induce association rules from transaction data. In earlier work, we combined fuzzy-set and genetic algorithm (GA) concepts to discover both useful fuzzy association rules and suitable membership functions from quantitative values. Evaluating the fitness values was, however, quite time-consuming. With the dramatic increase in available computing power and the accompanying decrease in computing costs over the last decade, parallel processing has become a feasible way to overcome this slow-learning problem. In this paper, we therefore propose a parallel genetic-fuzzy mining algorithm based on the master–slave architecture to extract both association rules and membership functions from quantitative transactions. The master processor maintains a single population, as a simple genetic algorithm does, and distributes the fitness-evaluation tasks to the slave processors. The evolutionary operations, such as crossover, mutation, and reproduction, are performed by the master processor. The proposed algorithm therefore maps naturally and efficiently onto the master–slave architecture. The time complexities of both the sequential and the parallel genetic-fuzzy mining algorithms are also analyzed, and the results show the advantage of the parallel algorithm: when the number of generations is large, the speed-up can be nearly linear. The experimental results confirm this. Applying the master–slave parallel architecture is thus a feasible way to overcome the slow fitness evaluation of the original genetic-fuzzy data mining algorithm.
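
The master–slave division of labor described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the chromosome encoding (a flat list of numbers in [0, 1]), the `toy_fitness` function, the selection scheme, and all parameter names are assumptions made for illustration. Only the overall shape comes from the abstract: one population held by the master, fitness evaluation handed out to workers, and the evolutionary operators run by the master alone.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def toy_fitness(chromosome):
    """Placeholder fitness (an assumption): best when every gene equals 0.5."""
    return -sum((g - 0.5) ** 2 for g in chromosome)


def crossover(a, b, rng):
    """One-point crossover of two equal-length parents."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]


def mutate(chrom, rng, rate=0.1, scale=0.1):
    """Perturb each gene with probability `rate`, clamped to [0, 1]."""
    return [
        min(1.0, max(0.0, g + rng.gauss(0, scale))) if rng.random() < rate else g
        for g in chrom
    ]


def master_slave_ga(fitness, n_workers=4, pop_size=20, n_genes=6,
                    generations=30, seed=0):
    """Run a simple GA (generations >= 1) with worker-side fitness evaluation."""
    rng = random.Random(seed)
    # The master holds the single population.
    population = [[rng.random() for _ in range(n_genes)]
                  for _ in range(pop_size)]
    best_history = []
    with ThreadPoolExecutor(max_workers=n_workers) as workers:
        for _ in range(generations):
            # Master distributes fitness evaluation to the workers ("slaves").
            scores = list(workers.map(fitness, population))
            ranked = [c for _, c in sorted(zip(scores, population),
                                           key=lambda pair: pair[0],
                                           reverse=True)]
            best_history.append(max(scores))
            # Master alone performs selection, crossover, and mutation.
            parents = ranked[: pop_size // 2]
            children = [
                mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
                for _ in range(pop_size - 1)
            ]
            population = [ranked[0]] + children  # keep the best (elitism)
    return ranked[0], best_history


if __name__ == "__main__":
    best, history = master_slave_ga(toy_fitness)
    print("best fitness per generation (last 3):", history[-3:])
```

Threads are used here only to keep the sketch short and easy to run. For a CPU-bound fitness function in CPython, swapping in `concurrent.futures.ProcessPoolExecutor` (with a module-level, picklable fitness function) would be the more realistic stand-in for separate slave processors.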

Keywords:	Data mining; Fuzzy set; Genetic algorithm; Parallel processing; Association rule
This article is indexed in ScienceDirect and other databases.
