首页 | 本学科首页   官方微博 | 高级检索  
     


Non-dominated sorting genetic algorithm using fuzzy membership chromosome for categorical data clustering
Affiliation:1. Centre for Biomedical Engineering, Transportation Research Alliance, Universiti Teknologi Malaysia, Skudai, Malaysia;2. Faculty of Bioscience and Medical Engineering, Universiti Teknologi Malaysia, Skudai, Malaysia;1. Department of Computer Engineering, Electrical Engineering and Applied Mathematics, University of Salerno, 84084 Fisciano, Italy;2. CORISA, Department of Computer Science, University of Salerno, 84084 Fisciano, Italy;1. School of Electrical Engineering and Automation, Tianjin Polytechnic University, Tianjin 300387, China;2. Tianjin Key Laboratory of Advanced Technology of Electrical Engineering and Energy, Tianjin 300387, China;1. College of Mathematics, Physics and Information Engineering, Jiaxing University, Jiaxing 314001, China;2. College of Engineering, Shaoxing University, Shaoxing 312000, China
Abstract:
In this research, a data clustering algorithm named as non-dominated sorting genetic algorithm-fuzzy membership chromosome (NSGA-FMC) based on K-modes method which combines fuzzy genetic algorithm and multi-objective optimization was proposed to improve the clustering quality on categorical data. The proposed method uses fuzzy membership value as chromosome. In addition, due to this innovative chromosome setting, a more efficient solution selection technique which selects a solution from non-dominated Pareto front based on the largest fuzzy membership is integrated in the proposed algorithm. The multiple objective functions: fuzzy compactness within a cluster (π) and separation among clusters (sep) are used to optimize the clustering quality. A series of experiments by using three UCI categorical datasets were conducted to compare the clustering results of the proposed NSGA-FMC with two existing methods: genetic algorithm fuzzy K-modes (GA-FKM) and multi-objective genetic algorithm-based fuzzy clustering of categorical attributes (MOGA (π, sep)). Adjusted Rand index (ARI), π, sep, and computation time were used as performance indexes for comparison. The experimental result showed that the proposed method can obtain better clustering quality in terms of ARI, π, and sep simultaneously with shorter computation time.
Keywords:Categorical attributes  Multi-objective optimization  Genetic algorithm  Fuzzy clustering
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号