首页 | 本学科首页   官方微博 | 高级检索  
     


Extended Gaussian kernel version of fuzzy c-means in the problem of data analyzing
Authors:S Ramathilagam  Yueh-Min Huang
Affiliation:1. Department of Engineering Science, National Cheng Kung University, No. 1, Ta-Hsueh Road, Tainan 701, Taiwan, ROC;2. Department of Applied Geoinformatics, Chia Nan University of Pharmacy & Science, No. 60, Erh-Jen RD., Sec.1, Jen-Te, Tainan 717, Taiwan, ROC;1. State Key Lab of Mechanical System and Vibration, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;2. Department of Systems Engineering & Engineering Management, City University of Hong Kong, Hong Kong, China;3. State Key Laboratory of High Performance Complex Manufacturing, Central South University, Changsha, Hunan, China;4. Department of Automation, Shanghai Jiao Tong University, Shanghai 200240, China;1. College of Life Science, Jilin University, Jilin, Changchun 130012, China;2. Ocean College, Hainan University, Hainan, Haikou 570228, China;1. Weather Information Service Engine Institute, Hankuk University of Foreign Studies, Republic of Korea;2. Department of Mathematics, Pusan National University, Busan, Republic of Korea
Abstract:Fuzzy c-means clustering with spatial constraints is considered as suitable algorithm for data clustering or data analyzing. But FCM has still lacks enough robustness to employ with noise data, because of its Euclidean distance measure objective function for finding the relationship between the objects. It can only be effective in clustering ‘spherical’ clusters, and it may not give reasonable clustering results for “non-compactly filled” spherical data such as “annular-shaped” data. This paper realized the drawbacks of the general fuzzy c-mean algorithm and it tries to introduce an extended Gaussian version of fuzzy C-means by replacing the Euclidean distance in the original object function of FCM. Firstly, this paper proposes initial kernel version of fuzzy c-means to aim at simplifying its computation and then extended it to extended Gaussian kernel version of fuzzy c-means. It derives an effective method to construct the membership matrix for objects, and it derives a robust method for updating centers from extended Gaussian version of fuzzy C-means. Furthermore, this paper proposes a new prototypes learning method and it obtains initial cluster centers using new mathematical initialization centers for the new effective objective function of fuzzy c-means, so that this paper tries to minimize the iteration of algorithms to obtain more accurate result. Initial experiment will be done with an artificially generated data to show how effectively the new proposed Gaussian version of fuzzy C-means works in obtaining clusters, and then the proposed methods can be implemented to cluster the Wisconsin breast cancer database into two clusters for the classes benign and malignant. To show the effective performance of proposed fuzzy c-means with new initialization of centers of clusters, this work compares the results with results of recent fuzzy c-means algorithm; in addition, it uses Silhouette method to validate the obtained clusters from breast cancer datasets.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号