首页 | 本学科首页   官方微博 | 高级检索  
     


Clustering Analysis of Gene Expression Data based on Semi-supervised Visual Clustering Algorithm
Authors:Fu-lai Chung  Shitong Wang  Zhaohong Deng  Chen Shu  D. Hu
Affiliation:(1) Department of Computing, Hong Kong Polytechnic University, Hong Kong, China;(2) School of Information Engineering, Southern Yangtze University, Wuxi, China;(3) School of Automation, National Defense University of Science and Technology, Changsha, China
Abstract:When gene expression datasets contain some labeled data samples, the labeled information should be incorporated into clustering algorithm such that more reasonable clustering results can be achieved. In this paper, a novel semi-supervised clustering algorithm, Semi-supervised Iterative Visual Clustering Algorithm (Semi-IVCA), is presented to tackle with such datasets. The new algorithm first constructs the visual sampling image of the dataset based on visual theorem and obtains its attractors using the gradient learning rules, where each attractor denotes a cluster of the dataset. Then the new algorithm introduces an iterative clustering procedure to realize the semi-supervised learning. The new algorithm is a generalization of the current Visual Clustering Algorithm (VCA) presented by authors. Except for the advantage that Semi-IVCA can effectively utilize the labeled data information in clustering, it is robust and insensitive to initialization, and it has strong parameter learning capability and good interpretation for the clustering results. When the new algorithm Semi-IVCA is applied to the artificial and real gene expression datasets, the experimental results confirm the above advantages of algorithm Semi-IVCA.
Keywords:Semi-supervised learning  Visual clustering  Clustering analysis  Gene expression data  Gradient system
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号