Clustering Analysis of Gene Expression Data based on Semi-supervised Visual Clustering Algorithm |
| |
Authors: | Fu-lai Chung, Shitong Wang, Zhaohong Deng, Chen Shu, D. Hu |
| |
Affiliation: | (1) Department of Computing, Hong Kong Polytechnic University, Hong Kong, China; (2) School of Information Engineering, Southern Yangtze University, Wuxi, China; (3) School of Automation, National Defense University of Science and Technology, Changsha, China |
| |
Abstract: | When a gene expression dataset contains some labeled samples, the label information should be incorporated into the clustering algorithm so that more reasonable clustering results can be achieved. In this paper, a novel semi-supervised clustering algorithm, the Semi-supervised Iterative Visual Clustering Algorithm (Semi-IVCA), is presented to handle such datasets. The new algorithm first constructs the visual sampling image of the dataset based on visual theorem and obtains its attractors using gradient learning rules, where each attractor denotes a cluster of the dataset. It then introduces an iterative clustering procedure to realize semi-supervised learning. The new algorithm is a generalization of the existing Visual Clustering Algorithm (VCA) previously presented by the authors. Besides effectively utilizing labeled data in clustering, Semi-IVCA is robust and insensitive to initialization, has strong parameter learning capability, and gives clustering results that are easy to interpret. Experimental results on artificial and real gene expression datasets confirm these advantages. |
| |
Keywords: | Semi-supervised learning; Visual clustering; Clustering analysis; Gene expression data; Gradient system |
This article is indexed in SpringerLink and other databases. |
|
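The attractor step mentioned in the abstract (each attractor of a gradient system denotes one cluster) can be illustrated with a minimal sketch. This is not the authors' Semi-IVCA: it assumes the "visual sampling image" can be approximated by a Gaussian-smoothed density of the data, and it swaps in a plain mean-shift update as the gradient ascent rule. The function name `find_attractors` and the parameters `sigma`, `n_iter`, `tol`, and `merge_tol` are hypothetical, and the semi-supervised iteration is not shown because the abstract does not specify it.

```python
import numpy as np

def find_attractors(X, sigma=1.0, n_iter=200, tol=1e-6, merge_tol=1e-2):
    """Move each point uphill on a Gaussian-smoothed density (assumed
    stand-in for the visual sampling image) and group points by the
    mode, or attractor, that they converge to."""
    X = np.asarray(X, dtype=float)
    Z = X.copy()  # current position of every point's trajectory
    for _ in range(n_iter):
        # Squared distances between trajectory positions and the data.
        d2 = ((Z[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
        W = np.exp(-d2 / (2.0 * sigma ** 2))
        # Mean-shift update: a gradient ascent step on the smoothed
        # density with an adaptive step size.
        Z_new = (W @ X) / W.sum(axis=1, keepdims=True)
        shift = np.abs(Z_new - Z).max()
        Z = Z_new
        if shift < tol:
            break
    # Trajectories that end within merge_tol of each other share an attractor.
    attractors = []
    labels = np.empty(len(X), dtype=int)
    for i, z in enumerate(Z):
        for k, a in enumerate(attractors):
            if np.linalg.norm(z - a) < merge_tol:
                labels[i] = k
                break
        else:
            attractors.append(z)
            labels[i] = len(attractors) - 1
    return np.array(attractors), labels
```

In this sketch the scale `sigma` controls how many attractors survive: a larger value blurs nearby groups into one cluster. A semi-supervised variant would presumably use the labeled samples to guide such choices, but how Semi-IVCA does so is not stated in the abstract.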