首页 | 本学科首页   官方微博 | 高级检索  
     

流形上的非线性判别K均值聚类
引用本文:高丽平,周雪燕,詹宇斌.流形上的非线性判别K均值聚类[J].计算机应用,2011,31(12):3247-3251.
作者姓名:高丽平  周雪燕  詹宇斌
作者单位:1. 中原工学院 计算机学院,郑州 4500002. 国防科学技术大学 计算机学院, 长沙 410073
基金项目:国家自然科学基金资助项目,河南省科技攻关计划项目
摘    要:为提高具有流形结构的高维数据的聚类性能,提出非线性判别K均值聚类算法(NDisKmeans)。该方法通过引入流形上的谱正则化技术,将数据的低维嵌入表示成数据流形上平滑函数的线性组合,然后通过最大化低维空间中聚类类间的散度与总体散度的比值,来实现对高维数据的聚类。还设计了一种收敛的迭代求解方法来求解最优组合系数矩阵和聚类赋值矩阵。NDisKmeans方法由于考虑了数据的流形结构,克服了判别K均值算法中线性映射的不足,从而提高了对高维数据聚类的性能。最后在数据集上的广泛实验表明,NDisKmeans方法能有效实现对高维数据的聚类。

关 键 词:聚类    流形    K均值聚类    谱正则化    谱聚类
收稿时间:2011-07-04
修稿时间:2011-08-16

Nonlinear discriminant K-means clustering on manifold
GAO Li-ping,ZHOU Xue-yan,ZHAN Yu-bin.Nonlinear discriminant K-means clustering on manifold[J].journal of Computer Applications,2011,31(12):3247-3251.
Authors:GAO Li-ping  ZHOU Xue-yan  ZHAN Yu-bin
Affiliation:1. School of Computer, Zhongyuan University of Technology, Zhengzhou Henan 450000, China2. School of Computer, National University of Defense Technology, Changsha Hunan 410073, China
Abstract:In real applications in pattern recognition and computer vison, high dimensional data always lie approximately on a low dimensional manifold. How to improve the performance of clustering algorithm on high dimensional data by using the manifold structure is a research hotspot in machine learning and data mining community. In this paper, a novel clustering algorithm called Nonlinear Discriminant K-means Clustering (NDisKmeans), which has taken the manifold structure of high dimensional into account, is proposed. By introducing the spectracl regularization technology, NDisKmeans first represents the desired low dimensional coordinates as linear combinations of smooth vectors predefined on the data manifold; then maximizes the ratio between inter-clusters scatter and total scatter to cluster the high dimensional data. A convergent iterative procedure is devised to solute the matrix of the combination coefficient and clustering assignment matrix. NDisKmeans overcomes the limilation of linear mapping of DisKmeans algorithm; therefore, it significantly improves the clustering performance. The systematic and extensive experiments on UCI and real world data sets have shown the effectiveness of the proposed NDisKmeans method.
Keywords:clustering                                                                                                                        manifold                                                                                                                        K-means clustering                                                                                                                        spectral regularization                                                                                                                        spectral clustering
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号