首页 | 本学科首页   官方微博 | 高级检索  
     

子空间聚类的非参数模型及变分贝叶斯学习
引用本文:卿湘运,王行愚.子空间聚类的非参数模型及变分贝叶斯学习[J].计算机学报,2007,30(8):1333-1343.
作者姓名:卿湘运  王行愚
作者单位:华东理工大学信息科学与工程学院,上海,200237
基金项目:国家自然科学基金 , 高等学校博士学科点专项科研项目
摘    要:子空间聚类的目标是在不同的特征子集上对给定的一组数据归类.此非监督学习方法试图发现数据"在不同表达下的相似"模式,并且引起了相关领域大量的关注和研究.首先扩展Hoff提出的"均值与方差平移"模型为一个新的基于特征子集的非参数聚类模型,其优点是能应用变分贝叶斯方法学习模型参数.此模型结合Dirichlet过程混合模型和选择特征子集的非参数模型,能自动选择聚类个数和进行子空间聚类.然后给出基于马尔可夫链蒙特卡罗的参数后验推断算法.出于计算速度上的考虑,提出应用变分贝叶斯方法学习模型参数.在仿真数据上的实验结果及在人脸聚类问题上的应用均表明了此模型能同时选择相关特征和在这些特征上具有相似模式的数据点.在UCI"多特征数据库"上应用无需抽样的变分贝叶斯方法,其实验结果说明此方法能快速推断模型参数.

关 键 词:混合模型  Dirichlet过程  非参数贝叶斯  马尔可夫链蒙特卡罗  变分学习  子空间聚类  非参数模型  变分  贝叶斯  学习  Clustering  Subspace  Bayesian  Learning  Variational  Model  快速  抽样  特征数据库  相似模式  相关特征  聚类问题  人脸  结果  实验  仿真数据
修稿时间:2007-02-05

Nonparametric Model and Variational Bayesian Learning for Subspace Clustering
QING Xiang-Yun,WANG Xing-Yu.Nonparametric Model and Variational Bayesian Learning for Subspace Clustering[J].Chinese Journal of Computers,2007,30(8):1333-1343.
Authors:QING Xiang-Yun  WANG Xing-Yu
Affiliation:College of Information Science and Technology, East China University of Science and Technology, Shanghai 200237
Abstract:The goal of subspace clustering is to group a given set of data represented by different feature subsets. As an unsupervised learning method, subspace clustering tries to discover the patterns of "similarity examined under different presentations" and has received a great deal of interest and research in the related domains. Firstly the "mean and variance shift" model proposed by Hoff is extended to a new nonparametric model of subspace clustering based on subsets of features.The advantage of the model is that variational Bayesian method can be applied. The model based on the integration of a Dirichlet process mixture model and a nonparametric model of selecting subsets of features can automatically choose the number of clusters and perform subspace clustering. Then posterior inference of the model is done using Markov Chain Monte Carlo. Due to computational considerations the authors propose a variational Bayesian method to learn the parameters of the model. Experimental results using simulated data and the application to the problem of clustering face images illustrate the model can simultaneously selecting the relevant features and the data points that have similar pattern under these features. Experiments on the "multiple feature database" from the UCI repository show that variational Bayesian method without sampling can fleetly inference the parameters of the model.
Keywords:mixture model  Dirichlet process  nonparametric Bayes  Markov chain Monte Carlo  variational learning
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号