首页 | 本学科首页   官方微博 | 高级检索  
     

基于隐含变量的聚类集成模型
引用本文:王红军,李志蜀,成飏,周鹏,周维.基于隐含变量的聚类集成模型[J].软件学报,2009,20(4):825-833.
作者姓名:王红军  李志蜀  成飏  周鹏  周维
作者单位:四川大学,计算机学院,四川,成都,610054
基金项目:Supported by the China Scholarship Council Foundation under Grant No.2007U24068 (国家留学基金委员会资助项目)
摘    要:聚类集成能成为机器学习活跃的研究热点,是因为聚类集成能够保护私有信息、分布式处理数据和对知识进行重用,此外,噪声和孤立点对结果的影响较小.主要工作包括:第一,分析了把每一个基聚类器看成是原数据的一个属性这种处理方式的优越性,发现按此方法建立起来的聚类集成算法就具有良好的扩展性和灵活性;第二,在此基础之上,建立了latent variable cluster ensemble(LVCE)概率模型进行聚类集成,并且给出了LVCE 模型的Markovchain Monte Carlo(MCMC)算法.实验结果表明,LVCE 模型的MCMC 算法能够进行聚类集成并且达到良好的效果,同时可以体现数据聚类的紧密程度.

关 键 词:聚类集成  隐含变量  聚类集成模型
收稿时间:2008/3/13 0:00:00
修稿时间:2008/8/11 0:00:00

A Latent Variable Model for Cluster Ensemble
WANG Hong-Jun,LI Zhi-Shu,CHENG Yang,ZHOU Peng and ZHOU Wei.A Latent Variable Model for Cluster Ensemble[J].Journal of Software,2009,20(4):825-833.
Authors:WANG Hong-Jun  LI Zhi-Shu  CHENG Yang  ZHOU Peng and ZHOU Wei
Affiliation:School of Computer Science;Sichuan University;Chengdu 610054;China
Abstract:Cluster ensemble becomes a research focus due to its success in privacy protection, distributing computing and reusing knowledge. Furthermore, the noise and isolation have little effect on the final result. Thereare two contributions in this paper. First, by regarding every base clustering as one attribute of the original data, it has found that the algorithm based on that is more extendable and flexible. Second, it designs a latent variable cluster ensemble (LVCE) model in this way and infers the algorithm of the model with Markov chain Monte Carlo (MCMC) approximation. At the end of the paper, the experimental results show that the MCMC algorithm of LVCE has a better result and can show the compactedness of data points clustering.
Keywords:MCMC(Markov chain Monte Carlo)
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号