首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的多视图聚类集成算法
引用本文:邓强,杨燕,王浩.一种改进的多视图聚类集成算法[J].计算机科学,2017,44(1):65-70.
作者姓名:邓强  杨燕  王浩
作者单位:西南交通大学信息科学与技术学院 成都610031,西南交通大学信息科学与技术学院 成都610031,西南交通大学信息科学与技术学院 成都610031
基金项目:本文受国家自然科学基金(61170111,61572407,61134002),国家科技支撑计划课题(2015BAH19F02),四川省科技支撑计划项目(2014SZ0207)资助
摘    要:近年来,针对大数据的数据挖掘技术和机器学习算法研究变得日趋重要。在聚类领域,随着多视图数据的大量出现,多视图聚类已经成为了一类重要的聚类方法。然而,大多数现有的多视图聚类算法受算法参数设置、数据样本等影响,具有聚类结果不稳定、参数需要反复调节等缺点。基于多视图K-means算法和聚类集成技术,提出了一种改进的多视图聚类集成算法,其提高了聚类的准确性、鲁棒性和稳定性。其次,由于单机环境下的多视图聚类算法难以对海量的数据进行处理,结合分布式处理技术,实现了一种分布式的多视图并行聚类算法。实验证明,并行算法在处理大数据时的时间效率有很大提升,适合于大数据环境下的多视图聚类分析。

关 键 词:多视图聚类  聚类集成  分布式计算  并行化
收稿时间:2015/9/21 0:00:00
修稿时间:2015/11/29 0:00:00

Improved Multi-view Clustering Ensemble Algorithm
DENG Qiang,YANG Yan and WANG Hao.Improved Multi-view Clustering Ensemble Algorithm[J].Computer Science,2017,44(1):65-70.
Authors:DENG Qiang  YANG Yan and WANG Hao
Affiliation:School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China,School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China and School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China
Abstract:In recent years,data mining and machine learning algorithms for big data become increasingly important.In the clustering,with the appearance of multi-view data,multi-view clustering has become an important clustering method.However,many existing multi-view clustering algorithms are easily affected by parameter setting and dataset itself,so the clustering results are usually unstable.To overcome this problem,we presented a new multi-view clustering ensemble algorithm based on the multi-view K-means clustering algorithm in this paper.This algorithm uses ensemble technique to improve the multi-view K-means algorithm performance,increasing the accuracy,robustness,and stability of clustering results.It is well known that one single computer cannot process too much data,because one computer has the limited computation resources.To improve the efficiency of multi-view clustering,we implemented a distributed multi-view clustering ensemble algorithm based on distributed processing technology.Experimental results show that the proposed approach has higher efficiency when processing large dataset,and it is suitable for multi-view clustering in big data environment.
Keywords:Multi-view clustering  Clustering ensemble  Distributed Computation  Parallelization
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号