首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于数据相关性的半监督模糊聚类集成方法
引用本文:冯晨菲,杨燕,王红军,徐英歌,王韬.一种基于数据相关性的半监督模糊聚类集成方法[J].计算机科学,2015,42(6):41-45.
作者姓名:冯晨菲  杨燕  王红军  徐英歌  王韬
作者单位:西南交通大学信息科学与技术学院 成都610031
基金项目:本文受国家自然科学基金(61170111,2),西南交通大学牵引动力国家重点实验室自主研究课题(2012TPL_T15)资助
摘    要:现有的半监督聚类集成方法能利用先验信息,使集成的准确性、鲁棒性和稳定性得到提高,但在集成阶段加入成对约束信息时,只考虑了给定的约束信息而忽视了约束点与被约束点的邻域点之间的关系.针对此问题,提出了一种基于数据相关性的半监督模糊聚类集成方法.该方法首先利用半监督模糊聚类算法建立集成信息矩阵,并将其转换为相似性矩阵;然后,利用已知的约束信息及约束点与被约束点的邻域点之间的关系来修改相似性矩阵;最后,利用图划分算法得到最终的聚类结果.真实数据上的实验结果表明,提出的方法可以有效提高聚类质量.

关 键 词:半监督聚类集成  模糊聚类  成对约束  邻域点

Semi-supervised Fuzzy Clustering Ensemble Approach with Data Correlation
FENG Chen-fei,YANG Yan,WANG Hong-jun,XU Ying-ge and WANG Tao.Semi-supervised Fuzzy Clustering Ensemble Approach with Data Correlation[J].Computer Science,2015,42(6):41-45.
Authors:FENG Chen-fei  YANG Yan  WANG Hong-jun  XU Ying-ge and WANG Tao
Affiliation:School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China,School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China,School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China,School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China and School of Information Science and Technology,Southwest Jiaotong University,Chengdu 610031,China
Abstract:Semi-supervised clustering ensemble has emerged as a powerful machine learning paradigm that provides improved precision,robustness and stability by taking advantage of prior information,while most of them only consider the given pairwise constraints and do not consider the neighbors around the data points constrained in the ensemble step.In this paper,a semi-supervised fuzzy clustering ensemble with data correlation(SFCEDC)was proposed to overcome this defect.Firstly,an ensemble information matrix is built by primarily exploiting the results of semi-supervised fuzzy clustering and a similarity matrix is constructed by aggregating much information of the ensemble information matrix.And then this matrix is modified by using the given constraints and the neighbors around the data points constrained.Finally,a graph partitioning algorithm is employed to get the final clustering results.Experimental results on UCI datasets demonstrate that the proposed approach can improve clustering performance effectively.
Keywords:Semi-supervised clustering ensemble  Fuzzy clustering  Pairwise constraints  Neighbors points
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号