首页 | 本学科首页   官方微博 | 高级检索  
     

高阶异构数据模糊联合聚类算法
引用本文:黄少滨,杨欣欣,申林山,李艳梅.高阶异构数据模糊联合聚类算法[J].通信学报,2014,35(6):3-24.
作者姓名:黄少滨  杨欣欣  申林山  李艳梅
作者单位:哈尔滨工程大学 计算机科学与技术学院,黑龙江 哈尔滨 150001
基金项目:国家自然科学基金资助项目(71272216, 60903080, 60093009);国家科技支撑计划基金资助项目(2009BAH42B02, 2012BAH08B02);博士后科学基金资助项目(2012M510480);中央高校基本科研业务费专项基金资助项目(HEUCFZ1212, HEUCFT1208)
摘    要:为了更有效地分析聚簇重叠部分高阶异构数据的聚簇结果,提出了一种高阶异构数据模糊联合聚类(HFCC)算法,该算法最小化每个特征空间中对象与聚簇中心的加权距离。推导出对象隶属度和特征权重的迭代更新公式,设计出聚类过程的迭代算法,并且从理论上证明了该迭代算法的收敛性。另外,通过泛化XB指标,提出适用于评估高阶异构数据聚类质量的指标GXB,用于判断聚簇数目。实验表明,HFCC算法能够有效探测数据内部隐藏的重叠聚簇结构,并且HFCC算法聚类效果明显优于5种有代表性的硬划分算法,此外GXB指标能够有效判定高阶异构数据的聚簇数目。

关 键 词:高阶异构数据  联合聚类  模糊聚类

Fuzzy co-clustering algorithm for high-order heterogeneous data
Shao-bin HUANG,Xin-xin YANG,Lin-shan SHEN,Yan-mei LI.Fuzzy co-clustering algorithm for high-order heterogeneous data[J].Journal on Communications,2014,35(6):3-24.
Authors:Shao-bin HUANG  Xin-xin YANG  Lin-shan SHEN  Yan-mei LI
Affiliation:College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
Abstract:In order to analysis the clustering results of high-oredr heterogeneous data at the overlaps of different clusters more efficiently, we developped a fuzzy co-clustering algorithm for high-order heterogeneous data (HFCC). HFCC algorithm minimized distances between objects and centers of clusters in each feature space. The update rules for fuzzy memberships of objects and weights of features were derived, and then an iterative algorithm was designed for the clustering process. Additionally, convergence of iterative algorithm was proved. In order to estimate the number of clusters, GXB validity index was proposed by generalizing the XB validity index, which could measure the quality of high-order clustering results. Finally, experimental results show that HFCC can effeciently mine the overlapped clusters and the qualities of clustering results of HFCC are superior five classical hard high-order co-clustering algorithms. Additionally, GXB validity index can effeciently estimate the number of high-order clusters.
Keywords:high-order heterogeneous data  co-clustering  fuzzy clustering
点击此处可从《通信学报》浏览原始摘要信息
点击此处可从《通信学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号