首页 | 本学科首页   官方微博 | 高级检索  
     

具备历史借鉴能力的软划分聚类模型
引用本文:孙寿伟,钱鹏江,陈爱国,蒋亦樟.具备历史借鉴能力的软划分聚类模型[J].计算机应用,2015,35(2):435-439.
作者姓名:孙寿伟  钱鹏江  陈爱国  蒋亦樟
作者单位:江南大学 数字媒体学院, 江苏 无锡 214122
基金项目:国家自然科学基金资助项目,江苏省自然科学基金资助项目,江苏省产学研前瞻性研究项目
摘    要:在数据稀少或失真等场景下,传统软划分聚类算法无法获得满意的聚类效果。为解决该问题,以极大熵聚类算法为基础,基于历史知识利用的途径,提出两种新的具备历史借鉴能力的软划分聚类模型(分别简称SPBC-RHK-1和SPBC-RHK-2)。SPBC-RHK-1是仅借鉴历史类中心的基础模型,SPBC-RHK-2则是以历史类中心和历史隶属度相融合为手段的高级模型。通过历史知识借鉴,两种模型的聚类有效性均得到有效提高,比较而言具备更高知识利用能力的SPBC-RHK-2模型在聚类有效性和鲁棒性上具有更好的表现。由于所用历史知识不暴露历史源数据,因此两种方法还具有良好的历史数据隐私保护效果。最后在模拟数据集和真实数据集上的实验验证了上述优点。

关 键 词:软划分聚类算法    信息缺失或失真    历史知识    知识利用    隐私保护
收稿时间:2014-09-22
修稿时间:2014-11-12

Soft partition based clustering models with reference to historical knowledge
SUN Shouwei,QIAN Pengjiang,CHEN Aiguo,JIANG Yizhang.Soft partition based clustering models with reference to historical knowledge[J].journal of Computer Applications,2015,35(2):435-439.
Authors:SUN Shouwei  QIAN Pengjiang  CHEN Aiguo  JIANG Yizhang
Affiliation:School of Digital Media, Jiangnan University, Wuxi Jiangsu 214122, China
Abstract:Conventional soft partition based clustering algorithms usually cannot achieve desired clustering outcomes in the situations where the data are quite spare or distorted. To address this problem, based on maximum entropy clustering, by means of the strategy of historical knowledge learning, two novel soft partition based clustering models called SPBC-RHK-1 and SPBC-RHK-2 for short respectively were proposed. SPBC-RHK-1 is the basic model which only refers to the historical cluster centroids, whereas SPBC-RHK-2 is of advanced modality based on the combination of historical cluster centroids and historical memberships. In terms of the historical knowledge, the effectiveness of both algorithms was improved distinctly, and SPBC-RHK-2 method showed better effectiveness and robustness compared to the other method since its higher ability of utilizing knowledge. In addition, because the involved historical knowledge does not expose the historical raw data, both of these two approaches have good capacities of privacy protection for historical data. Finally, experiments were conducted on both artificial and real-world datasets to verify above merits.
Keywords:soft partition based clustering algorithm  impure data  historical knowlege  knowlege learning  privacy protection
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号