首页 | 本学科首页   官方微博 | 高级检索  
     

基于超图正则化受限的概念分解算法
引用本文:李雪, 赵春霞, 舒振球, 郭剑辉. 基于超图正则化受限的概念分解算法[J]. 电子与信息学报, 2015, 37(3): 509-515. doi: 10.11999/JEIT140799
作者姓名:李雪  赵春霞  舒振球  郭剑辉
作者单位:南京理工大学计算机科学与工程学院南京 210094
基金项目:国家自然科学基金,中国博士后科学基金,江苏省社会安全图像与视频理解重点实验室基金,江苏省普通高校研究生科研创新计划项目(KYLX_0383)资助课题
摘    要:针对概念分解(Concept Factorization, CF)算法没有同时考虑样本中存在的类别信息及数据间多元几何结构信息的问题,该文提出一种基于超图正则化受限的概念分解(Hyper-graph regularized Constrained Concept Factorization, HCCF)算法。HCCF算法通过构建一个无向加权的拉普拉斯超图正则项,提取数据间的多元几何结构信息,克服了传统图模型只能表达数据间成对关系的缺陷;同时采用硬约束的方式使样本的类别信息在低维空间中保持一致,充分利用了标记样本的类别信息。该文采用乘性迭代的方法求解HCCF算法的目标函数并证明了其收敛性。在TDT2库、Reuters库和PIE库上的实验结果表明,HCCF算法提高了聚类的准确率和归一化互信息,验证了算法的有效性。

关 键 词:信息处理   概念分解   聚类   硬约束   超图   流形学习
收稿时间:2014-06-17
修稿时间:2014-10-15

Hyper-graph Regularized Constrained Concept Factorization Algorithm
Li Xue, Zhao Chun-Xia, Shu Zhen-Qiu, Guo Jian-Hui. Hyper-graph Regularized Constrained Concept Factorization Algorithm[J]. Journal of Electronics & Information Technology, 2015, 37(3): 509-515. doi: 10.11999/JEIT140799
Authors:Li Xue  Zhao Chun-xia  Shu Zhen-qiu  Guo Jian-hui
Abstract:The Concept Factorization (CF) algorithm can not take into account the label information and the multi-relationship of samples simultaneously. In this paper, a novel algorithm called Hyper-graph regularized Constrained Concept Factorization (HCCF) is proposed, which extracts the multi-geometry information of samples by constructing an undirected weighted hyper-graph Laplacian regularize term, hence overcomes the deficiency that traditional graph model expresses pair-wise relationship only. Meanwhile, HCCF takes full advantage of the label information of labeled samples as hard constraints, and it preserves label consistent in low-dimensional space. The objective function of HCCF is solved by the iterative multiplicative updating algorithm and its convergence is also proved. The experimental results on TDT2, Reuters, and PIE data sets show that the proposed approach achieves better clustering performance in terms of accuracy and normalized mutual information, and the effectiveness of the proposed approach is verified.
Keywords:Information processing  Concept Factorization(CF)  Cluster  Hard constraints  Hyper-graph  Manifold learning
本文献已被 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号