首页 | 本学科首页   官方微博 | 高级检索  
     

调整聚类假设联合成对约束半监督分类方法
引用本文:黄华,郑佳敏,钱鹏江.调整聚类假设联合成对约束半监督分类方法[J].计算机应用,2018,38(11):3119-3126.
作者姓名:黄华  郑佳敏  钱鹏江
作者单位:江南大学 数字媒体学院, 江苏 无锡 214122
基金项目:国家自然科学基金资助项目(61772241,61702225);中央高校基本科研专项资金资助重点A类项目(JUSRP51614A);江苏省青蓝工程项目; 2016年江苏省"六大人才高峰"高层次人才项目(2016-XYDXXJS-014)。
摘    要:当不同类别的样本严重重叠在分类边界时,由于聚类假设不能很好地反映出数据的真实分布,基于聚类假设的半监督分类方法的性能,可能比与之对立的监督分类方法更差。针对上述不安全的半监督分类问题,提出了调整聚类假设联合成对约束半监督分类方法(ACA-JPC-S3VM)。一方面,它将单个未标记样本到数据分布边界的距离融入到模型的学习中,能够一定程度上缓解此类情况下算法性能的下降程度;另一方面,它将成对约束信息引入,弥补了模型对监督信息利用方面的不足。在UCI数据集上的实验结果表明,ACA-JPC-S3VM方法的性能绝不会低于支持向量机(SVM),且在标记样本数量为10时的平均准确率较SVM高出5个百分点;在图像分类数据集上的实验结果表明,直推式支持向量机(TSVM)等半监督分类方法出现了不同程度的不安全学习情形(即性能相近或低于SVM),而ACA-JPC-S3VM却能安全地学习。因此,ACA-JPC-S3VM具有更好的安全性与正确性。

关 键 词:半监督学习  分类  聚类假设  调整聚类假设  成对约束  
收稿时间:2018-04-28
修稿时间:2018-06-12

Adjusted cluster assumption and pairwise constraints jointly based semi-supervised classification method
HUANG Hua,ZHENG Jiamin,QIAN Pengjiang.Adjusted cluster assumption and pairwise constraints jointly based semi-supervised classification method[J].journal of Computer Applications,2018,38(11):3119-3126.
Authors:HUANG Hua  ZHENG Jiamin  QIAN Pengjiang
Affiliation:School of Digital Media, Jiangnan University, Wuxi Jiangsu 214122, China
Abstract:When samples from different classes over classification boundary are seriously overlapped, cluster assumption may not well reflect the real data distribution, so that semi-supervised classification methods based cluster assumption may yield even worse performance than their supervised counterparts. For the above unsafe semi-supervised classification problem, an Adjusted Cluster Assumption and Pairwise Constraints Jointly based Semi-Supervised Support Vector Machine classification method (ACA-JPC-S3VM) was proposed. On the one hand, the distances of individual unlabeled instances to the distribution boundary were considered in learning, which alleviated the degradation of the algorithm performance in such cases to some extent. On the other hand, the information of pairwise constraints was introduced to the algorithm to make up for its insufficient use of supervision information. The experimental results on the UCI dataset show that the performance of ACA-JPC-S3VM method would never be lower than that of SVM (Support Vector Machine), and the average accuracy is 5 percentage points higher than that of SVM when the number of labeled samples is 10. The experimental results on the image classification dataset show that the semi-supervised classification methods such as TSVM (Transductive SVM) have different degrees of unsafety learning (similar or worse performance than SVM) while ACA-JPC-S3VM can learn safely. Therefore, ACA-JPC-S3VM has better safety and correctness.
Keywords:semi-supervised learning                                                                                                                        classification                                                                                                                        cluster assumption                                                                                                                        adjusted cluster assumption                                                                                                                        pairwise constraint
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号