首页 | 本学科首页   官方微博 | 高级检索  
     


Towards Safe Semi-supervised Classification: Adjusted Cluster Assumption via Clustering
Authors:Yunyun Wang  Yan Meng  Zhenyong Fu  Hui Xue
Affiliation:1.Department of Computer Science and Engineering,Nanjing University of Posts and Telecommunications,Nanjing,People’s Republic of China;2.Jiangsu Key Laboratory of Big Data Security and Intelligent Processing,Nanjing,People’s Republic of China;3.School of Computer Science and Engineering,Southeast University,Nanjing,People’s Republic of China
Abstract:Semi-supervised classification methods can perform even worse than the supervised counterparts in some cases. It undoubtedly reduces their confidence in real applications, and it is desired to improve the safety of semi-supervised classification such that it never performs worse than the supervised counterpart. Considering that the cluster assumption may not well reflect the real data distribution, which can be one possible cause of unsafe learning, we develop a safe semi-supervised support vector machine method in this paper by adjusting the cluster assumption (ACA-S3VM for short). Specifically, when samples from different classes are seriously overlapped, the real boundary actually lies not in the low density region, which will not be found by the cluster assumption. However, an unsupervised clustering method is able to detect the real boundary in this case. As a result, we design ACA-S3VM by adjusting the cluster assumption with the help of clustering, which considers the distances of individual unlabeled instances to the distribution boundary in learning. Empirical results show the competition of ACA-S3VM compared with the off-the-shelf safe semi-supervised classification methods.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号