首页 | 本学科首页   官方微博 | 高级检索  
     

半监督学习在不平衡样本集分类中的应用研究
引用本文:于重重,商利利,谭 励,涂序彦,杨 扬.半监督学习在不平衡样本集分类中的应用研究[J].计算机应用研究,2013,30(4):1085-1089.
作者姓名:于重重  商利利  谭 励  涂序彦  杨 扬
作者单位:1. 北京工商大学 计算机与信息工程学院, 北京 100048; 2. 北京科技大学 计算机与通信工程学院, 北京 100083
基金项目:国家自然科学基金资助项目(61070182); 北京市组织部优秀人才资助项目(2010D005003000008); 北京市学科建设项目(PXM2012_014213_0000_74, PXM2012_014213_0000_23)
摘    要:在对不平衡样本集进行分类时容易产生少数类样误差大的问题,而目前半监督学习中的算法多数是针对未有明显此类特征的数据集。针对一种半监督协同分类算法在该问题上的有效性进行了研究。由于进一步增强了分类器差异性,该算法在理论上对不平衡样本集具有良好的分类性能。根据该算法建立分类模型,利用其对桥梁结构健康数据进行分类实验,与Tri-Training算法的结果比较表明,该算法对不平衡样本集具有良好的适用性,从而验证了上述算法的有效性。

关 键 词:不平衡样本集  半监督协同分类方法  分类器差异性  分类模型  桥梁结构健康数据

Semi-supervised learning in imbalanced sample set classification
YU Chong-chong,SHANG Li-li,TAN Li,TU Xu-yan,YANG Yang.Semi-supervised learning in imbalanced sample set classification[J].Application Research of Computers,2013,30(4):1085-1089.
Authors:YU Chong-chong  SHANG Li-li  TAN Li  TU Xu-yan  YANG Yang
Affiliation:1. School of Computer & Information Engineering, Beijing Technology & Business University, Beijing 100048, China; 2. School of Computer & Communication Engineering, University of Science & Technology Beijing, Beijing 100083, China
Abstract:Higher error rate emerged in the minority class of samples when make classification on imbalanced sample set, but most algorithms in semi-supervised learning are based on normal data set. This paper studied the effectiveness of a semi-supervised collaboration classification method. Because of the further enhanced classifier difference, this algorithm had good performance on classification of imbalanced sample set. It established classification model based on the above algorithm, and used this model to make classification with bridge structural health monitoring data, the compared results of which demonstrated the applicability to imbalanced sample set. Therefore it validated the effectiveness of the algorithm.
Keywords:imbalanced sample set  semi-supervised collaboration classification method  classifier difference  classification model  bridge structural health data
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号