
Semi-supervised Gaussian process classification algorithm addressing the class imbalance
Citation: Zhan-guo XIA, Shi-xiong XIA, Shi-yu CAI, Ling WAN. Semi-supervised Gaussian process classification algorithm addressing the class imbalance[J]. Journal on Communications, 2013, 34(5): 5-51. DOI: 10.3969/j.issn.1000-436x.2013.05.005
Authors: Zhan-guo XIA, Shi-xiong XIA, Shi-yu CAI, Ling WAN
Affiliation: School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
Funding: National Natural Science Foundation of China (50674086); Doctoral Program Foundation of the Ministry of Education of China (20110095110010)
Abstract: Traditional supervised learning methods struggle with real-world datasets in which labeled data are scarce and the training set is class-imbalanced. To address this, a semi-supervised Gaussian process classification algorithm for class-imbalanced data is proposed. The algorithm adopts the self-training approach to semi-supervised learning: it uses Gaussian process classification to compute posterior probabilities and, based on them, assigns class labels to unlabeled data, yielding additional accurate and reliable labeled samples. This makes the class distribution of the training set relatively balanced and lets the classifier adapt to achieve better classification performance. Experimental results show that, when the training set is class-imbalanced and labeled data are very scarce, the algorithm obtains effective labels through self-training and improves classification accuracy compared with related methods, offering a new approach to classifying class-imbalanced data.

Keywords: class imbalance; semi-supervised; Gaussian process classification; self-training
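
The self-training procedure summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `posterior_positive` is a simple kernel-weighted vote standing in for the Gaussian process posterior, and the confidence threshold, the minority-class-first selection rule, and all function and parameter names are hypothetical choices made here, since the abstract does not specify them.

```python
import math


def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential similarity between two points."""
    sq_dist = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-sq_dist / (2.0 * length_scale ** 2))


def posterior_positive(x, labeled, length_scale=1.0, eps=1e-9):
    """Stand-in for the GP posterior p(y = +1 | x).

    Assumption: a kernel-weighted vote replaces real GP inference so the
    sketch stays dependency-free; a GP classifier would be plugged in here.
    """
    pos = sum(rbf_kernel(x, xi, length_scale) for xi, yi in labeled if yi == 1)
    total = sum(rbf_kernel(x, xi, length_scale) for xi, _ in labeled)
    return (pos + eps) / (total + 2.0 * eps)


def self_train(labeled, unlabeled, threshold=0.9, per_round=1, max_rounds=20):
    """Self-training loop that prefers confident minority-class pseudo-labels.

    labeled:   list of (point, label) pairs with labels in {+1, -1}
    unlabeled: list of points
    Returns the enlarged labeled set and the points left unlabeled.
    """
    labeled = list(labeled)
    pool = list(unlabeled)
    for _ in range(max_rounds):
        if not pool:
            break
        n_pos = sum(1 for _, y in labeled if y == 1)
        n_neg = len(labeled) - n_pos
        minority = 1 if n_pos <= n_neg else -1

        # Score every unlabeled point and keep only confident predictions.
        candidates = []
        for x in pool:
            p = posterior_positive(x, labeled)
            label = 1 if p >= 0.5 else -1
            confidence = max(p, 1.0 - p)
            if confidence >= threshold:
                candidates.append((confidence, label, x))
        if not candidates:
            break

        # Hypothetical balancing rule: take minority-class candidates first,
        # falling back to any confident candidate when none exist.
        preferred = [c for c in candidates if c[1] == minority] or candidates
        preferred.sort(key=lambda c: c[0], reverse=True)
        for _, label, x in preferred[:per_round]:
            labeled.append((x, label))
            pool.remove(x)
    return labeled, pool


# Toy imbalanced data: one positive and four negatives are labeled.
demo_labeled = [((0.0, 0.0), 1), ((4.0, 4.0), -1), ((4.2, 3.9), -1),
                ((3.8, 4.1), -1), ((4.1, 4.3), -1)]
demo_unlabeled = [(0.2, 0.1), (-0.1, 0.3), (0.3, -0.2),
                  (3.9, 3.8), (4.3, 4.0), (2.0, 2.0)]
demo_result, demo_remaining = self_train(demo_labeled, demo_unlabeled)
```

In this toy run the loop first adds pseudo-labels for the under-represented positive class and only then for the majority class. A point lying between the two clusters never reaches the confidence threshold, so it stays unlabeled rather than receiving an unreliable label.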