首页 | 本学科首页   官方微博 | 高级检索  
     

基于概率阈值Bagging算法的不平衡数据分类方法
引用本文:张忠林,吴挡平. 基于概率阈值Bagging算法的不平衡数据分类方法[J]. 计算机工程与科学, 2019, 41(6): 1086-1094
作者姓名:张忠林  吴挡平
作者单位:(兰州交通大学电子与信息工程学院,甘肃 兰州 730070)
基金项目:国家自然科学基金(61662043)
摘    要:类别不平衡问题广泛存在于现实生活中,多数传统分类器假定类分布平衡或误分类代价相等,因此类别不平衡数据严重影响了传统分类器的分类性能。针对不平衡数据集的分类问题,提出了一种处理不平衡数据的概率阈值Bagging分类方法-PT Bagging。将阈值移动技术与Bagging集成算法结合起来,在训练阶段使用原始分布的训练集进行训练,在预测阶段引入决策阈值移动方法,利用校准的后验概率估计得到对不平衡数据分类的最大化性能测量。实验结果表明,PT Bagging算法具有更好的处理不平衡数据的分类优势。

关 键 词:不平衡数据  阈值移动  Bagging集成学习  后验概率  
收稿时间:2018-06-11
修稿时间:2019-06-25

An imbalanced data classification methodbased on probability threshold Bagging
ZHANG Zhong lin,WU Dang ping. An imbalanced data classification methodbased on probability threshold Bagging[J]. Computer Engineering & Science, 2019, 41(6): 1086-1094
Authors:ZHANG Zhong lin  WU Dang ping
Affiliation:(School of Electronic and Information Engineering,Lanzhou Jiaotong University,Lanzhou 730070,China)
Abstract:The category imbalance problem exists widely in real life. Most of the traditional classifiers assume balanced class distribution or equal misclassification cost. However, when dealing with unbalanced data, their classification performance is seriously affected. Aiming at the classification problem of imbalanced data sets, we propose a probability threshold Bagging classification algorithm, called PT-Bagging to deal with unbalanced data. The algorithm combines the threshold-moving technique with the bagging ensemble algorithm, uses the original distributed training set for training in the training phase, introduces a decision threshold-moving method in the prediction phase, and employs the calibrated posterior probability estimation to obtain the maximized average performance measurement of the imbalanced data classification. Experimental results show that the PT-Bagging algorithm can better classify imbalanced data.
Keywords:imbalanced data  threshold moving  Bagging integrated learning  posterior probability  
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号