首页 | 本学科首页   官方微博 | 高级检索  
     

一种面向多类不平衡协议流量的改进AdaBoost.M2算法
引用本文:张仁斌,张杰,吴佩.一种面向多类不平衡协议流量的改进AdaBoost.M2算法[J].计算机应用研究,2019,36(6).
作者姓名:张仁斌  张杰  吴佩
作者单位:合肥工业大学计算机与信息学院,合肥,230009;合肥工业大学计算机与信息学院,合肥,230009;合肥工业大学计算机与信息学院,合肥,230009
摘    要:针对AdaBoost。M2算法在解决多类不平衡协议流量的分类问题时存在不足,提出一种适用于因特网协议流量多类不平衡分类的集成学习算法RBWS-ADAM2,本算法在AdaBoost。M2每次迭代过程中,设计了基于权重的随机平衡重采样策略对训练数据进行预处理,该策略利用随机设置采样平衡点的重采样方式来更改多数类和少数类的样本数目占比,以构建多个具有差异性的训练集,并将样本权重作为样本筛选的依据,尽可能保留高权重样本,以加强对此类样本的学习。在国际公开的协议流量数据集上将RBWS-ADAM2算法与其他类似算法进行实验比较表明,相比于其他算法,该算法不仅对部分少数类的F-measure有较大提升,更有效提高了集成分类器的总体G-mean和总体平均F-measure,明显增强了集成分类器的整体性能。

关 键 词:流量分类  集成学习算法  多类不平衡  泛化性能
收稿时间:2018/1/16 0:00:00
修稿时间:2019/5/5 0:00:00

Improved AdaBoost.M2 algorithm for multiclass imbalanced protocol traffic
zhangrenbin,zhangjie and wupei.Improved AdaBoost.M2 algorithm for multiclass imbalanced protocol traffic[J].Application Research of Computers,2019,36(6).
Authors:zhangrenbin  zhangjie and wupei
Affiliation:Hefei University of Technology,,
Abstract:The existing AdaBoost. M2 algorithm are insufficient in protocol traffic multiclass imbalance to solve the problem. So, this thesis proposes an ensemble algorithom called RBWS-ADAM2 for the classification of multiclass internet traffic. During each iteration of AdaBoost. M2, this algorithm preprocessed the training dataset by randomly balanced resampling, this strategy changed the number of majorities and minorities by randomly setting the sampling balance point to build multiple different training datasets. Moreover, this strategy toke sample weight as the basis for sample screening to strengthen the learning of this kind of sample. The experimental comparison of RBWS-ADAM2 algorithm and other similar algorithms on the internationally published protocol traffic datasets shows that, compared to other algorithms, the proposed RBWS-ADAM2 algorithm not only improves the F-Measure of most minorities, but increases the overall G-mean and the overall average F-measure effectively, and obviously enhances the overall performance of the ensemble classifier.
Keywords:traffic classification  ensemble algorithm  multiclass imbalance  generalization performance
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号