首页 | 本学科首页   官方微博 | 高级检索  
     

面向类不平衡网络流量的特征选择算法
引用本文:唐宏,刘丹,姚立霜,王云锋,裴作飞.面向类不平衡网络流量的特征选择算法[J].电子与信息学报,2021,43(4):923-930.
作者姓名:唐宏  刘丹  姚立霜  王云锋  裴作飞
作者单位:1.重庆邮电大学通信与信息工程学院 重庆 4000652.移动通信技术重庆市重点实验室 重庆 400065
基金项目:长江学者和创新团队发展计划(IRT_16R72)
摘    要:针对网络流量分类过程中出现的类不平衡问题,该文提出一种基于加权对称不确定性(WSU)和近似马尔科夫毯(AMB)的特征选择算法。首先,根据类别分布信息,定义了偏向于小类别的特征度量,使得与小类别具有强相关性的特征更容易被选择出来;其次,充分考虑特征与类别间、特征与特征之间的相关性,利用加权对称不确定性和近似马尔科夫毯删除不相关特征及冗余特征;最后,利用基于相关性度量的特征评估函数以及序列搜索算法进一步降低特征维数,确定最优特征子集。实验表明,在保证算法整体分类精确率的前提下,算法能够有效提高小类别的分类性能。

关 键 词:流量分类    特征选择    类不平衡    加权对称不确定性    近似马尔科夫毯
收稿时间:2019-12-11

Feature Selection Algorithm for Class Imbalanced Internet Traffic
Hong TANG,Dan LIU,LiShuang YAO,Yunfeng WANG,Zuofei PEI.Feature Selection Algorithm for Class Imbalanced Internet Traffic[J].Journal of Electronics & Information Technology,2021,43(4):923-930.
Authors:Hong TANG  Dan LIU  LiShuang YAO  Yunfeng WANG  Zuofei PEI
Affiliation:1.School of Communication and Information Engineering, Chongqing University of Posts and Communications, Chongqing 400065, China2.Key Laboratory of Mobile Communications Technology, Chongqing University of Posts and Communications, Chongqing 400065, China
Abstract:Class imbalance always exists in the process of network traffic classification. Considering the problem, a new feature selection algorithm using Weighted Symmetric Uncertainty (WSU) and Approximate Markov Blanket (AMB) is proposed. Firstly, a feature metric is defined using category distribution information, which is biased to minority classes. This makes it easier pick out features which have strong correlation with minority classes. Then, considering the correlation between features and categories and between features and features, the weighted symmetry uncertainty and approximate Markov blanket are used to delete the unrelated features and redundant features. Finally, the feature dimension is further reduced to determine the optimal feature subset, by using feature evaluation functions based on correlation measures and sequence search algorithms. The experimental results demonstrate that the algorithm can effectively improve the classification performance of minority classes without sacrificing the accuracy of the overall classification.
Keywords:
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号