首页 | 本学科首页   官方微博 | 高级检索  
     

GBDT与LR融合模型在加密流量识别中的应用
引用本文:王垚,李为,吴克河,崔文超.GBDT与LR融合模型在加密流量识别中的应用[J].计算机与现代化,2020,0(3):93-98.
作者姓名:王垚  李为  吴克河  崔文超
作者单位:华北电力大学(北京)控制与计算机工程学院,北京 102206;华北电力大学(北京)控制与计算机工程学院,北京 102206;华北电力大学(北京)控制与计算机工程学院,北京 102206;华北电力大学(北京)控制与计算机工程学院,北京 102206
摘    要:随着网络应用服务类型的多样化以及网络流量加密技术的不断发展,加密流量识别已经成为网络安全领域的一个重大挑战。传统的流量识别技术如深度包检测无法有效地识别加密流量,而基于机器学习理论的加密流量识别技术则表现出很好的效果。因此,本文提出一种融合梯度提升决策树算法(GBDT)与逻辑回归(LR)算法的加密流量分类模型,使用贝叶斯优化(BO)算法进行超参数调整,利用与时间相关的流特征对普通加密流量与VPN加密流量进行识别,实现了整体高于90%的流量识别准确度,与其他常用分类模型相比拥有更好的识别效果。

关 键 词:加密流量识别  梯度提升决策树  逻辑回归  流特征  贝叶斯优化  
收稿时间:2020-03-30

Application of Fusion Model of GBDT and LR in Encrypted Traffic Identification
WANG Yao,LI Wei,WU Ke-he,CUI Wen-chao.Application of Fusion Model of GBDT and LR in Encrypted Traffic Identification[J].Computer and Modernization,2020,0(3):93-98.
Authors:WANG Yao  LI Wei  WU Ke-he  CUI Wen-chao
Abstract:With the diversification of network application service types and the continuous development of traffic encryption technology, encrypted traffic identification has become a major challenge in the field of network security. Traditional traffic identification techniques, such as deep packet inspection, cannot effectively identify encrypted traffic, while the identification technology based on machine learning theory has shown good results. For this, an optimized encrypted traffic classification model based on the fusion of GBDT and LR is proposed, in which Bayesian optimization (BO) algorithm is used for hyperparameter tuning. By using the time-related flow features to identify common encrypted traffic and VPN encrypted traffic, it obtains an overall accuracy more than 90%, which gets better recognition effect than other common classification models.
Keywords:encrypted traffic identification  GBDT (Gradient Boosting Decision Tree)  LR (Logistic Regression)  flow features  Bayesian optimization  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机与现代化》浏览原始摘要信息
点击此处可从《计算机与现代化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号