首页 | 本学科首页   官方微博 | 高级检索  
     

一种主动学习式P2P流识别方法
引用本文:戴磊,王源,刘科科.一种主动学习式P2P流识别方法[J].计算机应用研究,2012,29(2):717-721.
作者姓名:戴磊  王源  刘科科
作者单位:中国电子科学研究院,北京,100041
摘    要:P2P流的识别对于网络的维护与运营都具有重要意义,基于机器学习的流识别技术是目前研究的热点和难点内容,但目前仍然存在着建立分类模型需要大量适用的训练数据、训练数据的标记需要依赖领域专家以及因此而导致的工作量及难度过大和实用性不强等问题,而当前的研究工作很少涉及到这些问题的解决办法。针对这一问题,采用主动学习技术提取少量高质量的训练样本进行建模,并结合SVM分类算法提出了一种基于锦标赛选择的样本筛选方法。实验结果表明,其相对于已有的流识别方法,能够在仅依赖少量高质量训练样本的前提下,保证较高召回率及较低误报率,更适用于现实网络环境。

关 键 词:支持向量机  主动学习  机器学习

Peer-to-peer traffic identification using active learning
DAI Lei,WANG Yuan,LIU Ke-ke.Peer-to-peer traffic identification using active learning[J].Application Research of Computers,2012,29(2):717-721.
Authors:DAI Lei  WANG Yuan  LIU Ke-ke
Affiliation:(China Academy of Electronics & Information Technology,Beijing 100041,China)
Abstract:The identification of peer-to-peer(P2P) traffic is essential to network maintenance and management.Traffic identification using machine learning(ML) techniques has been an active and difficult research topic for recent years.However,there are still some unresolved and scarcely addressed problems such as the difficulties in obtaining adequate qualified training data for the supervised classifiers to model traffic patterns,the data acquisition task is always time-consuming and greatly relies on the domain experts.To solve the problem,this paper introduced active learning method to select the most qualified data for training and proposed a tournament selection based instance selection method for SVM.The experimental results demonstrate that the proposed method is able to guarantee high recall rate and low false positives by using a small quantity of high qualified data.Therefore,it is more suitable for the real network applications than the traditional ones.
Keywords:SVM(support vector machine)  active learning  machine learning
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号