首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于改进差分进化算法的源码漏洞检测模型的冷启动方法
引用本文:袁子龙,吴秋新,刘韧,秦宇.一种基于改进差分进化算法的源码漏洞检测模型的冷启动方法[J].计算机应用研究,2023,40(7).
作者姓名:袁子龙  吴秋新  刘韧  秦宇
作者单位:北京信息科技大学 理学院,北京信息科技大学 理学院,北京卓识网安技术股份有限公司,中国科学院软件研究所 可信计算与信息保障实验室
基金项目:国家自然科学基金资助项目(61872343)
摘    要:源码漏洞检测作为重要的研究课题,其传统方法在面对规模庞大、脆弱性多样化的代码时,存在人工参与度高、未知漏洞检测能力弱等诸多问题。针对以上问题,基于开放源代码的语法语义信息,提出了改进差分进化算法的源码漏洞检测模型冷启动优化方法。运用源码切片技术、启发式优化算法及神经网络模型,解决了漏洞检测模型在启动之初超参数无法准确选定的问题。对于实验中样本信息冗余和正负样本鲜明特征混和的情况,提出了正负样本鲜明特征交叉剔除的思想以减小模型的漏报率及误报率。实验表明,该方法可以有效加速模型的收敛,使得模型在10个epoch内达到稳定,在提升源码漏洞检测模型准确率的同时其收敛速度比其他模型提升了2~3倍。在后续改进实验中,源码漏洞检测模型在所有类型漏洞的准确率上均提高了1~3个百分点,充分证明了改进措施的有效性。该方法的优化策略和改进措施同样适用于其他神经网络分类模型,可以为漏洞检测领域探索新方法和新模型提供思路。

关 键 词:语法语义    改进差分进化    漏洞检测    BiGRU
收稿时间:2022/11/17 0:00:00
修稿时间:2023/6/13 0:00:00

Cold start method for source code vulnerability detection model based on improved differential evolution algorithm
Yuan Zilong,Wu Qiuxin,Liu Ren and Qin Yu.Cold start method for source code vulnerability detection model based on improved differential evolution algorithm[J].Application Research of Computers,2023,40(7).
Authors:Yuan Zilong  Wu Qiuxin  Liu Ren and Qin Yu
Affiliation:School of Applied Science,Beijing Information Science and Technology University,,,
Abstract:As an important research topic, source code vulnerability detection has many problems in its traditional methods, such as high manual participation, weak detection ability of unknown vulnerabilities. Aiming at the above problems, based on the syntactic and semantic information of open source code and improved differential evolution algorithm, this paper proposed a cold start optimization method for source code vulnerability detection model. This paper used source code slicing technology, heuristic optimization algorithms and neural network models, which solved the problem that the hyperparameters couldn''t be correctly selected at the beginning of the vulnerability detection model. For the case of sample information redundancy and mixture of positive and negative sample distinctive features in the experiment, it proposed an idea of cross-exclusion of positive and negative sample distinctive features to reduce the false negative rate and false positive rate of the model. Experiments show that this method can effectively accelerate the convergence of the model, and making the model stable within 10 epochs. While improving the accuracy of the source code vulnerability detection model, its convergence speed is 2~3 times higher than other models. In the subsequent improvement experiments, the source code vulnerability detection model has improved the accuracy of each type of vulnerability by 1~3 percentage points, which fully proves the effectiveness of the improvement measures. The optimization strategies and improvement measures of this method are also applicable to other neural network classification models, and it can provide ideas for exploring new methods and models in the field of vulnerability detection.
Keywords:syntax semantics  improved differential evolution  vulnerability detection  BiGRU
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号