首页 | 本学科首页   官方微博 | 高级检索  
     

Simplex噪声区域中目标特征的对抗攻击算法
引用本文:陈伟,刘亚洲,魏松杰.Simplex噪声区域中目标特征的对抗攻击算法[J].计算机辅助设计与图形学学报,2021,33(2):250-259.
作者姓名:陈伟  刘亚洲  魏松杰
作者单位:南京理工大学计算机科学与工程学院 南京 210094;南京理工大学计算机科学与工程学院 南京 210094;南京理工大学计算机科学与工程学院 南京 210094
基金项目:国家自然科学基金;上海航天科技创新基金
摘    要:针对当前黑盒环境中,主流的图像对抗攻击算法在有限的目标模型访问查询次数条件下攻击准确率低的问题,提出一种基于目标特征和限定区域采样的目标攻击算法.首先根据原始图像和目标图像生成初始对抗样本;然后在Simplex-mean噪声区域中进行扰动采样,并根据对抗样本和原始图像差异度以及目标特征区域位置决定扰动大小;最后将扰动作用于初始对抗样本中,使新的对抗样本在保持对抗性的同时缩小与原始图像的差异度.以常见的图像分类模型InceptionV3和VGG16等为基础,在相同的目标模型访问查询,以及与对抗样本和原始图像的l2距离小于55.89的条件下,采用BBA等算法对同一图像集和目标集进行攻击.实验结果表明,在同样的目标模型访问查询和l2=55.89的限制条件下,不超过5 000次目标查询时,在InceptionV3模型上该算法的攻击准确率比同类攻击算法提升至少50%.

关 键 词:目标攻击  图像分类  黑盒环境  区域采样  对抗样本

Adversarial Attack Algorithm on Target Features in Simplex Noise Area
Chen Wei,Liu Yazhou,Wei Songjie.Adversarial Attack Algorithm on Target Features in Simplex Noise Area[J].Journal of Computer-Aided Design & Computer Graphics,2021,33(2):250-259.
Authors:Chen Wei  Liu Yazhou  Wei Songjie
Affiliation:(School of Computer Science and Engineering,Nanjing University of Science and Technology,Nanjing 210094)
Abstract:A target adversarial attack algorithm based on target features and limited area sampling is proposed to improve the low attack accuracy of the current adversarial attack algorithms when only limited target model access queries are allowed in the black-box scenario.Firstly,an initial adversarial example is generated by the original image and the target image.Then the disturbance is sampled in the Simplex-mean noise region and determined by the location of target feature region and the difference between the adversarial example and the original image.The disturbance is used in the initial adversarial example to keep the newly generated one adversarial and to reduce the difference between it and the original image.Based on the common image classification model InceptionV3 and VGG16,under the same target model access query and the l2 distance between the adversarial example and the original image is less than 55.89.The experimental results using algorithms such as BBA to attack the same image set and target set show that the accuracy of the proposed algorithm is at least 50%higher than that of similar attack algorithms under the same target model access query and l2=55.89,with no more than 5000 target queries in InceptionV3 model.
Keywords:target attack  image classification  black-box scenario  regional sampling  adversarial example
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号