首页 | 本学科首页   官方微博 | 高级检索  
     

面向FPGA部署的CNN-SVM算法研究与实现
引用本文:周彦臻,吴瑞东,于 潇,付 平,刘 冰,李君宝.面向FPGA部署的CNN-SVM算法研究与实现[J].电子测量与仪器学报,2021,35(4):90-98.
作者姓名:周彦臻  吴瑞东  于 潇  付 平  刘 冰  李君宝
作者单位:哈尔滨工业大学 电子与信息工程学院 哈尔滨 150000;沈阳飞机设计研究所 沈阳 110035
基金项目:国家自然科学基金(61671170)项目资助
摘    要:卷积神经网络-支持向量机(CNN-SVM)混合算法结合了CNN特征提取能力和SVM分类性能,在计算复杂度和解决小样本问题上具有一定优势,目前已在故障诊断、医学图像处理等领域得到了一定应用,同时,由于其计算复杂度较低,也引起了边缘计算领域的关注。针对边缘计算场景中对算法性能和功耗的要求,提出了一种面向FPGA平台的CNN-SVM算法优化与实现方法。首先,结合FPGA的架构特点,对CNN-SVM算法结构进行了硬件适应性优化,包括模型压缩和分类器核函数的选取。其次,采用了软硬件协同和高层次综合(HLS)设计方法,完成了CNN-SVM算法加速器的设计与实现。实验结果表明,在ZCU102上,加速器的FPS(frames per second)达到了18.33 K,计算速度为1.474 GMAC/s,相对于CPU平台四核Cortex-A57和Ryzen7 3700x分别实现了23.57和4.92倍加速,相对于Jetson Nano GPU和GTX750平台能耗比分别达到了33.24和50.27。

关 键 词:CNN-SVM算法  FPGA实现  硬件加速器设计  软硬件协同设计

Research and implementation of CNN-SVM algorithm based on FPGA
Zhou Yanzhen,Wu Ruidong,Yu Xiao,Fu Ping,Liu Bing,Li Junbao.Research and implementation of CNN-SVM algorithm based on FPGA[J].Journal of Electronic Measurement and Instrument,2021,35(4):90-98.
Authors:Zhou Yanzhen  Wu Ruidong  Yu Xiao  Fu Ping  Liu Bing  Li Junbao
Affiliation:1. College of Electronic and Information Engineering, Harbin Institute of Technology;2. Shenyang Aircraft D&R Institute
Abstract:CNN-SVM hybrid algorithm combines the feature extraction ability of CNN and the classification performance of SVM, it has certain advantages in computational complexity and can solve small sample problem. It has been applied in fault diagnosis, medical image processing and other fields, at the same time, it gets attention in the field of edge computing due to its low computational complexity. Aiming at the requirements of algorithm performance and power consumption in edge computing scenarios, an optimization and implementation method of CNN-SVM algorithm for FPGA platform is proposed. First, combined with the architecture characteristics of FPGA, the hardware adaptability optimization of CNN-SVM algorithm structure is carried out, including the model compression and the selection of kernel function of classifier. Secondly, the design and implementation of CNN-SVM algorithmic accelerator is completed by using software and hardware cooperation and high level synthesis ( HLS) design method. The experimental results show that on ZCU102, the frames per second(FPS) of accelerator reaches 18. 33 K, the computing speed is 1. 474 GMAC/ s. Compared with the CPU platform, quad core Cortex-A57 and Ryzen7 3700x achieve 23. 57 and 4. 92 times acceleration respectively, compared with Jetson Nano GPU and GTX750 platform, the energy consumption ratio is 33. 24 and 50. 27 respectively.
Keywords:CNN-SVM algorithm  FPGA implementation  hardware accelerator design  hardware and software co-design
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电子测量与仪器学报》浏览原始摘要信息
点击此处可从《电子测量与仪器学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号