首页 | 本学科首页   官方微博 | 高级检索  
     

基于ENet的轻量级语义分割算法研究
引用本文:徐世杰,杜煜,鹿鑫,吴思凡. 基于ENet的轻量级语义分割算法研究[J]. 计算机工程与科学, 2021, 43(8): 1454-1460. DOI: 10.3969/j.issn.1007-130X.2021.08.014
作者姓名:徐世杰  杜煜  鹿鑫  吴思凡
作者单位:(北京联合大学智慧城市学院,北京 100101)
基金项目:国家自然科学基金(91420202)
摘    要:语义分割算法能够对图像进行像素级的分类,广泛应用于无人驾驶、医学图像处理和工业自动化等领域,具有重要研究价值.对语义分割算法的研究集中在提升分割精度、降低参数量和增加推理速度3个方面.经典的轻量语义分割算法ENet使用多层卷积的编解码器和大量的空洞卷积来避免过多的下采样和利用空间信息,虽能保证一定的空间信息完整性与较大...

关 键 词:语义分割  轻量级  实时性  注意力机制  感受野  空洞卷积
收稿时间:2020-05-15
修稿时间:2020-08-24

A lightweight semantic segmentation algorithm based on ENet
XU Shi-jie,DU Yu,LU Xin,WU Si-fan. A lightweight semantic segmentation algorithm based on ENet[J]. Computer Engineering & Science, 2021, 43(8): 1454-1460. DOI: 10.3969/j.issn.1007-130X.2021.08.014
Authors:XU Shi-jie  DU Yu  LU Xin  WU Si-fan
Affiliation:(Smart City College,Beijing Union University,Beijing 100101,China)
Abstract:Semantic segmentation algorithms can classify images at the pixel level, and are widely used in fields such as unmanned driving, medical image processing, and industrial automation, and have important research value. The research of semantic segmentation algorithms focuses on three aspects: improving the accuracy of segmentation, reducing the amount of parameters and increasing the speed of inference. The lightweight semantic segmentation algorithm ENet uses a multi-layer convolutional codec and a large number of dilated convolutions to avoid excessive downsampling and use of spatial information. Although it retains some spatial information integrity and large receptive field, the codec is bloated, the transmission of spatial information is poor, and the sensory field overflows and causes grid effect. Aiming at the above problems, this paper tailors the ENet algorithm structure, uses the attention mechanism and the pyramid dilated convolution to design spatial information transmission module, optimizes the algorithm structure, improves the algorithm receptive field, and completely transmits the spatial information transmission. The experimental results on public datasets Cityscapes and BDD100K show that the new module can improve the performance of the original algorithm with a smaller amount of parameters and calculations, which proves the redundancy of the original algorithm and the effectiveness of the designed module.
Keywords:semantic segmentation  lightweight  real-time  attention mechanism  receptive field  dilated convolution  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号