基于稠密扩张卷积的图像语义分割模型 Image Semantic Segmentation Model Based on Dense Dilation Convolution期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于稠密扩张卷积的图像语义分割模型

引用本文：	张富财,许建龙,包晓安. 基于稠密扩张卷积的图像语义分割模型[J]. 计算机系统应用, 2022, 31(3): 19-29. DOI: 10.15888/j.cnki.csa.008376

作者姓名：	张富财许建龙包晓安

作者单位：	浙江理工大学信息学院, 杭州 310018

基金项目：	浙江省重点研发计划(2020C03094)

摘要：	为解决图像语义分割任务中面对的分割场景的复杂性、分割对象的多样性及分割对象空间位置的差异性问题, 提高语义分割模型的精度, 提出基于稠密扩张卷积的双分支多层级语义分割网络(double branch and multi-stages network, DBMSNet). 首先采用主干网络提取输入图像的4个不同分辨率的特征图(De1、De2、De3、De4), 其次采用特征精炼(feature refine, FR)模块对De1和De3这两个特征图进行特征精炼处理, 特征精炼处理之后的输出分支经过混合扩张卷积模块(mixed dilation module, MDM)编码空间位置特征, De4分支采用金字塔池化模块(pyramid pooling module, PPM)编码高级语义特征, 最后将两个分支进行融合, 输出分割结果. 在数据集CelebAMask-HQ和Cityscapes中进行实验, 分别得到mIoU精度为74.64%、78.29%. 结果表明, 本文方法的分割精度高于对比方法, 且具有更少的参数量.
关键词：	深度学习图像语义分割扩张卷积稠密连接多层级特征
收稿时间：	2021-05-23
修稿时间：	2021-06-21
Image Semantic Segmentation Model Based on Dense Dilation Convolution

ZHANG Fu-Cai,XU Jian-Long,BAO Xiao-An. Image Semantic Segmentation Model Based on Dense Dilation Convolution[J]. Computer Systems& Applications, 2022, 31(3): 19-29. DOI: 10.15888/j.cnki.csa.008376

Authors:	ZHANG Fu-Cai XU Jian-Long BAO Xiao-An

Affiliation:	School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China

Abstract:	Semantic segmentation is a very challenging task because of the complexity of parsing the scene, the diversity of segmented objects, and the differences in spatial positions of objects. To tackle this dilemma, this paper proposes a novel architecture named double branch and multi-stage network (DBMSNet) based on dense dilated convolution. Firstly, four feature maps (De1, De2, De3, and De4) with different resolutions are extracted by the backbone network, and then the feature refinement maps of De1 and De3 are output through the feature refinement (FR) module. Secondly, the output branch is processed by the mixed dilation module (MDM) to extract rich spatial location features, while the De4 branch is processed by the pyramid pooling module (PPM) to extract multi-scale semantic information. Finally, the two branches are merged and the segmentation result is output. Comprehensive experiments are conducted on two public datasets of CelebAMask-HQ and Cityscapes, on which our model achieves mean intersection-over-union (mIoU) scores of 74.64% and 78.29%, respectively. The results show that the segmentation accuracy of this study is higher than that of the counterpart method, and this method has fewer parameters.

Keywords:	deep learning image semantic segmentation dilation convolution dense connection multi-stages feature
本文献已被万方数据等数据库收录！
	点击此处可从《计算机系统应用》浏览原始摘要信息
	点击此处可从《计算机系统应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏