
Semantic Segmentation of Streetscape Based on Improved ExfuseNet (基于改进ExfuseNet模型的街景语义分割)

Cite this article:	陈劲宏, 陈玮, 尹钟. 基于改进ExfuseNet模型的街景语义分割[J]. 电子科技, 2022, 35(6): 28-34.

Authors:	陈劲宏 (CHEN Jinhong), 陈玮 (CHEN Wei), 尹钟 (YIN Zhong)

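Affiliation:	School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China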

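Abstract:	When ExfuseNet is applied to streetscape semantic segmentation, the complex backgrounds of street view images make the classes of interest unbalanced in both area share and spatial distribution. Targets that cover little area and occur at low density are especially prone to misclassification in the deeper layers of the network, and this lowers the segmentation performance of the model. This paper improves the ExfuseNet model to address the problem. To obtain semantic information at different scales without adding model parameters, the multi-supervision module uses atrous convolutions with different dilation rates. A dropout layer is applied immediately after the down-sampled features are fused, in order to reduce the number of model parameters and improve generalization. A CBAM attention module is placed before the main output so that the deep semantic information of the target classes of interest is sampled more efficiently, and a class-balancing function is applied after the multi-supervision module to ease the class imbalance of the CamVid dataset. Experiments show a clear improvement in segmentation quality: the mean intersection over union rises to 68.32%, and the classification accuracy of the Pole class rises to 38.14%.
Keywords:	street view image; multi-supervision; dilation rate; atrous convolution; dropout layer; generalization ability; attention mechanism; class balance; mean intersection over union
Received:	2021-02-04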
Semantic Segmentation of Streetscape Based on Improved ExfuseNet

CHEN Jinhong, CHEN Wei, YIN Zhong. Semantic Segmentation of Streetscape Based on Improved ExfuseNet[J]. Electronic Science and Technology, 2022, 35(6): 28-34.

Authors:	CHEN Jinhong, CHEN Wei, YIN Zhong

Affiliation:	School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China

Abstract:	When the ExfuseNet model is used for streetscape semantic segmentation, the high background complexity of street view images leaves the classes of interest unbalanced in both area ratio and spatial distribution. Targets of interest that occupy a small area and appear at low density are increasingly likely to be misclassified in deeper layers of the network, which ultimately degrades segmentation performance. To address this problem, an improved ExfuseNet model is proposed. To capture semantic information at different scales without increasing the number of model parameters, the multi-supervision module adopts atrous convolutions with different dilation rates. Immediately after the down-sampled features are fused, a dropout layer is applied to reduce the number of model parameters and improve generalization. Before the main output, a CBAM attention module is used to sample the deep semantic information of the target classes of interest more efficiently, and a class-balancing function is applied after the multi-supervision module to mitigate the class imbalance of the CamVid dataset. Experimental results show that the improved ExfuseNet model segments noticeably better: its mean intersection over union (mIoU) rises to 68.32%, and the classification accuracy of the Pole class rises to 38.14%.
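
The abstract's point that atrous convolution widens the receptive field without adding parameters can be illustrated with a minimal one-dimensional sketch. This is not the authors' implementation: the function names, the toy kernel, and the dilation rates 1, 2 and 4 are assumptions chosen for illustration, since the abstract does not state which rates the multi-supervision module uses.

```python
def receptive_field(kernel_size, rate):
    """Number of input positions spanned by a kernel dilated by `rate`."""
    return (kernel_size - 1) * rate + 1


def dilated_conv1d(signal, kernel, rate):
    """Valid 1-D atrous convolution (cross-correlation form, no padding).

    The kernel taps are applied `rate` positions apart, so the span grows
    with the rate while the number of weights stays len(kernel).
    """
    span = receptive_field(len(kernel), rate)
    out = []
    for start in range(len(signal) - span + 1):
        out.append(sum(w * signal[start + i * rate] for i, w in enumerate(kernel)))
    return out


# Hypothetical toy data: the same 3-weight kernel is reused at every rate.
kernel = [1.0, 0.0, -1.0]
signal = [float(x * x) for x in range(10)]

for rate in (1, 2, 4):  # assumed rates, not taken from the paper
    out = dilated_conv1d(signal, kernel, rate)
    print(f"rate={rate} weights={len(kernel)} "
          f"receptive_field={receptive_field(len(kernel), rate)} outputs={len(out)}")
```

The weight count is 3 at every rate while the span grows from 3 to 9 input positions, which is the sense in which multi-scale context comes at no extra parameter cost.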

Keywords:	street view image; multiple supervision; dilated rate; dilated convolution; random drop layer; generalization; attention mechanism; class balance; mean intersection over union
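
For readers unfamiliar with the reported metrics, the sketch below shows one common way to compute per-class IoU, mean IoU and per-class accuracy from a confusion matrix. It assumes that the "classification accuracy" reported for the Pole class is per-class pixel accuracy (correct pixels divided by ground-truth pixels of that class); the abstract does not define it. The labels are made-up toy data, not CamVid results.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """cm[t][p] counts pixels with ground-truth class t predicted as class p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def per_class_iou(cm):
    """IoU_c = TP / (TP + FP + FN); NaN for classes absent from both maps."""
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp
        fp = sum(cm[r][c] for r in range(n)) - tp
        denom = tp + fp + fn
        ious.append(tp / denom if denom else float("nan"))
    return ious


def mean_iou(cm):
    """Average IoU over classes that actually occur (NaN entries skipped)."""
    valid = [v for v in per_class_iou(cm) if v == v]
    return sum(valid) / len(valid)


def per_class_accuracy(cm):
    """Assumed definition: correct pixels / ground-truth pixels of the class."""
    return [row[c] / sum(row) if sum(row) else float("nan")
            for c, row in enumerate(cm)]


# Hypothetical flattened label maps with three classes.
y_true = [0, 0, 0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 0, 1, 1, 1, 2, 0]

cm = confusion_matrix(y_true, y_pred, num_classes=3)
print("per-class IoU:     ", per_class_iou(cm))
print("mean IoU:          ", mean_iou(cm))
print("per-class accuracy:", per_class_accuracy(cm))
```

Because every class contributes equally to the mean, a small, sparse class such as Pole pulls mIoU down as much as a large class does, which is why class imbalance matters for this metric.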
This article is indexed in Wanfang Data and other databases.