Attention Res-Unet: 一种高效阴影检测算法 Attention Res-Unet: an efficient shadow detection algorithm期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Attention Res-Unet: 一种高效阴影检测算法

引用本文：	董月,冯华君,徐之海,陈跃庭,李奇.Attention Res-Unet: 一种高效阴影检测算法[J].浙江大学学报(自然科学版 ),2019,53(2):373-381.

作者姓名：	董月冯华君徐之海陈跃庭李奇

作者单位：	浙江大学现代光学仪器国家重点实验室，浙江杭州 310027

摘要：	图像中阴影像素的存在会导致图像内容的不确定性，对计算机视觉任务有害，因此常将阴影检测作为计算机视觉算法的预处理步骤. 提出全新的阴影检测网络结构，通过结合输入图像中包含的语义信息和像素之间的关联，提升网络性能. 使用预训练后的深层网络ResNeXt101作为特征提取前端，提取图像的语义信息，并结合U-net的设计思路，搭建网络结构，完成特征层的上采样过程. 在输出层之前使用非局部操作，为每一个像素提供全局信息，建立像素与像素之间的联系. 设计注意力生成模块和注意力融合模块，进一步提高检测准确率. 分别在SBU、UCF这2个阴影检测数据集上进行验证，实验结果表明，所提方法的目视效果及客观指标皆优于此前最优方法所得结果，在2个数据集上的平均检测错误率分别降低14.4%和14.9%.
关键词：	阴影检测特征提取语义信息像素关联非局部操作注意力机制卷积神经网络(CNN)
Attention Res-Unet: an efficient shadow detection algorithm

Yue DONG,Hua-jun FENG,Zhi-hai XU,Yue-ting CHEN,Qi LI.Attention Res-Unet: an efficient shadow detection algorithm[J].Journal of Zhejiang University(Engineering Science),2019,53(2):373-381.

Authors:	Yue DONG Hua-jun FENG Zhi-hai XU Yue-ting CHEN Qi LI

Abstract:	Shadow pixels in images can lead to the uncertainty of image content, which is harmful to computer vision tasks. Therefore, shadow detection is often used as a preprocessing step of computer vision algorithm. A shadow detection network was proposed by combining semantic information contained in input images and correlation between pixels. Pre-trained deep network ResNeXt101 was used as feature extraction front-end module to extract semantic information of the image. The baseline structure of the network was built to up-sample feature layers, encouraged by the design idea of U-Net. Non-local operations were added before the output layer to provide global information for each pixel and establish the relationship between pixels. At the same time, an attention generation module and an attention fusion module were developed to further improve shadow detection accuracy. Two common shadow detection datasets named SBU and UCF were utilized for verification. Experiment results showed that the proposed network outperformed previous methods in both visual effect and objective indicator. The proposed network showed 14.4% reduction on SBU and 14.9% reduction on UCF for the balance error rate, compared with the state-of-the-art framework.

Keywords:	shadow detection feature extraction semantic information pixel correlation non-local attention module convolutional neural network (CNN)
本文献已被 CNKI 等数据库收录！
	点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
	点击此处可从《浙江大学学报(自然科学版 )》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏