首页 | 本学科首页   官方微博 | 高级检索  
     

Attention Res-Unet: 一种高效阴影检测算法
引用本文:董月,冯华君,徐之海,陈跃庭,李奇.Attention Res-Unet: 一种高效阴影检测算法[J].浙江大学学报(自然科学版 ),2019,53(2):373-381.
作者姓名:董月  冯华君  徐之海  陈跃庭  李奇
作者单位:浙江大学 现代光学仪器国家重点实验室,浙江 杭州 310027
摘    要:图像中阴影像素的存在会导致图像内容的不确定性,对计算机视觉任务有害,因此常将阴影检测作为计算机视觉算法的预处理步骤. 提出全新的阴影检测网络结构,通过结合输入图像中包含的语义信息和像素之间的关联,提升网络性能. 使用预训练后的深层网络ResNeXt101作为特征提取前端,提取图像的语义信息,并结合U-net的设计思路,搭建网络结构,完成特征层的上采样过程. 在输出层之前使用非局部操作,为每一个像素提供全局信息,建立像素与像素之间的联系. 设计注意力生成模块和注意力融合模块,进一步提高检测准确率. 分别在SBU、UCF这2个阴影检测数据集上进行验证,实验结果表明,所提方法的目视效果及客观指标皆优于此前最优方法所得结果,在2个数据集上的平均检测错误率分别降低14.4%和14.9%.

关 键 词:阴影检测  特征提取  语义信息  像素关联  非局部操作  注意力机制  卷积神经网络(CNN)  

Attention Res-Unet: an efficient shadow detection algorithm
Yue DONG,Hua-jun FENG,Zhi-hai XU,Yue-ting CHEN,Qi LI.Attention Res-Unet: an efficient shadow detection algorithm[J].Journal of Zhejiang University(Engineering Science),2019,53(2):373-381.
Authors:Yue DONG  Hua-jun FENG  Zhi-hai XU  Yue-ting CHEN  Qi LI
Abstract:Shadow pixels in images can lead to the uncertainty of image content, which is harmful to computer vision tasks. Therefore, shadow detection is often used as a preprocessing step of computer vision algorithm. A shadow detection network was proposed by combining semantic information contained in input images and correlation between pixels. Pre-trained deep network ResNeXt101 was used as feature extraction front-end module to extract semantic information of the image. The baseline structure of the network was built to up-sample feature layers, encouraged by the design idea of U-Net. Non-local operations were added before the output layer to provide global information for each pixel and establish the relationship between pixels. At the same time, an attention generation module and an attention fusion module were developed to further improve shadow detection accuracy. Two common shadow detection datasets named SBU and UCF were utilized for verification. Experiment results showed that the proposed network outperformed previous methods in both visual effect and objective indicator. The proposed network showed 14.4% reduction on SBU and 14.9% reduction on UCF for the balance error rate, compared with the state-of-the-art framework.
Keywords:shadow detection  feature extraction  semantic information  pixel correlation  non-local  attention module  convolutional neural network (CNN)  
本文献已被 CNKI 等数据库收录!
点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
点击此处可从《浙江大学学报(自然科学版 )》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号