基于背景学习的显著物体检测 Salient object detection based on background learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于背景学习的显著物体检测

引用本文：	项导,侯赛辉,王子磊.基于背景学习的显著物体检测[J].中国图象图形学报,2016,21(12):1634-1643.

作者姓名：	项导侯赛辉王子磊

作者单位：	中国科学技术大学自动化系, 合肥 230027,中国科学技术大学自动化系, 合肥 230027,中国科学技术大学自动化系, 合肥 230027

基金项目：	国家自然科学基金项目（61203256，61233003）；安徽省自然科学基金项目（1408085MF112）

摘要：	目的显著物体检测的目标是提取给定图像中最能吸引人注意的物体或区域，在物体识别、图像显示、物体分割、目标检测等诸多计算机视觉领域中都有广泛应用。已有的基于局部或者全局对比度的显著物体检测方法在处理内容复杂的图像时，容易造成检测失败，其主要原因可以总结为对比度参考区域设置的不合理。为提高显著物体检测的完整性，提出背景驱动的显著物体检测算法，在显著值估计和优化中充分利用背景先验。方法首先采用卷积神经网络学习图像的背景分布，然后从得到的背景图中分割出背景区域作为对比度计算参考区域来估计区域显著值。最后，为提高区域显著值的一致性，采用基于增强图模型的优化实现区域显著值的扩散，即在传统k-正则图局部连接的基础上，添加与虚拟节点之间的先验连接和背景区域节点之间的非局部连接，实现背景先验信息的嵌入。结果在公开的ASD、SED、SOD和THUS-10000数据库上进行实验验证，并与9种流行的算法进行对比。本文算法在4个数据库上的平均准确率、查全率、F-measure和MAE指标分别为0.873 6、0.795 2、0.844 1和0.112 2，均优于当前流行的算法。结论以背景区域作为对比度计算参考区域可以明显提高前景区域的显著值。卷积神经网络可以有效学习图像的背景分布并分割出背景区域。基于增强图模型的优化可以进一步实现显著值在前景和背景区域的扩散，提高区域显著值的一致性，并抑制背景区域的显著性响应。实验结果表明，本文算法能够准确、完整地检测图像的显著区域，适用于复杂图像的显著物体检测或物体分割应用。
关键词：	显著物体检测背景学习背景先验卷积神经网络增强图模型优化
收稿时间：	2016/4/21 0:00:00
修稿时间：	2016/8/29 0:00:00
Salient object detection based on background learning

Xiang Dao,Hou Saihui and Wang Zilei.Salient object detection based on background learning[J].Journal of Image and Graphics,2016,21(12):1634-1643.

Authors:	Xiang Dao Hou Saihui and Wang Zilei

Affiliation:	Department of Automation, University of Science and Technology of China, Hefei 230027, China,Department of Automation, University of Science and Technology of China, Hefei 230027, China and Department of Automation, University of Science and Technology of China, Hefei 230027, China

Abstract:	Objective Salient object detection aims to identify spatial locations and scales of the most attention-grabbing objects in a given image, which is shown to be helpful in various computer vision tasks, such as object recognition, adaptive image display, and object detection. Different from eye-fixation saliency prediction, salient object detection emphasizes the saliency and wholeness of detected objects. Thus, dealing with cluttered background and diversity of object parts within an image has always been one of the major challenges in salient object detection. Bottom-up visual saliency is commonly characterized by the contrast of primitive image features at the pixel or super-pixel levels because contrast is the most predominant factor in human cognition. In the literature, the local or global contrast is usually adopted to straightforwardly derive the saliency map, where the contrast in a certain region is calculated by comparing its feature with that of the reference regions. However, such methods using local or global contrast reference regions may fail to detect whole salient objects, especially when dealing with complicated images. We attribute their failure to unreasonable setting of contrast reference regions. To enhance the wholeness of the detected salient objects, an explicit background-driven method is proposed, in which background prior is comprehensively utilized in saliency estimation and optimization. Method To obtain the background regions of an image for contrast estimation, deep convolutional neural networks were initially used to learn a background map representing the likelihood of each region belonging to the background. From the obtained background map, the background regions could be segmented with a simple thresholding strategy. The learned background regions were then used as references for region contrast computation. To enhance the consistency between the foreground and background regions, enhanced graph-based optimization was adopted to propagate saliencies along the graph. Besides the conventional local connections in a k-regular graph, prior connections with virtual nodes and non-local connections between nodes belonging to background regions were also added to the graph to embed the learned background prior. Result To verify the effectiveness of the proposed salient object detection method, comprehensive experiments were conducted on four public saliency detection datasets, namely, ASD, SED, SOD, and THUS-10000. The results were compared with those of nine state-of-the-art methods. Four indicators (i.e., precision, recall, F-measure, and MAE) were adopted for comparison. The average scores in precision, recall, F-measure, and MAE of our method were 0.873 6, 0.795 2, 0.844 1, and 0.112 2, respectively, which showed that our method outperformed other popular methods. The best results on all of the datasets were achieved using the proposed method, thereby demonstrating its effectiveness and superiority. Conclusion Restricting the contrast reference regions to the background could significantly improve contrast-based saliency estimation. The background regions in an image could be effectively learned by convolutional neural networks. An enhanced graph-based optimization could fuse the saliency confidences of different parts from the same object to discover the whole salient object; thus, a more consistent and background-suppressed saliency map could be generated. Experimental results showed that the proposed method can be successfully used in salient object detection and object segmentation in natural images.

Keywords:	salient object detection background learning background prior convolutional neural networks enhanced graph based optimization

	点击此处可从《中国图象图形学报》浏览原始摘要信息
	点击此处可从《中国图象图形学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏