共查询到20条相似文献,搜索用时 0 毫秒
1.
Visual saliency aims to locate the noticeable regions or objects in an image. In this paper, a coarse-to-fine measure is developed to model visual saliency. In the proposed approach, we firstly use the contrast and center bias to generate an initial prior map. Then, we weight the initial prior map with boundary contrast to obtain the coarse saliency map. Finally, a novel optimization framework that combines the coarse saliency map, the boundary contrast and the smoothness prior is introduced with the intention of refining the map. Experiments on three public datasets demonstrate the effectiveness of the proposed method. 相似文献
2.
Elaskily Mohamed A. Elnemr Heba A. Sedik Ahmed Dessouky Mohamed M. El Banby Ghada M. Elshakankiry Osama A. Khalaf Ashraf A. M. Aslan Heba K. Faragallah Osama S. Abd El-Samie Fathi E. 《Multimedia Tools and Applications》2020,79(27-28):19167-19192
Multimedia Tools and Applications - In this era of technology, digital images turn out to be ubiquitous in a contemporary society and they can be generated and manipulated by a wide variety of... 相似文献
3.
Stereoscopic images have become more and more prevalent following the rapid advances in 3D capturing and display techniques. However, there has been little research on visual content analysis for stereoscopic images. In this paper, we address the challenging problem of object detection and classification for stereoscopic images. An iterative method that can mutually boost salient object detection and object classification is proposed for stereoscopic images. This method includes two steps. In the first step, a 3D saliency detection method, which includes the contrastive and occlusion cues contained in each stereoscopic image pair along with the discriminative features provided by the SVM classifier, is proposed to localize object of interest in the stereoscopic images. In the second step, the bag of word features of foreground and background is pooled by using the localization information, and then is applied to train the SVM classifier. Each of the two steps benefits from the gradual improvement result in the other, no matter in the training or the testing process. To evaluate the performance of our approach, a 6-object class dataset of stereoscopic images real objects viewed under general lighting conditions, poses and viewpoints is set up. Our experimental results on the dataset, for object localization and object classification, demonstrate the effectiveness of the method. 相似文献
4.
Zhang Xufan Wang Yong Yan Jun Chen Zhenxing Wang Dianhong 《Multimedia Tools and Applications》2020,79(25-26):17331-17348
Multimedia Tools and Applications - Conventional saliency detection algorithms usually achieve good detection performance at the cost of high computational complexity, and most of them focus on... 相似文献
5.
Example-based object detection in images by components 总被引:27,自引:0,他引:27
Mohan A. Papageorgiou C. Poggio T. 《IEEE transactions on pattern analysis and machine intelligence》2001,23(4):349-361
We present a general example-based framework for detecting objects in static images by components. The technique is demonstrated by developing a system that locates people in cluttered scenes. The system is structured with four distinct example-based detectors that are trained to separately find the four components of the human body: the head, legs, left arm, and right arm. After ensuring that these components are present in the proper geometric configuration, a second example-based classifier combines the results of the component detectors to classify a pattern as either a “person” or a “nonperson.” We call this type of hierarchical architecture, in which learning occurs at multiple stages, an adaptive combination of classifiers (ACC). We present results that show that this system performs significantly better than a similar full-body person detector. This suggests that the improvement in performance is due to the component-based approach and the ACC data classification architecture. The algorithm is also more robust than the full-body person detection method in that it is capable of locating partially occluded views of people and people whose body parts have little contrast with the background 相似文献
6.
Image noise is a common problem frequently caused by insufficient lighting, low-quality cameras, image compression and other factors. While low image quality is expected to degrade results of visual recognition, most of the current methods and benchmarks for object recognition, such as Pascal Visual Object Classes Challenge and Microsoft Common Objects in Context Challenge, focus on relatively high-quality images. Meanwhile, object recognition in noisy images is a common problem in surveillance and other domains. In this work we address object detection in noisy images and propose a novel low-cost method for image denoising. When combined with the standard Deformable Parts Model and Regions with Convolutional Neural Network object detectors, our method shows improvements of object detection under varying levels of image noise. We present a comprehensive experimental evaluation and compare our method to other denoising techniques as well as to standard detectors re-trained on noisy images. Results are presented for the common Pascal Visual Object Classes benchmark for object detection and KAIST Multispectral Pedestrian Detection Benchmark with the real noise presence in night images. 相似文献
7.
8.
This paper describes a probabilistic framework for simultaneously performing object tracking and event detection in monocular videos. Mathematically, we cast the problem of jointly tracking and detecting semantic events as a principled model-based search problem in a multi-dimensional state space, where the tracking trajectory and event type are discovered via maximum a posteriori (MAP) optimization. The benefit of this approach comes from its combined utilization of particle probabilistic representation, multiple hypothesis retention, efficient particle propagation, and temporal optimization. We present qualitative and quantitative results from realistic video sequences to demonstrate the effectiveness of this approach. 相似文献
9.
Hamadi Abdelkader Lattar Hafsa Khoussa Mohamed El Bachir Safadi Bahjat 《Pattern Analysis & Applications》2020,23(1):27-44
Pattern Analysis and Applications - Multimedia documents indexing systems performances have been improved significantly in recent years, especially after the involvement of deep learning... 相似文献
10.
Niu Yuzhen Lin Lening Chen Yuzhong Ke Lingling 《Multimedia Tools and Applications》2017,76(24):26329-26353
Multimedia Tools and Applications - Visual saliency detection is useful in carrying out image compression, image segmentation, image retrieval, and other image processing applications. Majority of... 相似文献
11.
Neural Computing and Applications - In late 2019, a new Coronavirus disease (COVID-19) appeared in Wuhan, Hubei Province, China. The virus began to spread throughout many countries, affecting a... 相似文献
12.
Yang Ning Zhang Chen Zhang Yumo Yang Haowei Du Ling 《Multimedia Tools and Applications》2022,81(25):35831-35842
Multimedia Tools and Applications - Within-image co-salient object detection (wCoSOD) identifies the common and salient objects within an image, which can benefit for many applications, such as... 相似文献
13.
This article presents a framework developed for accomodating various object migrations in ‘statically-typed’ object databases. Requirements for supporting object migrations are stipulated, and a conceptual model for describing and facilitating different kinds of migrations is described. Associated issues of controlling such migrations are then addressed, along with an initial investigation on the interence of implied migration paths and the completeness of migration operators. Some guidelines are then given to help users conduct migrations more effectively. An implementation prototype on top of an object-oriented database system was built, which embodies full support of all migration types specified in the migration model. 相似文献
14.
15.
在高频声纳图像目标检测中,对图像分割后,需要对前景目标参数进行提取。水下声学图像相对于光学图像而言,分辨率相对较低,并且通常不会有特别复杂的图形图案进行处理。本文根据声学图像的特点,提出了一种连通成分标记算法,利用此算法可以对分割后的声纳目标进行标记,并快速提取出声图中的目标个数,以及各个目标的位置、面积等特征参数。此算法在VC++软件平台上对扫描声纳图像进行了实时处理,结果验证了该算法的可行性。 相似文献
16.
17.
Hussan Muzamil Parah Shabir A. Jan Aiman Qureshi G. J. 《Multimedia Tools and Applications》2022,81(13):18563-18594
Multimedia Tools and Applications - The present era is paving huge expansion to the transmission of digital data in fields like health, military intelligence, scientific research, and publication... 相似文献
18.
Farhang Sahba Hamid R. Tizhoosh Magdy M.M.A. Salama 《Expert systems with applications》2008,35(3):772-780
The principal contribution of this work is to design a general framework for an intelligent system to extract one object of interest from ultrasound images. This system is based on reinforcement learning. The input image is divided into several sub-images, and the proposed system finds the appropriate local values for each of them so that it can extract the object of interest. The agent uses some images and their ground-truth (manually segmented) version to learn from. A reward function is employed to measure the similarities between the output and the manually segmented images, and to provide feedback to the agent. The information obtained can be used as valuable knowledge stored in the Q-matrix. The agent can then use this knowledge for new input images. The experimental results for prostate segmentation in trans-rectal ultrasound images show high potential of this approach in the field of ultrasound image segmentation. 相似文献
19.
20.
A general shape context framework is proposed for object/image retrieval in occluded and cluttered environment with hundreds of models as the potential matches of an input. The approach is general since it does not require separation of input objects from complex background. It works by first extracting consistent and structurally unique local neighborhood information from inputs or models, and then voting on the optimal matches. Its performance degrades gracefully with respect to the amount of structural information that is being occluded or lost. The local neighborhood information applicable to the system can be shape, color, texture feature, etc. Currently, we employ shape information only. The mechanism of voting is based on a novel hyper cube based indexing structure, and driven by dynamic programming. The proposed concepts have been tested on database with thousands of images. Very encouraging results have been obtained. 相似文献