共查询到20条相似文献,搜索用时 0 毫秒
1.
3D movies/videos have become increasingly popular in the market; however, they are usually produced by professionals. This paper presents a new technique for the automatic conversion of 2D to 3D video based on RGB-D sensors, which can be easily conducted by ordinary users. To generate a 3D image, one approach is to combine the original 2D color image and its corresponding depth map together to perform depth image-based rendering (DIBR). An RGB-D sensor is one of the inexpensive ways to capture an image and its corresponding depth map. The quality of the depth map and the DIBR algorithm are crucial to this process. Our approach is twofold. First, the depth maps captured directly by RGB-D sensors are generally of poor quality because there are many regions missing depth information, especially near the edges of objects. This paper proposes a new RGB-D sensor based depth map inpainting method that divides the regions with missing depths into interior holes and border holes. Different schemes are used to inpaint the different types of holes. Second, an improved hole filling approach for DIBR is proposed to synthesize the 3D images by using the corresponding color images and the inpainted depth maps. Extensive experiments were conducted on different evaluation datasets. The results show the effectiveness of our method. 相似文献
2.
Multimedia Tools and Applications - With the advent of stereo camera saliency object detection for RGB-D image is attracting more and more interest. Most existing algorithms treat RGB-D image as... 相似文献
3.
This paper proposes a two-stage system for text detection in video images. In the first stage, text lines are detected based on the edge map of the image leading in a high recall rate with low computational time expenses. In the second stage, the result is refined using a sliding window and an SVM classifier trained on features obtained by a new Local Binary Pattern-based operator (eLBP) that describes the local edge distribution. The whole algorithm is used in a multiresolution fashion enabling detection of characters for a broad size range. Experimental results, based on a new evaluation methodology, show the promising overall performance of the system on a challenging corpus, and prove the superior discriminating ability of the proposed feature set against the best features reported in the literature. 相似文献
4.
提出了一种监控场景下的面部遮挡检测方法。基于AdaBoost算法进行人脸验证,通过面部划分,分块分析是否存在遮挡情况。首先判断是否有人进入,在有人进入的情况下进行面部遮挡检测,对眼部区域采用AdaBoost方法及墨镜特征提取方法判断是否遮挡,而对嘴部区域采用高斯肤色模型进行判断。实验结果表明,该方法能实时检测面部遮挡的情况,并达到了较好的效果,适用于银行ATM等监控场景,具有较高的应用价值。 相似文献
5.
Multimedia Tools and Applications - Within-image co-salient object detection (wCoSOD) identifies the common and salient objects within an image, which can benefit for many applications, such as... 相似文献
6.
为自动有效地获取交通监控场景中的多车道信息,提出一种利用骨架化边缘的多车道检测算法,以克服视频处理对固定场景和明确的先验车道位置信息的依赖。算法主要针对静态的交通背景图处理,采用背景提取、滤波和数字形态学预处理等,由Hough变换确定车道位置的骨架线;由行车方向约束车道线角度,利用车道线几何成像特性检测出准车道线,获取车道线和车道区域。实验表明,对不同的交通场景和不同光照条件,该方法能有效检测多车道,鲁棒性强,具有较高的工程应用价值。 相似文献
7.
Variation in illumination conditions caused by weather, time of day, etc., makes the task difficult when building video surveillance systems of real world scenes. Especially, cast shadows produce troublesome effects, typically for object tracking from a fixed viewpoint, since it yields appearance variations of objects depending on whether they are inside or outside the shadow. In this paper, we handle such appearance variations by removing shadows in the image sequence. This can be considered as a preprocessing stage which leads to robust video surveillance. To achieve this, we propose a framework based on the idea of intrinsic images. Unlike previous methods of deriving intrinsic images, we derive time-varying reflectance images and corresponding illumination images from a sequence of images instead of assuming a single reflectance image. Using obtained illumination images, we normalize the input image sequence in terms of incident lighting distribution to eliminate shadowing effects. We also propose an illumination normalization scheme which can potentially run in real time, utilizing the illumination eigenspace, which captures the illumination variation due to weather, time of day, etc., and a shadow interpolation method based on shadow hulls. This paper describes the theory of the framework with simulation results and shows its effectiveness with object tracking results on real scene data sets. 相似文献
8.
In this paper, we propose a context-sensitive technique for unsupervised change detection in multitemporal remote sensing images. The technique is based on fuzzy clustering approach and takes care of spatial correlation between neighboring pixels of the difference image produced by comparing two images acquired on the same geographical area at different times. Since the ranges of pixel values of the difference image belonging to the two clusters ( changed and unchanged) generally have overlap, fuzzy clustering techniques seem to be an appropriate and realistic choice to identify them (as we already know from pattern recognition literatures that fuzzy set can handle this type of situation very well). Two fuzzy clustering algorithms, namely fuzzy c-means (FCM) and Gustafson-Kessel clustering (GKC) algorithms have been used for this task in the proposed work. For clustering purpose various image features are extracted using the neighborhood information of pixels. Hybridization of FCM and GKC with two other optimization techniques, genetic algorithm (GA) and simulated annealing (SA), is made to further enhance the performance. To show the effectiveness of the proposed technique, experiments are conducted on two multispectral and multitemporal remote sensing images. A fuzzy cluster validity index (Xie-Beni) is used to quantitatively evaluate the performance. Results are compared with those of existing Markov random field (MRF) and neural network based algorithms and found to be superior. The proposed technique is less time consuming and unlike MRF does not require any a priori knowledge of distributions of changed and unchanged pixels. 相似文献
9.
The change-detection problem can be viewed as an unsupervised classification problem with two classes corresponding to changed and unchanged areas. Image differencing is a widely used approach to change detection. It is based on the idea of generating a difference image that represents the modulus of the spectral change vectors associated with each pixel in the study area. To separate out the changed and unchanged classes in the difference image automatically, any unsupervised technique can be used. Thresholding is one of the cheapest techniques among them. However, in thresholding approaches, selection of the best threshold value is not a trivial task. In this work, several non-fuzzy and fuzzy histogram thresholding techniques are investigated and compared for the change-detection problem. Experimental results, carried out on different multitemporal remote sensing images (acquired before and after an event), are used to assess the effectiveness of each of the thresholding techniques. Among all the thresholding techniques investigated here, Liu's fuzzy entropy followed by Kapur's entropy are found to be the most robust techniques. 相似文献
10.
Multimedia Tools and Applications - With the emergence of consumer RGB-D sensors, discriminative modeling has been shown to perform well in estimating human body pose. However, articulated hand... 相似文献
11.
This study proposes a superpixel-based active contour model (SACM) for unsupervised change detection from satellite images. The accuracy of change detection produced by the traditional active contour model suffers from the trade-off parameter. The SACM is designed to address this limitation through the incorporation of the spatial and statistical information of superpixels. The proposed method mainly consists of three steps. First, the difference image is created with change vector analysis method from two temporal satellite images. Second, statistical region merging method is applied on the difference image to produce a superpixel map. Finally, SACM is designed based on the superpixel map to detect changes from the difference image. The SACM incorporates spatial and statistical information and retains the accurate shapes and outlines of superpixels. Experiments were conducted on two data sets, namely Landsat-7 Enhanced Thematic Mapper Plus and SPOT 5, to validate the proposed method. Experimental results show that SACM reduces the effects of the trade-off parameter. The proposed method also increases the robustness of the traditional active contour model for input parameters and improves its effectiveness. In summary, SACM often outperforms some existing methods and provides an effective unsupervised change detection method. 相似文献
12.
针对传统火灾火焰探测技术存在不稳定、误判率高的缺点,提出了一种基于人工神经网络的火焰检测与识别算法。通过分析火焰图像的动态特性,利用火焰图像序列的离心率、放射性和整体移动等特征信息,结合学习向量量化(LVQ)神经网络进行训练仿真。实验结果表明,该算法能有效提高监控视频图像中可疑火焰的快速分类,稳定性强,具有较高的火焰识别准确率。 相似文献
13.
In this paper, we present a real-time image processing technique for the detection of steam in video images. The assumption made is that the presence of steam acts as a blurring process, which changes the local texture pattern of an image while reducing the amount of details. The problem of detecting steam is treated as a supervised pattern recognition problem. A statistical hidden Markov tree (HMT) model derived from the coefficients of the dual-tree complex wavelet transform (DT-CWT) in small 48×48 local regions of the image frames is used to characterize the steam texture pattern. The parameters of the HMT model are used as an input feature vector to a support vector machine (SVM) technique, specially tailored for this purpose. By detecting and determining the total area covered by steam in a video frame, a computerized image processing system can automatically decide if the frame can be used for further analysis. The proposed method was quantitatively evaluated by using a labelled image data set with video frames sampled from a real oil sand video stream. The classification results were 90% correct when compared to human labelled image frames. The technique is useful as a pre-processing step in automated image processing systems. 相似文献
14.
This paper reconstructs human model from multi-view RGB-D images of an Xbox One Kinect. We preprocess the depth images by implicit surface de-noising and then part-wisely register them into a point cloud. A template model is selected from the human model database to fit the registered point cloud of a human body by Laplacian deformation. Skin detection of RGB-D images helps to tightly constrain the skin parts of human body in template fitting step in order to get more precise and lifelike human model. We propose a robust skin detection method that is not affected by clothing pattern and background. Experiments demonstrate the effectiveness of our method. 相似文献
15.
With the advancement in digital video technology, video surveillance has been playing its vital role for ensuring safety and security. The surveillance systems are deployed in wide range of applications to invigilate stuffs and to analyse the activities in the environment. From the single or multi surveillance camera, a huge amount of data is generated, stored and processed for security purpose. Due to time constraints, it is a very tedious process for an analyst to go through the full content. This limitation has been overcome by the use of video summarization. The video summarization is intended to afford comprehensible analysis of video by removing duplications and extracting key frames from the video. To make an easily interpreted outline, the various available video summarization methods will try to shot the summary of the main occurrences, scenes, or objects in a frame. Depending on the applications, it is required to summarize the happenings in the scene and detect the objects (static/dynamic) which is recorded in the video. Hence this paper provides the various methods used for video summarization and a comparative study of different techniques. It also presents different object detection, object classification and object tracking algorithms available in the literature. 相似文献
17.
In this paper, we propose novel methods to evaluate the performance of object detection algorithms in video sequences. This procedure allows us to highlight characteristics (e.g., region splitting or merging) which are specific of the method being used. The proposed framework compares the output of the algorithm with the ground truth and measures the differences according to objective metrics. In this way it is possible to perform a fair comparison among different methods, evaluating their strengths and weaknesses and allowing the user to perform a reliable choice of the best method for a specific application. We apply this methodology to segmentation algorithms recently proposed and describe their performance. These methods were evaluated in order to assess how well they can detect moving regions in an outdoor scene in fixed-camera situations. 相似文献
18.
Surveillance cameras are vital source of information in crime investigations. A surveillance video must be recorded with correct field of view and be of good quality, otherwise, it may not be suitable for investigation or analysis purposes. Perpetrators may tamper the recorded video or the physical device itself, in order to conceal their dubious activities. Generally, surveillance systems are unmanned due to limitations of manual monitoring. Automatic detection of camera tamper events is crucial for timely operator intervention. We propose a new method for detecting video camera tampering events like occlusion, defocus and displacement. The features used are edge information, frame count, foreground objects’ coverage area and its static nature. Effectiveness of our method is tested through experimentation on public datasets. The results obtained are encouraging with high detection and low false alarm rates. The proposed method automatically detects routine problems with cameras like dirt on camera lens, fog and smoke. 相似文献
19.
In this paper, we propose a new computer-aided detection (CAD) – based method to detect pulmonary embolism (PE) in computed tomography angiography images (CTAI). Since lung vessel segmentation is the main objective to provide high sensitivity in PE detection, this method performs accurate lung vessel segmentation. To concatenate clogged vessels due to PEs, the starting region of PEs and some reference points (RPs) are determined. These RPs are detected according to the fixed anatomical structures. After lung vessel tree is segmented, the region, intensity, and size of PEs are used to distinguish them. We used the data sets that have heart disease or abnormal tissues because of lung disease except PE in this work. According to the results, 428 of 450 PEs, labeled by the radiologists from 33 patients, have been detected. The sensitivity of the developed system is 95.1% at 14.4 false positive per data set (FP/ds). With this performance, the proposed CAD system is found quite useful to use as a second reader by the radiologists. 相似文献
20.
Multimedia Tools and Applications - In this era of technology, digital images turn out to be ubiquitous in a contemporary society and they can be generated and manipulated by a wide variety of... 相似文献
|