期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

An automatic 2D to 3D video conversion approach based on RGB-D images

Pan Baiyu Zhang Liming Yin Hanxiong Lan Jun Cao Feilong 《Multimedia Tools and Applications》2021,80(13):19179-19201

3D movies/videos have become increasingly popular in the market; however, they are usually produced by professionals. This paper presents a new technique for the automatic conversion of 2D to 3D video based on RGB-D sensors, which can be easily conducted by ordinary users. To generate a 3D image, one approach is to combine the original 2D color image and its corresponding depth map together to perform depth image-based rendering (DIBR). An RGB-D sensor is one of the inexpensive ways to capture an image and its corresponding depth map. The quality of the depth map and the DIBR algorithm are crucial to this process. Our approach is twofold. First, the depth maps captured directly by RGB-D sensors are generally of poor quality because there are many regions missing depth information, especially near the edges of objects. This paper proposes a new RGB-D sensor based depth map inpainting method that divides the regions with missing depths into interior holes and border holes. Different schemes are used to inpaint the different types of holes. Second, an improved hole filling approach for DIBR is proposed to synthesize the 3D images by using the corresponding color images and the inpainted depth maps. Extensive experiments were conducted on different evaluation datasets. The results show the effectiveness of our method.

相似文献

2.

RGB-D image saliency detection from 3D perspective

Liu Zhengyi Song Tengfei Xie Feng 《Multimedia Tools and Applications》2019,78(6):6787-6804

Multimedia Tools and Applications - With the advent of stereo camera saliency object detection for RGB-D image is attracting more and more interest. Most existing algorithms treat RGB-D image as... 相似文献

3.

A two-stage scheme for text detection in video images

Marios Anthimopoulos Basilis Gatos Ioannis Pratikakis 《Image and vision computing》2010

This paper proposes a two-stage system for text detection in video images. In the first stage, text lines are detected based on the edge map of the image leading in a high recall rate with low computational time expenses. In the second stage, the result is refined using a sliding window and an SVM classifier trained on features obtained by a new Local Binary Pattern-based operator (eLBP) that describes the local edge distribution. The whole algorithm is used in a multiresolution fashion enabling detection of characters for a broad size range. Experimental results, based on a new evaluation methodology, show the promising overall performance of the system on a challenging corpus, and prove the superior discriminating ability of the proposed feature set against the best features reported in the literature. 相似文献

4.

视频监控场景下的面部遮挡检测

周仁琴《计算机工程与应用》2015,51(4):192-195

提出了一种监控场景下的面部遮挡检测方法。基于AdaBoost算法进行人脸验证,通过面部划分,分块分析是否存在遮挡情况。首先判断是否有人进入,在有人进入的情况下进行面部遮挡检测,对眼部区域采用AdaBoost方法及墨镜特征提取方法判断是否遮挡,而对嘴部区域采用高斯肤色模型进行判断。实验结果表明,该方法能实时检测面部遮挡的情况,并达到了较好的效果,适用于银行ATM等监控场景,具有较高的应用价值。相似文献

5.

A benchmark dataset and baseline model for co-salient object detection within RGB-D images

Yang Ning Zhang Chen Zhang Yumo Yang Haowei Du Ling 《Multimedia Tools and Applications》2022,81(25):35831-35842

Multimedia Tools and Applications - Within-image co-salient object detection (wCoSOD) identifies the common and salient objects within an image, which can benefit for many applications, such as... 相似文献

6.

一种交通监控场景下的多车道检测方法

王镇波余志赵建华李熙莹罗东华《计算机工程与应用》2012,48(12):14-18,23

为自动有效地获取交通监控场景中的多车道信息,提出一种利用骨架化边缘的多车道检测算法,以克服视频处理对固定场景和明确的先验车道位置信息的依赖。算法主要针对静态的交通背景图处理,采用背景提取、滤波和数字形态学预处理等,由Hough变换确定车道位置的骨架线;由行车方向约束车道线角度,利用车道线几何成像特性检测出准车道线,获取车道线和车道区域。实验表明,对不同的交通场景和不同光照条件,该方法能有效检测多车道,鲁棒性强,具有较高的工程应用价值。相似文献

7.

Illumination normalization with time-dependent intrinsic images for video surveillance 总被引：1，自引：0，他引：1

Matsushita Y Nishino K Ikeuchi K Sakauchi M 《IEEE transactions on pattern analysis and machine intelligence》2004,26(10):1336-1347

Variation in illumination conditions caused by weather, time of day, etc., makes the task difficult when building video surveillance systems of real world scenes. Especially, cast shadows produce troublesome effects, typically for object tracking from a fixed viewpoint, since it yields appearance variations of objects depending on whether they are inside or outside the shadow. In this paper, we handle such appearance variations by removing shadows in the image sequence. This can be considered as a preprocessing stage which leads to robust video surveillance. To achieve this, we propose a framework based on the idea of intrinsic images. Unlike previous methods of deriving intrinsic images, we derive time-varying reflectance images and corresponding illumination images from a sequence of images instead of assuming a single reflectance image. Using obtained illumination images, we normalize the input image sequence in terms of incident lighting distribution to eliminate shadowing effects. We also propose an illumination normalization scheme which can potentially run in real time, utilizing the illumination eigenspace, which captures the illumination variation due to weather, time of day, etc., and a shadow interpolation method based on shadow hulls. This paper describes the theory of the framework with simulation results and shows its effectiveness with object tracking results on real scene data sets. 相似文献

8.

Fuzzy clustering algorithms for unsupervised change detection in remote sensing images 总被引：4，自引：0，他引：4

Ashish Ghosh Niladri Shekhar Mishra 《Information Sciences》2011,181(4):699-715

In this paper, we propose a context-sensitive technique for unsupervised change detection in multitemporal remote sensing images. The technique is based on fuzzy clustering approach and takes care of spatial correlation between neighboring pixels of the difference image produced by comparing two images acquired on the same geographical area at different times. Since the ranges of pixel values of the difference image belonging to the two clusters (changed and unchanged) generally have overlap, fuzzy clustering techniques seem to be an appropriate and realistic choice to identify them (as we already know from pattern recognition literatures that fuzzy set can handle this type of situation very well). Two fuzzy clustering algorithms, namely fuzzy c-means (FCM) and Gustafson-Kessel clustering (GKC) algorithms have been used for this task in the proposed work. For clustering purpose various image features are extracted using the neighborhood information of pixels. Hybridization of FCM and GKC with two other optimization techniques, genetic algorithm (GA) and simulated annealing (SA), is made to further enhance the performance. To show the effectiveness of the proposed technique, experiments are conducted on two multispectral and multitemporal remote sensing images. A fuzzy cluster validity index (Xie-Beni) is used to quantitatively evaluate the performance. Results are compared with those of existing Markov random field (MRF) and neural network based algorithms and found to be superior. The proposed technique is less time consuming and unlike MRF does not require any a priori knowledge of distributions of changed and unchanged pixels. 相似文献

9.

Histogram thresholding for unsupervised change detection of remote sensing images

Swarnajyoti Patra Susmita Ghosh 《International journal of remote sensing》2013,34(21):6071-6089

The change-detection problem can be viewed as an unsupervised classification problem with two classes corresponding to changed and unchanged areas. Image differencing is a widely used approach to change detection. It is based on the idea of generating a difference image that represents the modulus of the spectral change vectors associated with each pixel in the study area. To separate out the changed and unchanged classes in the difference image automatically, any unsupervised technique can be used. Thresholding is one of the cheapest techniques among them. However, in thresholding approaches, selection of the best threshold value is not a trivial task. In this work, several non-fuzzy and fuzzy histogram thresholding techniques are investigated and compared for the change-detection problem. Experimental results, carried out on different multitemporal remote sensing images (acquired before and after an event), are used to assess the effectiveness of each of the thresholding techniques. Among all the thresholding techniques investigated here, Liu's fuzzy entropy followed by Kapur's entropy are found to be the most robust techniques. 相似文献

10.

Improving articulated hand pose detection for static finger sign recognition in RGB-D images

Elboushaki Abdessamad Hannane Rachida Afdel Karim Koutti Lahcen 《Multimedia Tools and Applications》2020,79(39-40):28925-28969

Multimedia Tools and Applications - With the emergence of consumer RGB-D sensors, discriminative modeling has been shown to perform well in estimating human body pose. However, articulated hand... 相似文献

11.

Superpixel-based active contour model for unsupervised change detection from satellite images

Ming Hao Kazhong Deng Qiyan Feng 《International journal of remote sensing》2016,37(18):4276-4295

This study proposes a superpixel-based active contour model (SACM) for unsupervised change detection from satellite images. The accuracy of change detection produced by the traditional active contour model suffers from the trade-off parameter. The SACM is designed to address this limitation through the incorporation of the spatial and statistical information of superpixels. The proposed method mainly consists of three steps. First, the difference image is created with change vector analysis method from two temporal satellite images. Second, statistical region merging method is applied on the difference image to produce a superpixel map. Finally, SACM is designed based on the superpixel map to detect changes from the difference image. The SACM incorporates spatial and statistical information and retains the accurate shapes and outlines of superpixels. Experiments were conducted on two data sets, namely Landsat-7 Enhanced Thematic Mapper Plus and SPOT 5, to validate the proposed method. Experimental results show that SACM reduces the effects of the trade-off parameter. The proposed method also increases the robustness of the traditional active contour model for input parameters and improves its effectiveness. In summary, SACM often outperforms some existing methods and provides an effective unsupervised change detection method. 相似文献

12.

基于LVQ神经网络的视频监控图像火焰检测与识别算法

王雨陈淑荣《微型机与应用》2012,31(6):39-42

针对传统火灾火焰探测技术存在不稳定、误判率高的缺点,提出了一种基于人工神经网络的火焰检测与识别算法。通过分析火焰图像的动态特性,利用火焰图像序列的离心率、放射性和整体移动等特征信息,结合学习向量量化(LVQ)神经网络进行训练仿真。实验结果表明,该算法能有效提高监控视频图像中可疑火焰的快速分类,稳定性强,具有较高的火焰识别准确率。相似文献

13.

Real-time detection of steam in video images

R.J. Ferrari Author Vitae H. Zhang Author Vitae 《Pattern recognition》2007,40(3):1148-1159

In this paper, we present a real-time image processing technique for the detection of steam in video images. The assumption made is that the presence of steam acts as a blurring process, which changes the local texture pattern of an image while reducing the amount of details. The problem of detecting steam is treated as a supervised pattern recognition problem. A statistical hidden Markov tree (HMT) model derived from the coefficients of the dual-tree complex wavelet transform (DT-CWT) in small 48×48 local regions of the image frames is used to characterize the steam texture pattern. The parameters of the HMT model are used as an input feature vector to a support vector machine (SVM) technique, specially tailored for this purpose. By detecting and determining the total area covered by steam in a video frame, a computerized image processing system can automatically decide if the frame can be used for further analysis. The proposed method was quantitatively evaluated by using a labelled image data set with video frames sampled from a real oil sand video stream. The classification results were 90% correct when compared to human labelled image frames. The technique is useful as a pre-processing step in automated image processing systems. 相似文献

14.

Optimizing human model reconstruction from RGB-D images based on skin detection

Guang Chen Jituo Li Jiping Zeng Bei Wang Guodong Lu 《Virtual Reality》2016,20(3):159-172

This paper reconstructs human model from multi-view RGB-D images of an Xbox One Kinect. We preprocess the depth images by implicit surface de-noising and then part-wisely register them into a point cloud. A template model is selected from the human model database to fit the registered point cloud of a human body by Laplacian deformation. Skin detection of RGB-D images helps to tightly constrain the skin parts of human body in template fitting step in order to get more precise and lifelike human model. We propose a robust skin detection method that is not affected by clothing pattern and background. Experiments demonstrate the effectiveness of our method. 相似文献

15.

A study on various methods used for video summarization and moving object detection for video surveillance applications

A. Senthil Murugan K. Suganya Devi A. Sivaranjani P. Srinivasan 《Multimedia Tools and Applications》2018,77(18):23273-23290

With the advancement in digital video technology, video surveillance has been playing its vital role for ensuring safety and security. The surveillance systems are deployed in wide range of applications to invigilate stuffs and to analyse the activities in the environment. From the single or multi surveillance camera, a huge amount of data is generated, stored and processed for security purpose. Due to time constraints, it is a very tedious process for an analyst to go through the full content. This limitation has been overcome by the use of video summarization. The video summarization is intended to afford comprehensible analysis of video by removing duplications and extracting key frames from the video. To make an easily interpreted outline, the various available video summarization methods will try to shot the summary of the main occurrences, scenes, or objects in a frame. Depending on the applications, it is required to summarize the happenings in the scene and detect the objects (static/dynamic) which is recorded in the video. Hence this paper provides the various methods used for video summarization and a comparative study of different techniques. It also presents different object detection, object classification and object tracking algorithms available in the literature. 相似文献

16.

A robust video watermarking technique for the tamper detection of surveillance systems

Farnaz Arab Shahidan M. Abdullah Siti Zaiton Mohd Hashim Azizah Abdul Manaf Mazdak Zamani 《Multimedia Tools and Applications》2016,75(18):10855-10885

相似文献

17.

Performance evaluation of object detection algorithms for video surveillance 总被引：3，自引：0，他引：3

Nascimento J.C. Marques J.S. 《Multimedia, IEEE Transactions on》2006,8(4):761-774

In this paper, we propose novel methods to evaluate the performance of object detection algorithms in video sequences. This procedure allows us to highlight characteristics (e.g., region splitting or merging) which are specific of the method being used. The proposed framework compares the output of the algorithm with the ground truth and measures the differences according to objective metrics. In this way it is possible to perform a fair comparison among different methods, evaluating their strengths and weaknesses and allowing the user to perform a reliable choice of the best method for a specific application. We apply this methodology to segmentation algorithms recently proposed and describe their performance. These methods were evaluated in order to assess how well they can detect moving regions in an outdoor scene in fixed-camera situations. 相似文献

18.

Automated camera sabotage detection for enhancing video surveillance systems

Sitara K. Mehtre B. M. 《Multimedia Tools and Applications》2019,78(5):5819-5841

Surveillance cameras are vital source of information in crime investigations. A surveillance video must be recorded with correct field of view and be of good quality, otherwise, it may not be suitable for investigation or analysis purposes. Perpetrators may tamper the recorded video or the physical device itself, in order to conceal their dubious activities. Generally, surveillance systems are unmanned due to limitations of manual monitoring. Automatic detection of camera tamper events is crucial for timely operator intervention. We propose a new method for detecting video camera tampering events like occlusion, defocus and displacement. The features used are edge information, frame count, foreground objects’ coverage area and its static nature. Effectiveness of our method is tested through experimentation on public datasets. The results obtained are encouraging with high detection and low false alarm rates. The proposed method automatically detects routine problems with cameras like dirt on camera lens, fog and smoke.

相似文献

19.

A novel method for pulmonary embolism detection in CTA images

Haydar Özkan Onur Osman Sinan Şahin Ali Fuat Boz 《Computer methods and programs in biomedicine》2014

In this paper, we propose a new computer-aided detection (CAD) – based method to detect pulmonary embolism (PE) in computed tomography angiography images (CTAI). Since lung vessel segmentation is the main objective to provide high sensitivity in PE detection, this method performs accurate lung vessel segmentation. To concatenate clogged vessels due to PEs, the starting region of PEs and some reference points (RPs) are determined. These RPs are detected according to the fixed anatomical structures. After lung vessel tree is segmented, the region, intensity, and size of PEs are used to distinguish them. We used the data sets that have heart disease or abnormal tissues because of lung disease except PE in this work. According to the results, 428 of 450 PEs, labeled by the radiologists from 33 patients, have been detected. The sensitivity of the developed system is 95.1% at 14.4 false positive per data set (FP/ds). With this performance, the proposed CAD system is found quite useful to use as a second reader by the radiologists. 相似文献

20.

A novel deep learning framework for copy-moveforgery detection in images

Elaskily Mohamed A. Elnemr Heba A. Sedik Ahmed Dessouky Mohamed M. El Banby Ghada M. Elshakankiry Osama A. Khalaf Ashraf A. M. Aslan Heba K. Faragallah Osama S. Abd El-Samie Fathi E. 《Multimedia Tools and Applications》2020,79(27-28):19167-19192

Multimedia Tools and Applications - In this era of technology, digital images turn out to be ubiquitous in a contemporary society and they can be generated and manipulated by a wide variety of... 相似文献