首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
局部运动场景中运动对象的压缩域分割算法   总被引:1,自引:1,他引:0  
压缩域视频运动对象的分割技术近年来逐渐被人们所重视,并且成为较实用的技术之一.该文针对目前压缩域中运动对象分割质量不高的问题,提出一种压缩域分割的改进算法.该算法首先从MPEG压缩码流中提取运动矢量,并对矢量进行运动累加和中值滤波处理,最后利用贝叶斯风险决策对非零运动矢量区域进行校正,最终得到较好的分割结果.该文还定义了压缩域分割质量的评价参数分割率和错分率,为评价压缩域的分割结果提供了客观评价标准.  相似文献   

2.
面向语义视频检索,提出一种压缩域的目标分割新算法。它直接基于压缩码流中运动矢量和DCT系数,经过运动检测、矢量分水岭分割、目标融合与修正、后处理与跟踪等步骤提取空时视频目标。整个过程主要基于压缩域进行,无需视频码流的完全解码。对不同测试序列的实验测试结果显示算法能基于压缩域提取较为精确的空时视频目标,并具有较好的鲁棒性。  相似文献   

3.
Based on the classical fractal video compression method, an improved object-based stereo video compression scheme with Shape-Adaptive DCT is proposed in this paper. Firstly, we use more effective macroblock partition scheme instead of classical quadtree partition scheme; thus reducing the block searching strategy. The stereo fractal video coding is proposed which matches the macroblock with two reference frames in left and right view results in increasing compression ratio and reducing bit rate when transmitting compressed stereo data. The stereo codec combines the Motion Compensation Prediction (MCP) and Disparity Compensation Prediction (DCP). Fractal coding is adopted and each object is encoded independently by a prior video segmentation alpha plane, which is defined exactly as in MPEG-4. The testing results with the nature monocular and stereo video sequences provide promising performances at low bit rate coding. We believe it will be a powerful and efficient technique for the object-based monocular and stereo video sequences coding.  相似文献   

4.
This paper proposes a novel robust video watermarking scheme based on local affine invariant features in the compressed domain. This scheme is resilient to geometric distortions and quite suitable for DCT-encoded compressed video data because it performs directly in the block DCTs domain. In order to synchronize the watermark, we use local invariant feature points obtained through the Harris-Affine detector which is invariant to affine distortions. To decode the frames from DCT domain to the spatial domain as fast as possible, a fast inter-transformation between block DCTs and sub-block DCTs is employed and down-sampling frames in the spatial domain are obtained by replacing each sub-blocks DCT of 2×2 pixels with half of the corresponding DC coefficient. The above-mentioned strategy can significantly save computational cost in comparison with the conventional method which accomplishes the same task via inverse DCT (IDCT). The watermark detection is performed in spatial domain along with the decoded video playing. So it is not sensitive to the video format conversion. Experimental results demonstrate that the proposed scheme is transparent and robust to signal-processing attacks, geometric distortions including rotation, scaling, aspect ratio changes, linear geometric transforms, cropping and combinations of several attacks, frame dropping, and frame rate conversion.  相似文献   

5.
提出了一种基于模糊聚类的视频对象分割方法.首先通过对连续三帧视频图像进行二次差分来得到二次差分图像;然后估计噪声的特征参数滤除背景噪声,提取出视频对象的运动区域;再利用改进的FCM聚类算法对二次帧差图像中的视频对象运动区域进行空域分割,对空域分割结果进行形态学处理,得到视频对象掩模;最终获得较为理想的视频对象.实验结果表明,该算法能够较为准确地分割出视频对象,并且在空间准确度上占优.  相似文献   

6.
A scheme based on a difference scheme using object structures and color analysis is proposed for video object segmentation in rainy situations. Since shadows and color reflections on the wet ground pose problems for conventional video object segmentation, the proposed method combines the background construction-based video object segmentation and the foreground extraction-based video object segmentation where pixels in both the foreground and background from a video sequence are separated using histogram-based change detection from which the background can be constructed and detection of the initial moving object masks based on a frame difference mask and a background subtraction mask can be further used to obtain coarse object regions. Shadow regions and color-reflection regions on the wet ground are removed from the initial moving object masks via a diamond window mask and color analysis of the moving object. Finally, the boundary of the moving object is refined using connected component labeling and morphological operations. Experimental results show that the proposed method performs well for video object segmentation in rainy situations.  相似文献   

7.
为了解决当前目标跟踪中目标轮廓提取不精确的问 题,在对传统GVF (gradient vector flow)snake活动轮廓模型改进的基础上,提 出一种基于变化检测和改进的GVF snake活动轮廓模型的视频目标轮廓提取算法。首先,通 过 基于t显著性检验的变化检测方 法消除背景边界的影响,并获取初始运动变化区域的临界四边形作为GVF snake的初始轮廓 。然后,对初始轮廓应用改进 的GVF snake模型以获得精确的轮廓边界。改进模型采用4方向各项异性扩散,并采用下降速 度较快的保真项系数以增强 GVF snake进入凹陷的能力,且保持对弱边界的收敛。本文方法克服了手动绘制初始轮廓的 缺点,对传统GVF snake方法进 行了改进,且空间准确度(SA)有很大提高。实验表明 ,本文方法成功分割出目标凹陷部分并对弱边界有较好的收敛效果,提高了轮廓提取的精确 度。  相似文献   

8.
Moving object segmentation in DCT-based compressed video   总被引:2,自引:0,他引:2  
A block-based automatic segmentation algorithm has been developed for detecting and tracking moving objects in DCT-based compressed video. The proposed algorithm segments moving objects with block resolution using the stochastic behaviour of the image blocks in the DCT domain  相似文献   

9.
Robustness analysis on facial image description in DCT domain   总被引:1,自引:0,他引:1  
  相似文献   

10.
该文提出了一种工作于MPEG压缩域的快速视频目标分割算法。该算法以从MPEG1/2码流中部分解码提取的特征为输入,提取P帧中的运动目标。针对一般的压缩域算法目标边界精度不高的特点,算法采用I帧和P帧中每个块的直流DCT系数和3个交流DCT系数,以及运动补偿信息,重建出P帧的原图像1/16大小的子图像,采用快速平均移聚类得到具有较高边界精度的亮度一致的区域;针对运动向量的噪声容易造成错误检测的缺点,算法结合聚类分析结果和运动块的分布,采用基于马尔可夫随机场的统计标号方法对目标和背景区域进行分类,得到每个P帧的目标掩模。该算法可以得到44子块的边界精度,对于CIF格式的码流,在Pentium IV 2GHz平台上可以达到每秒40帧的处理速度。  相似文献   

11.
视频对象分割与两种面向对象的视频编码器   总被引:9,自引:0,他引:9       下载免费PDF全文
翁南钐  蔡德钧 《电子学报》2000,28(10):106-110
在基于对象的视频编码中,视频对象的分割是重要的任务.本文研究一种利用位移帧差(DFD)的高阶统计特性和数学形态学算子的视频对象自动分割方法.这种方法首先根据一组转移帧差的高阶矩来得到一个大体覆盖运动对象的区域(模板),接着用形态学的腐蚀算子从模板的边沿向内腐蚀,直到对象的边沿.提出一种简单而高效的基于查找人头肩像轮廓最大转折点的头肩分离算法.在分割的基础上,用软件实现了一种基于MPEG-4的视频编码系统.提出一种面向对象分配带宽(OOBA——object-oriented bandwidth allocation)的极低比特率视频编码器.该编码器与传统基于帧的视频编码器相比,在低比特率环境下,PSNR略为下降,但图像的主观视觉质量得到提高.  相似文献   

12.
结合多种语义信息的半自动视频对象分割   总被引:2,自引:2,他引:0       下载免费PDF全文
陈韩锋  戚飞虎 《电子学报》2002,30(Z1):2012-2015
本文提出了一种半自动视频对象分割方法.该方法结合了多种视频语义信息,来提高分割的效率和分割方法的通用性.在视频的初始帧中用半自动绘制的多边形初始化目标对象区域;然后在后续帧中自动跟踪对象区域,对于一般性对象采用基于后向块匹配的像素对应方法进行跟踪;针对平移运动的刚体对象和变化很缓慢的对象本文提出了专门的跟踪方法;最后利用一种基于同等组的模板修正方法来修正每一帧的分割模板.利用本文的方法获得了精确稳定的实验结果.  相似文献   

13.
一种自适应采样率视频压缩感知方案   总被引:1,自引:1,他引:0  
为了进一步提高视频压缩感知方案的重构图像质量,提出了一种新的自适应采样方案.在该方案中,根据不同图像块的稀疏度自适应分配采样率.在对各图像块分类判决时,首先判断图像块在离散余弦变换域的稀疏度,其次结合该图像块与其参考帧之间的时域相关性,确定图像块的分类.实验结果表明,与现有的自适应采样率分配方案相比,该算法可获得0.5 dB左右的峰值信噪比增益.  相似文献   

14.
赵慧民  赖剑煌  蔡君  陈小玲 《电子学报》2013,41(6):1153-1158
 针对视频水印在帧内篡改检测方面定位精度的不足,通过压缩感知对MPEG-4(Moving Picture Experts Group-4)视频内容的特征表示,提出一种新的视频水印生成方法及其帧内篡改检测算法.该算法由压缩感知DCT(Discrete Cosine Transform)测量矩阵对I-VOP(Intra-Video Object Plane)图像提取U、V特征参数,生成基于内容的压缩感知视频水印数据并嵌入到图像Y分量的DCT中高频系数中实现帧内篡改检测.实验结果表明,与Hash视频水印算法比较,压缩感知视频水印数据具有更好的恢复能力,且水印算法对视频帧内篡改定位精度更高.  相似文献   

15.
赵锟  张兆扬  刘志 《电视技术》2007,31(9):15-17,26
讨论了压缩域视频对象分割技术的发展和研究现状,介绍了几种典型的分割算法,并具体指出一些需要深入研究的问题.  相似文献   

16.
This paper presents a novel coarse to fine moving object segmentation framework for H.264/AVC compressed videos. The proposed framework integrates the global motion estimation and global motion compensation steps in the segmentation pipeline unlike previous techniques which did not consider such an integration. The integration is based on testing for presence of global motion by classifying the interframe motion vectors into moving camera class and still camera class. The decision boundary separating these two classes is learnt from the training video data. The integration automates the moving object segmentation to be applicable for static, moving and combination of static/moving camera cases which to the best of our knowledge has not been carried out earlier. Further, a novel coarse segmentation technique is proposed by decomposing the inter-frame motion vectors into wavelet sub-bands and utilizing logical operations on LH, HL and HH sub-band wavelet coefficients. The premise is based on the fact that since the LH, HL and HH sub-bands contain the detail information pertaining to horizontal, vertical and diagonal moving blocks respectively, they can be exploited to identify the coarse moving boundaries. The coarse segmentation is fast in comparison to state-of-the-art coarse segmentation methods as demonstrated by our experiments. Finally, these coarse boundaries are modeled in an energy minimization framework and shown that by minimizing the energy using graph cut optimization the segmentation is refined to obtain the fine segmentation. The proposed framework is tested on a number of standard video sequences encoded with H.264/AVC JM encoder and comparison is carried out with state-of-the-art compressed domain moving object segmentation methods as well as with an existing state-of-the-art pixel domain method to establish and validate the proposed moving object segmentation framework.  相似文献   

17.
一种基于内容的快速视频运动对象分割算法   总被引:1,自引:0,他引:1  
自从MPEG-4和MPEG-7标准公布以来。基于内容的视频编码与基于对象形状的检索成为视频领域新的发展趋势,而准确地从背景中分割出视频对象是实现上述功能的前提条件。对视频运动对象分割算法进行了研究,该算法结合对称差分和自适应阚值选取,获得变化检测模板,并最终分割出视频运动对象。实验结果表明该算法对运动对象敏感,能实时准确地分割出视频运动对象。但对象的关节运动以及背景的全局运动都将导致分割精度的降低,这也是今后研究的重点。  相似文献   

18.
李晓光  李晓华  沈兰荪 《电子学报》2005,33(12):2170-2173
压缩域人脸检测在图像/视频信息处理中具有重要意义.本文提出了一种基于多级梯度能量特征的DCT (Discrete Cosine Transform)压缩域人脸检测算法.依据DCT压缩图像色差信号的直流系数进行肤色分割,减小检测范围.在分割为肤色的区域提取多级梯度能量特征,即利用不同大小的检测窗口提取归一化的特征向量,表示不同大小的人脸.特征向量输入到级联分类器中分类,确定是否表示人脸.级联分类器由若干简单分类器和一个神经网络分类器构成.简单分类器利用一些先验知识排除大部分明显不是人脸的特征向量,通过简单分类器的特征由神经网络最终确定是否表示人脸.多级梯度能量特征与DCT域图像缩放相结合实现了对不同大小人脸的快速检测.对多级梯度能量特征的定义,减少了检测算法中压缩域图像缩放的次数,从而大幅度减少了计算复杂度,提高了检测速度.实验结果表明提出的多级梯度能量特征可有效描述DCT域人脸模式,同时也证明了该算法的快速有效性.  相似文献   

19.
Video text information plays an important role in semantic-based video analysis, indexing and retrieval. Video texts are closely related to the content of a video. Usually, the fundamental steps of text-based video analysis, browsing and retrieval consist of video text detection, localization, tracking, segmentation and recognition. Video sequences are commonly stored in compressed formats where MPEG coding techniques are often adopted. In this paper, a unified framework for text detection, localization, and tracking in compressed videos using the discrete cosines transform (DCT) coefficients is proposed. A coarse to fine text detection method is used to find text blocks in terms of the block DCT texture intensity information. The DCT texture intensity of an 8×8 block of an intra-frame is approximately represented by seven AC coefficients. The candidate text block regions are further verified and refined. The text block region localization and tracking are carried out by virtue of the horizontal and vertical block texture intensity projection profiles. The appearing and disappearing frames of each text line are determined by the text tracking. The final experimental results show the effectiveness of the proposed methods.  相似文献   

20.
The new MPEG-4 video coding standard enables content-based functions. In order to support the new standard, frames should be decomposed into Video Object Planes (VOP), each VOP representing a moving object. This paper proposes an image segmentation method to separate moving objects from image sequences. The proposed method utilizes the spatial-temporal information. Spatial segmentation is applied to divide each image into connected areas and to find pre~:ise object boundaries of moving objects. To locate moving objects in image sequences, two consecutive image frames in the temporal direction are examined and a hypothesis testing is performed with Neyman-Pearson criterion. Spatial segmentation produces a spatial segmentation mask, and temporal segmentation yields a change detection mask that indicates moving objects and the background. Then spatial-temporal merging can be used to get the final results. This method has been tested on several images. Experimental results show that this segmentation method is efficient.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号