首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
视频文字信息在基于语义的视频分析、检索、提取中占有重要地位。根据视频中文字和背景的灰度变化程度不同,提出一种基于梯度离散余弦变换的视频文字定位方法:先对视频帧进行NX V分块,计算每一块的离散余弦变换系数,然后求出梯度算子的幅值,利用得到的幅值作为块强度进行平滑滤波以及形态学处理,最后对图像进行水平和垂直方向投影,统计字幕条数,并利用文本框标识文字区域,进而达到对视频文字定位的目的。仿真结果表明这种视频文字定位方法对于静态文字和滚动字幕的定位均是可行的,且其算法的运行速度快、效率高,特别是对于笔画较少的文字定位准确,不会出现遗漏现象。  相似文献   

2.
This paper presents a real time video watermarking algorithm based on the singular value decomposition (SVD). The embedding procedure combines the DCT and SVD. The SVD is applied on the low frequency AC coefficients of block DCT of the frame. Then the relation of neighbor coefficients of those middle frequency bands in the singular values is modified. The hidden information can be extracted without the original video or the order of the original singular values, i.e., this is a blind watermarking algorithm. Experiments show that the algorithm is very robust against MPEG-2 compression, median filtering, small shift, and rotation, etc.  相似文献   

3.
针对窄带网络的视频信号传输问题,分析了传统视频代码转换帧速率转换时,由于运动矢量非最佳化所造成的图象质量下降的原因,并提出了一种基于量化误差的自自动化运动矢量模型,从而减小了搜索域,使最佳化输出运行矢量能进行快速运动估值;同时根据灰度系统理论,提出了一种有效的灰度预测搜索方法,另外,又根据DCT系数理论模型。采用自适应快速视频编码方法,进一步提高了编码速度,实验结果表明:该方法不仅改善了视频图象质量,而且计算复杂度也大大减小。  相似文献   

4.
针对H.264/AVC (Advanced Video Coding)视频信息隐藏过程中产生的不可感知性、嵌入容量、鲁棒性三方面不平衡的问题,提出一种通过选取不同优先级顺序的DCT (Discrete Cosine Transform)系数集合实现嵌入,进而减少帧间漂移失真的信息隐藏算法.首先对秘密信息M进行卷积编码,得到编码后的信息,以此提高视频的鲁棒性;然后通过分析帧间漂移失真产生的原因,计算出不同DCT系数优先级顺序的分类集合,优先在失真小的系数集合中嵌入数据从而减小帧间漂移失真;最后根据预先定义的嵌入规则,将嵌入在所选择4×4亮度块的DCT系数集合中,提高嵌入容量.最终在解码端能够正确提取秘密信息同时恢复原始信息.实验结果表明,本文提出的算法能在很好的保证视频质量的前提下,增加视频的嵌入容量,提高鲁棒性.  相似文献   

5.
DCT域图象处理和特征提取技术   总被引:7,自引:1,他引:7       下载免费PDF全文
现今 ,大量的图象与视频信息都是以压缩数据格式进行存储和传输的 .DCT(Discrete Cosine Transform离散余弦变换 )是目前应用最为广泛的多媒体数据压缩技术之一 .直接在 DCT域实现如视频编辑、特征提取等传统空域处理技术 ,能够避免繁琐的压缩数据编、解码操作 ,减少处理时间和数据处理量 ,节省内存空间 .这一技术对于高速海量的数据处理场合 ,如 Internet信息检索、视频编辑和检索、远程监视图象的理解等 ,是很有吸引力的 ,因此其是近年来国际上有关领域的研究热点之一 .本文对近年来文献中所见的 DCT域图象处理和特征提取技术进行了回顾和综述 ,并在此基础上对其发展方向进行了探讨  相似文献   

6.
为解决网络视频的非法拷贝问题,提出一种基于峭度图像的视频指纹算法。对视频片段进行预处理后,利用均匀分布的随机变量提取关键帧以及关键帧的峭度图像,并对峭度图像进行离散余弦变换(DCT),采用较大的DCT系数构造视频指纹,在视频指纹的匹配过程中,通过滑动窗的方法对不同长度的指纹进行匹配,从而达到视频认证的目的。实验结果证明,该算法提取的视频指纹在常见视频攻击下误码率均小于10%。  相似文献   

7.
一种从压缩视频流中提取关键帧的方法   总被引:13,自引:4,他引:13  
关键帧提取技术是视频分析和基于内容的视频检索的基础。关键帧的使用大大减少了视频索引的数据量,同时也为视频和检索提供了一个组织框架。论文首先简单介绍目前的关键帧提取技术,然后提出了一种基于DC系数和运动矢量直接从MPEG压缩视频提取关键帧的方法,无需全部解压,计算复杂度低,大大提高了提取速度。实验证明该方法计算量小,可以较好地代表视频内容。  相似文献   

8.
基于DC系数和运动矢量的快速场景分割算法   总被引:1,自引:0,他引:1  
场景分割技术是动态视频分析和基于内容的视频检索的基础,以检测出来的场景作为基本单元,可以进一步对视频内容进行分析和建立索引.本文旨在提出一种基于MPEG压缩视频流的场景分割算法,利用MPEG数据流中已有的DCT DC系数和运动矢量,来检测场景的变换,从而实现场景分割,针对实际视频流中场景突变和渐变两类变换.本文提出两种方法分别处理不同情况,对于突变检测,该算法可以定位到帧,由于该算法进行最小程度的解码,降低了计算复杂度,因而大大提高了检测速度.  相似文献   

9.
基于混沌的AVS视频压缩加密算法   总被引:2,自引:1,他引:1       下载免费PDF全文
宋永中  王毅  刘东华 《计算机工程》2009,35(14):158-159
针对AVS的编码结构,提出一种基于混沌理论的AVS视频加密算法,并集成到AVS编/解码器中。该算法采用实值混沌序列加密和置乱DCT非零系数来加密视频信息。采用3个标准视频序列进行算法仿真,在为AVS编写的rm52j软件平台上进行实验,结果表明,该算法加密速度快,安全性高,对压缩比影响较少。  相似文献   

10.
A novel spatio-temporal filter for video denoising, which operates entirely in the wavelet domain, is proposed. For effective noise reduction, the spatial and temporal redundancies that exist in the wavelet domain representation of a video signal are exploited. First, a 2D discrete wavelet transform is applied to the input noisy frames. This is followed by a discrete cosine transform (DCT), which is applied to the temporal subband coefficients to minimise the redundancy among the consecutive frames. The DCT transformed, noise-free coefficients in the different wavelet domain subbands for the original image sequence are modelled using a prior having a generalised Gaussian distribution. On the basis of this prior, filtering of the noisy wavelet coefficients in each subband is carried out using a new, low-complexity wavelet shrinkage method, which utilises the correlation that exists between subsequent resolution levels. Experimental results show that the proposed scheme outperforms several state-of-the-art spatio-temporal filters in terms of both the peak signal-to-noise ratio and the visual quality  相似文献   

11.
一种基于DCT和NCG的数字视频水印算法   总被引:2,自引:0,他引:2  
随着计算机和网络技术的快速发展,数字水印技术正成为一种数字媒体知识产权保护的有效手段.提出一种基于DCT和NCG的数字视频水印算法.它根据NCG,对视频帧进行分类,选择具有复杂纹理特性的帧,进行分块DCT,然后比较DCT后的一对中频系数而嵌入水印.仿真实验表明,该算法具有较好的透明性,能实现水印盲检测,可抵抗常见的视频处理及攻击,如压缩、滤波、添加噪声、裁剪等.  相似文献   

12.
由于视频数据是时间轴上的二维图像序列,提出了利用分形和三维离散余弦变换相结合的视频编码方法。利用三维离散变换把三维视频数据从时空域变换到频域中,再利用分形技术在频域中来寻找对应每个可变三维频域值域块的最佳定义域块匹配。由于频域中DCT系数的强相关性和分形的高压缩性能,能够实现视频数据的高压缩。试验证明对于非实时处理低比特流视频,有一些应用前景。  相似文献   

13.
提出了一种在H.264中减小DCT变换和量化计算量的新的有效方法。通过理论分析,研究了Normal4×4,LumaDC4×4和ChramaDC2×2这三种变换的系数动态分布,进而对变换和量化方法的三种类型提出相应的DCT系数量化为零三个充分条件。与文献中所提到的其它方法相比,该方法更加有效、精确。理论分析和实验结果表明:在减小计算复杂度、编码视频质量、错误接受率(false acceptance rate),错误拒绝率(false rejection rate)等方面,该方法都优于其它方法。  相似文献   

14.
增强无线视频图像传输差错恢复能力的方法研究   总被引:3,自引:0,他引:3       下载免费PDF全文
对无线网络环境下视频图像的鲁棒性传输问题进行了研究,提出了一种增强视频图像传输差错恢复能力的新方法。在编码器端采用基于DCT(discrete cosine transform)矩阵系数的快速分类算法提取图像区域分类特征与再同步信息等构成指示信息,并用考虑量化误差后改进的DCT系数强制奇偶修改方法将其嵌入到图像码流中,用于解码器端视频图像序列的差错检测、再同步和差错恢复,以此增强基于H.263 的视频图像传输的鲁棒性。实验结果表明,此方法可以有效地改善标准编解码器的传输质量,不额外增加信源传输码字,计算成本较小。  相似文献   

15.
A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy through a new constrained global optimization approach, which can be solved in a closed form and independently of video frame content. Fifty video sequences (300 frames each) and eye-tracking data from 14 subjects were collected to evaluate both the accuracy of the attention prediction model and the subjective quality of the encoded video. Results show that the area under the curve of the guidance map is 0.773 ± 0.002, significantly above chance (0.500). Using a new eye-tracking-weighted PSNR (EWPSNR) measure of subjective quality, more than 90% of the encoded video clips with the proposed method achieve better subjective quality compared to standard encoding with matched bit rate. The improvement in EWPSNR is up to over 2 dB and on average 0.79 dB.  相似文献   

16.
In this paper, a visual object tracking method is proposed based on sparse 2-dimensional discrete cosine transform (2D DCT) coefficients as discriminative features. To select the discriminative DCT coefficients, we give two propositions. The propositions select the features based on estimated mean of feature distributions in each frame. Some intermediate tracking instances are obtained by (a) computing feature similarity using kernel, (b) finding the maximum classifier score computed using ratio classifier, and (c) combinations of both. Another intermediate tracking instance is obtained using incremental subspace learning method. The final tracked instance amongst the intermediate instances are selected by using a discriminative linear classifier learned in each frame. The linear classifier is updated in each frame using some of the intermediate tracked instances. The proposed method has a better tracking performance as compared to state-of-the-art video trackers in a dataset of 50 challenging video sequences.  相似文献   

17.
王田 《计算机工程》2004,30(22):157-159
提出了一种增强无线网络环境下视频图像传输差错恢复能力的新方法。利用改进的DCT系数强制奇偶修改方法将辅助信息嵌入到视频图像中,用于解码器端完成视频图像序列差错检测,再同步,差错恢复,以增强H.263 标准解码器的差错恢复功能,获得更好的图像重构质量,实验仿真结果表明此方法可以在不额外增加信源传输码字的前提下,有效地改善传输质量。  相似文献   

18.
This paper presents a unique set of techniques to support reliable and efficient video transmission over mobile channels. The transmission system is comprised of an M level Quadrature Amplitude Modulation (QAM) technique. A twin class uniform and non-uniform partitioned M-QAM system is used to transport a compressed video bitstream which is partitioned to match the bit-error sensitivity of the transmitted symbol in terms of mapping in the constellation diagram and picture quality. Video partitioning based on a separation of the Variable Length Coded (VLC) Discrete Cosine Transforms (DCT) coefficients within each block is considered for constant bitrate transmission (CBR). Various scenarios for splitting the bitstream are investigated and their results are compared and analysed thoroughly. The performance of the transmission system is evaluated under Additive White Gaussian Noise (AWGN) conditions. The simulation results showed that the video partition strategy results in a significantly higher quality of the reconstructed video data.  相似文献   

19.
This paper presents a unique set of techniques to support reliable and efficient video transmission over mobile channels. The transmission system is comprised of an M level Quadrature Amplitude Modulation (QAM) technique. A twin class uniform and non-uniform partitioned M-QAM system is used to transport a compressed video bitstream which is partitioned to match the bit-error sensitivity of the transmitted symbol in terms of mapping in the constellation diagram and picture quality. Video partitioning based on a separation of the Variable Length Coded (VLC) Discrete Cosine Transforms (DCT) coefficients within each block is considered for constant bitrate transmission (CBR). Various scenarios for splitting the bitstream are investigated and their results are compared and analysed thoroughly. The performance of the transmission system is evaluated under Additive White Gaussian Noise (AWGN) conditions. The simulation results showed that the video partition strategy results in a significantly higher quality of the reconstructed video data.  相似文献   

20.
目前大部分压缩域视频对象的分割方法主要面向MPEG系列视频标准,且算法建模复杂。为了解决这一问题,现提出了一种新的基于H.264/AVC的压缩域时空联合运动对象分割(TSMOS)算法。该方法主要利用压缩码流中的DCT系数和运动矢量信息进行对象分割,并首先利用相邻帧DCT系数之差提取运动对象轮廓,同时通过对轮廓进行形态学和抗噪声处理来得到粗糙的运动对象帧差掩码;然后采用运动向量归一化、噪声向量滤除、权值扩展向量中值(WEVM)滤波及前帧分割结果后向投影技术来得到对象的运动掩码;最后通过引入有效机制合并帧差掩码和运动掩码来分割运动对象。实验证明,该算法可取得较好的分割效果。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号