首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 218 毫秒
1.
We present here a theory of motion from a topological point of view, in a symbolic perspective. Taking space–time histories of objects as primitive entities, we introduce temporal and topological relations on the thus defined space–time to characterize classes of spatial changes. The theory thus accounts for qualitative spatial information, dealing with underspecified, symbolic information when accurate data are not available or unnecessary. We show that these structures give a basis for commonsense spatio–temporal reasoning by presenting a number of significant deductions in the theory. This can serve as a formal basis for languages describing motion events in a qualitative way.  相似文献   

2.
This paper presents a new spatio-temporal motion estimation algorithm and its VLSI architecture for video coding based on algorithm and architecture co-design methodology. The algorithm consists of the new strategies of spatio-temporal motion vector prediction, modified one-at-a-time search scheme, and multiple update paths derived from optimization theory. The hardware specification is for high-definition video coding. We applied the ME algorithm to H.264 reference software. Our algorithm surpasses recently published research and achieves close performance to full search. The VLSI implementation proves the low cost feature of our algorithm. The algorithm and architecture co-design concept is highly emphasized in this paper. We provide some quantitative example to show the necessity of algorithm and architecture co-design  相似文献   

3.
Distant microphones permit to process spontaneous multiparty speech with very little constraints on speakers, as opposed to close-talking microphones. Minimizing the constraints on speakers permits a large diversity of applications, including meeting summarization and browsing, surveillance, hearing aids, and more natural human-machine interaction. Such applications of distant microphones require to determine where and when the speakers are talking. This is inherently a multisource problem, because of background noise sources, as well as the natural tendency of multiple speakers to talk over each other. Moreover, spontaneous speech utterances are highly discontinuous, which makes it difficult to track the multiple speakers with classical filtering approaches, such as Kalman filtering of particle filters. As an alternative, this paper proposes a probabilistic framework to determine the trajectories of multiple moving speakers in the short-term only, i.e., only while they speak. Instantaneous location estimates that are close in space and time are grouped into ldquoshort-term clustersrdquo in a principled manner. Each short-term cluster determines the precise start and end times of an utterance and a short-term spatial trajectory. Contrastive experiments clearly show the benefit of using short-term clustering, on real indoor recordings with seated speakers in meetings, as well as multiple moving speakers.  相似文献   

4.
This paper derives two spatio-temporal extensions of the well-known FastICA algorithm of Hyvarinen and Oja that are applicable to the convolutive blind source separation task. Our time-domain algorithms combine multichannel spatio-temporal prewhitening via multistage least-squares linear prediction with novel adaptive procedures that impose paraunitary constraints on the multichannel separation filter. The techniques converge quickly to a separation solution without any step size selection or divergence difficulties, and unlike other methods, ours do not require special coefficient initialization procedures to obtain good separation performance. They also allow for the efficient reconstruction of individual signals as observed in the sensor measurements directly from the system parameters for single-input multiple-output blind source separation tasks. An analysis of one of the adaptive constraint procedures shows its fast convergence to a paraunitary filter bank solution. Numerical evaluations of the proposed algorithms and comparisons with several existing convolutive blind source separation techniques indicate the excellent relative performance of the proposed methods.  相似文献   

5.
图像感知哈希技术是一门较新型的技术,哈希提取过程的关键步骤是特征提取,传统的基于DCT变换的感知哈希技术抗几何攻击能力较差。试图在图像生成哈希之前,将几何形变对图像的影响去除,首先将图像进行正则化,使图像具有几何不变性,然后再进行DCT特征系数提取,最后通过量化、编码生成最终哈希。该算法可以抵抗任意的仿射变换。  相似文献   

6.
基于小波变换的视频压缩新方法   总被引:1,自引:0,他引:1  
提出了以小波变换为核心技术的视频压缩系统的具体实现方法,设计了系统的整体框架,对各个模块都提出了改进和具体实现的算法。对于帧内图像的编码,提出了Mallat小波变换的改进延拓算法,加入了感觉加权量化。对于帧间图像的编码,针对低码率电视会议的特点改进了三步搜索法,在保持低复杂度的基础上提高了搜索的精度,同时为了提高编码效率,对运动补偿后的残差图像也进行小波编码。实验表明,这种算法的性能优于传统的视频压缩方法。  相似文献   

7.
提出一种基于鲁棒Hash的视频拷贝检测方案.通过对特征点进行分类,选取在时空域上持久存在的稳定点,对邻域点进行微分计算构造局部特征.将多维特征数据进行Hilbert编码,并选取有效位作为检测Hash码.为了准确的在目标视频中定位可疑内容,提出了Hash匹配方案,将序列相似度作为匹配的依据,提高匹配精度.实验结果表明本方案拥有较好检测性能,适用于视频内容的拷贝检测.  相似文献   

8.
Hashing is a common solution for content-based multimedia retrieval by encoding high-dimensional feature vectors into short binary codes. Previous works mainly focus on image hashing problem. However, these methods can not be directly used for video hashing, as videos contain not only spatial structure within each frame, but also temporal correlation between successive frames. Several researchers proposed to handle this by encoding the extracted key frames, but these frame-based methods are time-consuming in real applications. Other researchers proposed to characterize the video by averaging the spatial features of frames and then the existing hashing methods can be adopted. Unfortunately, the sort of “video” features does not take the correlation between frames into consideration and may lead to the loss of the temporal information. Therefore, in this paper, we propose a novel unsupervised video hashing framework via deep neural network, which performs video hashing by incorporating the temporal structure as well as the conventional spatial structure. Specially, the spatial features of videos are obtained by utilizing convolutional neural network, and the temporal features are established via long-short term memory. After that, the time series pooling strategy is employed to obtain the single feature vector for each video. The obtained spatio-temporal feature can be applied to many existing unsupervised hashing methods. Experimental results on two real datasets indicate that by employing the spatio-temporal features, our hashing method significantly improves the performance of existing methods which only deploy the spatial features, and meanwhile obtains higher mean average precision compared with the state-of-the-art video hashing methods.  相似文献   

9.
姚涛  阙大顺  王刚  于泓 《计算机工程》2011,37(8):143-145
提出一种基于Contourlet变换的双重视频水印方案。对MPEG系统中的I帧进行Contourlet变换,利用二维离散超混沌系统产生的超混沌序列对水印图像进行置乱和加密处理。在低频域重要系数嵌入破坏原始视频水印,使其失去应用价值,同时在高频域嵌入鲁棒水印,实现版权保护。实验结果表明,只有得到授权的用户才能获得高保真的视频信息,版权保护水印提取时不需要原始视频信号。  相似文献   

10.
基于DirectShow设计视频加密Transform Filter   总被引:4,自引:0,他引:4  
实现了基于DirectShow技术开发视频加密TransformFilter。首先介绍了视频加密的原理和方案,分析了Di-rectShow的体系结构,然后研究了基于DirectShow技术如何实现视频加密TransformFilter,并给出了试验结果和结论。试验结果表明该Filter可以对实时视频进行加密,加密速度快,效果好。  相似文献   

11.
文静  韩叶飞 《计算机工程》2010,36(19):239-240,243
针对视频水印系统的安全性问题,提出一种基于三维小波变换的视频散列算法。该算法对预处理后的三维视频序列依次进行行方向一维小波变换、列方向一维小波变换和时间方向一维小波变换,完成视频序列的一层三维离散小波变换,并给出散列计算结果。仿真实验证明,该算法对帧内像素移动攻击及随机帧抖动攻击具有较强的鲁棒性。  相似文献   

12.
相丽  潘峰  杨晓元  钮可 《计算机工程》2010,36(23):136-138
结合离散余弦变换(DCT)的能量压缩能力和离散小波变换(DWT)的多分辨率特性,提出一种基于准三维变换的视频隐写算法。该算法利用二维小波变换描述视频序列的帧内信息,通过一维离散余弦变换描述视频序列的帧间信息,从而有效利用视频序列的空间及时间冗余,将秘密信息嵌入在DCT系数的量化过程中,保证了视频序列的稳定性及秘密信息提取的准确性。实验结果表明,该算法具有较好的视觉不可见性与较高的嵌入容量,且可实现秘密信息的盲提取。  相似文献   

13.
提出了一种人体运动姿态视频检索的新方法,整体算法分为典型姿态学习和姿态检索两个阶段。首先提取样本库中人体姿态的时空运动特征点作为姿态运动底层特征,一个姿态对应一个时空特征点集合;计算每个特征点的时空三维邻域中像素的梯度,进而为每个姿态建立一个梯度直方图;其次,采用非监督的聚类方法对姿态样本归类,按照语义要求提取多个典型姿态;最后,用基于EM的高斯混合模型对聚类结果建模,形成典型姿态检索的分类器,完成姿态建模的的学习阶段。运动姿态的视频检索是根据最大概率匹配准则,对输入的测试视频进行姿态匹配,从而实现基于语义的姿态检索。基于Weizmann和KTH标准测试视频库的大量实验结果表明,本文提出的方法能够准确有效地检索人体运动姿态。  相似文献   

14.
随着Internet和无线通信的飞速发展,人们在网络上实时获取视频数据已经成为可能.由于传输网络和接收终端的多样性,所以需要视频流能适应多种不同传输、解码和显示的要求,由此产生了可分级视频编码.文中在可分级视频编码的基础上对其进行改进,把感兴趣区域的检测和小波变换应用到可分级编码中,该方法中对视频流进行感兴趣区域检测,利用小波变换对增强层中的感兴趣区域进行编码,由于小波变换具有良好的空间方向选择性,与人眼的视觉特性十分吻合,从而得到很好的效果.  相似文献   

15.
随着Internet和无线通信的飞速发展,人们在网络上实时获取视频数据已经成为可能。由于传输网络和接收终端的多样性,所以需要视频流能适应多种不同传输、解码和显示的要求,由此产生了可分级视频编码。文中在可分级视频编码的基础上对其进行改进,把感兴趣区域的检测和小波变换应用到可分级编码中,该方法中对视频流进行感兴趣区域检测,利用小波变换对增强层中的感兴趣区域进行编码,由于小波变换具有良好的空间方向选择性,与人眼的视觉特性十分吻合,从而得到很好的效果。  相似文献   

16.
基于特征匹配与仿射变换的视频防抖算法   总被引:1,自引:0,他引:1  
庞洵  李威  高小伟 《微计算机信息》2008,24(12):180-182
本文提出了一种利用提取尺度不变的特征点的算法,根据视频图像的特点,建立独特的快速视频防抖的仿射变换模型,从而实现视频图像的高可靠性防抖的算法.由这种算法编写的软件可以达到消除摄像机随机抖动带来的运动噪声的效果.另外,本文还提出了用来提高处理速度的实用方案.实验表明该算法具有良好的防抖效果,能够用于视频图像的实时处理.  相似文献   

17.
在视频水印的评价标准中,鲁棒性和透明性是一对非常重要的性能指标,同时也是一对相互制约的指标。已有的视频水印算法对上述2个指标都采用折衷的方法,为保证透明性通常不能兼顾水印的鲁棒性。为解决上述问题,提出一种可移除数字视频水印算法。在嵌入水印时,不限制水印嵌入的强度,以保证水印信息的鲁棒性,而嵌入后的视频在播放时,需要经过一个水印移除的过程,移除水印后的视频在视觉效果上与原视频接近或完全一致,以满足水印的透明性。同时,用户需提供合法的密钥,以保证水印移除的正确性,并提高水印算法的安全性。测试结果表明,该算法对分辨率放缩、视频重编码及码率压缩的操作具有较强的鲁棒性。  相似文献   

18.
构建基于离散小波的多视角编码系统。对多视角视频进行四维小波分解,包括一维时间分解、一维视角分解和二维空间分解。利用小波的多分辨特性和嵌入式编码器时间、空间、视角及质量可伸缩的特点,在一定带宽限制下,采用率失真优化的方法对各个子带的码率进行优化截断。实验结果表明,该编解码系统具有较好的视频重建图像质量。  相似文献   

19.
为克服在图像上直接使用快速离散Curvelet变换不能完全提取有用特征信息的缺点,利用HSI颜色空间各通道互不相关的特点,结合快速离散Curvelet变换,提出一种新的视频人脸识别方法,并设计一个视频人脸识别系统以证明该方法的有效性。实验结果表明,该系统只需2 s (35帧/s)即可识别人脸,正确识别率达99.5%,对表情有一定的鲁棒性,可在  相似文献   

20.
基于方向提升小波变换的多描述视频编码   总被引:1,自引:0,他引:1       下载免费PDF全文
陈锦春  蔡灿辉 《计算机工程》2009,35(22):162-164
提出一种基于方向提升小波变换的多描述可分级视频编码方法,该方法能增强视频编码的容错性和网络自适应性。对视频序列中的每一帧图像按梅花下采样产生2个子序列,对每个子序列分别进行三维小波可分级视频编码形成2个描述。为解决传统二维小波变换方法对梅花下采样图像编码效率低的问题,引入方向提升小波变换对图像进行空域分解,提出5模式方向提升小波变换以进一步提高编码效率。实验结果证明了该方法的有效性。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号