首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Video shot boundary detection (SBD) is a fundamental step in automatic video content analysis toward video indexing, summarization and retrieval. Despite the beneficial previous works in the literature, reliable detection of video shots is still a challenging issue with many unsolved problems. In this paper, we focus on the problem of hard cut detection and propose an automatic algorithm in order to accurately determine abrupt transitions from video. We suggest a fuzzy rule-based scene cut identification approach in which a set of fuzzy rules are evaluated to detect cuts. The main advantage of the proposed method is that, we incorporate spatial and temporal features to describe video frames, and model cut situations according to temporal dependency of video frames as a set of fuzzy rules. Also, while existing cut detection algorithms are mainly threshold dependent; our method identifies cut transitions using a fuzzy logic which is more flexible. The proposed algorithm is evaluated on a variety of video sequences from different genres. Experimental results, in comparison with the most standard cut detection algorithms confirm our method is more robust to object and camera movements as well as illumination changes.  相似文献   

2.
基于运动信息的镜头切变检测算法   总被引:2,自引:0,他引:2  
镜头检测算法的研究是视频分析与检索中较为活跃的研究领域之一。该文提出了一种基于运动信息的镜头切变检测算法,该算法采用四叉树算法分析视频序列的运动信息,进而提取基于运动信息的特征矢量,由提取的特征矢量可以检测出视频序列中的镜头切变点。实验表明该算法是有效的,可行的,并且具有非常强的抗干扰能力。  相似文献   

3.
一种基于有限自动机的渐变镜头检测算法   总被引:3,自引:0,他引:3  
渐变镜头检测算法分为两个方面:渐变边界帧的判定和边界帧的组合。前者判断某一帧是否符合渐变边界帧的每件,后者判断一段包含边界帧的视像是否是渐变。以往的算法侧重解决边界帧的判定,忽视了边界帧的组合。本文定义了渐变检测容忍度的概念,并提出了一种基于有限自动机的渐变镜头检测方法,利用了自动机多状态的记忆性,提高了算法的适应性和鲁棒性。在TRECVID2004的SBD项目中,本渐变镜头检测系统取得了渐变检测性能第一的好成绩。  相似文献   

4.
为了将视频分割成镜头,目前的方法都是提取某些特征然后构造不同的相异性函数。然而,太多的特征就会降低镜头分割算法的效率。因此,有必要对每一个镜头检测决策进行特征约简。基于此,提出了基于粗糙集和模糊聚类的分类方法并得到了相应的决策规则。针对新闻场景的特殊性,将镜头分割成突变过渡、渐变过渡以及无场景变化3类。用超过2个小时的新闻视频所做的实验获得了96.5%的查全率和97.9%的准确率。  相似文献   

5.
This paper describes a fully automatic content-based approach for browsing and retrieval of MPEG-2 compressed video. The first step of the approach is the detection of shot boundaries based on motion vectors available from the compressed video stream. The next step involves the construction of a scene tree from the shots obtained earlier. The scene tree is shown to capture some semantic information as well as to provide a construct for hierarchical browsing of compressed videos. Finally, we build a new model for video similarity based on global as well as local motion associated with each node in the scene tree. To this end, we propose new approaches to camera motion and object motion estimation. The experimental results demonstrate that the integration of the above techniques results in an efficient framework for browsing and searching large video databases.  相似文献   

6.
The purpose of video segmentation is to segment video sequence into shots where each shot represents a sequence of frames having the same contents, and then select key frames from each shot for indexing. Existing video segmentation methods can be classified into two groups: the shot change detection (SCD) approach for which thresholds have to be pre-assigned, and the clustering approach for which a prior knowledge of the number of clusters is required. In this paper, we propose a video segmentation method using a histogram-based fuzzy c-means (HBFCM) clustering algorithm. This algorithm is a hybrid of the two approaches aforementioned, and is designed to overcome the drawbacks of both approaches. The HBFCM clustering algorithm is composed of three phases: the feature extraction phase, the clustering phase, and the key-frame selection phase. In the first phase, differences between color histogram are extracted as features. In the second phase, the fuzzy c-means (FCM) is used to group features into three clusters: the shot change (SC) cluster, the suspected shot change (SSC) cluster, and the no shot change (NSC) cluster. In the last phase, shot change frames are identified from the SC and the SSC, and then used to segment video sequences into shots. Finally, key frames are selected from each shot. Simulation results indicate that the HBFCM clustering algorithm is robust and applicable to various types of video sequences.  相似文献   

7.
Temporal video segmentation and classification of edit effects   总被引:1,自引:0,他引:1  
The process of shot break detection is a fundamental component in automatic video indexing, editing and archiving. This paper introduces a novel approach to the detection and classification of shot transitions in video sequences including cuts, fades and dissolves. It uses the average inter-frame correlation coefficient and block-based motion estimation to track image blocks through the video sequence and to distinguish changes caused by shot transitions from those caused by camera and object motion. We present a number of experiments in which we achieve better results compared with two established techniques.  相似文献   

8.
胡新韬  郭雷  任建峰 《计算机应用》2005,25(6):1302-1304
如何在压缩域进行镜头的切变检测一直是视频自动索引和检索中的难点。提出了一种MPEG压缩域多尺度镜头切变检测算法,在GOP、slot和B帧三个尺度上对MPEG视频流进行分析。通过对相邻I帧的检测,确定一个GOP中是否存在镜头切变;通过对slot的分析,确定镜头切变在GOP中所处的区域;通过对B帧的检测,确定镜头切变发生的确切位置。  相似文献   

9.
根据视频语义分析和视频摘要等应用对于视频数据结构化的需求,提出了一种针对足球视频的镜头分类方法.通过logo模板匹配检测并定位出视频中的慢镜头,对其余的正常比赛部分做镜头边界检测完成视频切分.基于分块的思想,对正常比赛镜头帧计算其各块的场地像素比率值作为特征,利用SVM分类器将正常比赛镜头分为远镜头、中镜头、球员特写或场外镜头3类.至此,整个视频流可以表示为结构化的四类镜头类型标示序列.实验结果表明,该方法在视频切分和镜头类型识别的准确性方面具有良好的效果.  相似文献   

10.
结合标签传递的镜头边界检测与分类   总被引:1,自引:0,他引:1       下载免费PDF全文
镜头是视频的基本组成单元,其自动检测与分类是视频分析的重要任务。为了有效利用视频流视觉上的感知特性,提出一种基于标签传递的镜头边界检测与分类算法。该算法利用半监督学习的标签传递机制,通过视频流中连续多帧之间的相关性,将预先构造的初始状态标签通过相关图不断传递,以揭示不同镜头变化类型的视觉感知特征。然后利用多类SVM分类器进行镜头类型分类。实验结果表明,本文算法能有效识别多种镜头类型,对视频分析、检索等具有一定实用价值。  相似文献   

11.
视频摘要是视频内容的一种压缩表示方式。为了能够更好地浏览视频,提出了一种根据浏览或检索的粒度不同来建立两种层次视频摘要(镜头级和场景级)的思想,并给出了一种视频摘要生成方法:首先用一种根据内容变化自动提取镜头内关键帧的方法来实现关键帧的提取;继而用一种改进的时间自适应算法通过镜头的组合来得到场景;最后在场景级用最小生成树方法提取代表帧。由于关键帧和代表帧分别代表了它们所在镜头和场景的主要内容,因此它们的序列就构成了视频总结。一些电影视频片段检验的实验结果表明,这种生成方法能够较好地提供粗细两种粒度的视频内容总结。  相似文献   

12.
针对如何在镜头基础上进行聚类,以得到更高层次的场景问题,提出了一个基于语义的场景分割算法。该算法首先将视频分割为镜头,并提取镜头的关键帧。然后计算关键帧的颜色直方图和MPEG-7边缘直方图,以形成关键帧的特征;接着利用镜头关键帧的颜色和纹理特征对支持向量机(SVM)进行训练来构造7个基于SVM对应不同语义概念的分类器,并利用它们对要进行场景分割的视频镜头关键帧进行分类,以得到关键帧的语义。并根据关键帧包含的语义概念形成了其语义概念矢量,最后根据语义概念矢量通过对镜头关键帧进行聚类来得到场景。另外.为提取场景关键帧,还构建了镜头选择函数,并根据该函数值的大小来选择场景的关键帧。实验结果表明,该场景分割算法与Hanjalic的方法相比,查准率和查全率分别提高了34.7%和9.1%。  相似文献   

13.
《Real》1999,5(4):231-241
In order to provide sophisticated access methods to the contents of video servers, it is necessary to automatically process and represent each video through a number of visual indexes. We focus on two tasks, namely the hierarchical representation of a video as a sequence of uniform segments (shots), and the characterization of each shot by a vector describing the camera motion parameters. For the first task we use a Bayesian classification approach to detecting scene cuts by analysing motion vectors. Adaptability to different compression qualities is achieved by learning different classification masks. For the second task, the optical flow is processed in order to distinguish between stationary and moving shots. A least-squares fitting procedure determines the pan/tilt/zoom camera parameters within shots that present regular motion. Each shot is then indexed by a vector representing the dominant motion components and the type of motion. In order to maximize processing speed, all techniques directly process and analyse MPEG-1 motion vectors, without the need for video decompression. An overall processing rate of 59 frames/s is achieved on software. The successful classification performance, evaluated on various news video clips for a total of 61 023 frames, attains 97.7% for the shot segmentation, 88.4% for the stationary vs. moving shot classification, and 94.7% for the detailed camera motion characterization.  相似文献   

14.
An Accumulation Algorithm for Video Shot Boundary Detection   总被引:5,自引:0,他引:5  
In this paper, an accumulation algorithm for video shot detection is introduced. The algorithm considers the properties of gradual transition. In a gradual transition, there is only a small difference between consecutive frames. The algorithm remembers the differences between consecutive frames and accumulates them. When the accumulation difference exceeds a threshold, an occurrence of shot transition is declared. Our main contributions are to introduce a frame C that remembers the changes from the beginning of a shot and detect the different types of boundaries (cut, fade, dissolve) at one process. We tested our algorithm with clips extracted from MPEG VCDs. The algorithm showed a good performance in detecting the gradual transitions as well as the abrupt cuts and has the ability to identify different types of boundaries.  相似文献   

15.
Using string matching to detect video transitions   总被引:2,自引:0,他引:2  
The detection of shot boundaries in videos captures the structure of the image sequences by the identification of transitional effects. This task is important in the video indexing and retrieval domain. The video slice or visual rhythm is a single two-dimensional image sampling that has been used to detect several types of video events, including transitions. We use the longest common subsequence (LCS) between two strings to transform the video slice into one-dimensional signals obtaining a highly simplified representation of the video content. We also developed a chain of mathematical morphology operations over these signals leading to the detection of the most frequent video transitions, namely, cut, fade, and wipe. The algorithms are tested with success with various genres of videos.  相似文献   

16.
The fundamental step in video content analysis is the temporal segmentation of video stream into shots, which is known as Shot Boundary Detection (SBD). The sudden transition from one shot to another is known as Abrupt Transition (AT), whereas if the transition occurs over several frames, it is called Gradual Transition (GT). A unified framework for the simultaneous detection of both AT and GT have been proposed in this article. The proposed method uses the multiscale geometric analysis of Non-Subsampled Contourlet Transform (NSCT) for feature extraction from the video frames. The dimension of the feature vectors generated using NSCT is reduced through principal component analysis to simultaneously achieve computational efficiency and performance improvement. Finally, cost efficient Least Squares Support Vector Machine (LS-SVM) classifier is used to classify the frames of a given video sequence based on the feature vectors into No-Transition (NT), AT and GT classes. A novel efficient method of training set generation is also proposed which not only reduces the training time but also improves the performance. The performance of the proposed technique is compared with several state-of-the-art SBD methods on TRECVID 2007 and TRECVID 2001 test data. The empirical results show the effectiveness of the proposed algorithm.  相似文献   

17.
Shot Change Detection via Local Keypoint Matching   总被引:1,自引:0,他引:1  
Shot change detection is an essential step in video content analysis. However, automatic shot change detection often suffers from high false detection rates due to camera or object movements. To solve this problem, we propose an approach based on local keypoint matching of video frames. This approach aims to detect both abrupt and gradual transitions between shots without modeling different kinds of transitions. Our experiment results show that the proposed algorithm is effective for most kinds of shot changes.   相似文献   

18.
章亦葵  赵晖 《计算机应用》2014,34(11):3327-3331
针对视频镜头边界检测的高时耗问题,提出了一种基于视频预处理的视频镜头边界检测(SBD)改进算法。通过使用自适应的阈值选择可能包含镜头边界的候选段,候选段内首帧与其余各帧进行相似度对比检测出镜头起始帧,并立即检测切变。若候选段中不包含切变,则进行渐变检测。调整候选段以保证镜头边界位于同一段内,段内其余各帧与起始帧进行相似度对比确定镜头结束帧。实验结果表明,所提算法镜头边界识别准确率能够达到90%以上,且与倒三角模式匹配方法相比能够节约时间15.6%~30.2%;与对渐变和切变分别检测的算法相比,该算法能够在满足识别率的基础上提升检测速度。  相似文献   

19.
利用改进NFL算法对镜头进行基于内容的检索   总被引:9,自引:1,他引:9  
基于镜头的分类和检索对于视频库的管理和查询非常重要.将“最近特征线”法(nearest feature line,简称NFL)用于镜头的分类和检索.将镜头中的代表帧看做是某个特征空间中的点,通过这些点间的连线表征该镜头的总体特征信息,然后计算查询图像和特征线的距离,以决定镜头与查询图像的相似度.为了更适于视频数据,对原来的NFL方法进行了改进,基于镜头内部内容活动程度对特征线进行限制、实验结果表明,改进的NFL方法比传统的NFL方法以及常用的聚类万法,如最近邻法(nearest neighbor,简称NN)和最近中心法(nearest center,简称NC),在性能上有所提高.  相似文献   

20.
提出了一种新的基于边缘特征的帧间相似性度量方法,并在此基础上实现了一个实时的镜头切变检测算法。为了降低基于特征的算法的运算复杂度,该算法采用一种快速的边缘模式分类方法从部分解码的码流中提取视频帧的边缘特征,通过考察相邻帧边缘分布的相似性定义了一种反映局部信息的帧间相似性度量。结合反映全局特征的基于彩色直方图的相似性的度量和改进的滑动窗算法,实现了高性能的镜头边缘检测。相对于现有的基于特征的算法,该算法具有更低的运算复杂度,适合实时应用。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号