首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
The purpose of this article is to analyze current vision-based systems from a soccer video semantic point of view such as video summarization, features analysis, and provision of augmented information. Currently, computer vision techniques are applicable in a challenging soccer context. Scene interpretation is performed based on the complexity of the semantic. For each area of vision-based systems, computer vision methodologies are analyzed along with their strengths and weaknesses. We have also investigated whether the existing approaches are equally applicable for real-time soccer video semantic analysis.  相似文献   

2.
As a special application of computer vision, automatic sports video analysis has been studied by some researchers. This sports video analysis via computer vision is a moderately challenging problem: it is more difficult than analyzing a video of a few laboratory members acting as in a simple scenario and is easier than analyzing a video of crowded people at a subway station. So the success of an analysis heavily depends on how much one can exploit the prior information on the sport and setting. The most challenging and important part would be the tracking of players (and ball). With a multi-camera system, 3D tracking is feasible which is much more meaningful than 2D tracking for the analysis. As an initial step of 3D player tracking from multi-view soccer videos, this paper deals with automatic initialization of player positions. Initial 3D positions can be estimated by exploiting some conditions of a soccer match. To make it robust, prior knowledge on the features of players is learnt by support vector machines (SVM). Experimental results show that the proposed system is efficient for general soccer sequences.  相似文献   

3.
近期, 体育视频分析中的广告牌商标的探测和识别方法已经广泛应用于许多其他领域, 比如商业电视. 基于此, 提出了一种能在不同体育视频(如足球、篮球和F1赛车等)中进行广告牌商标实时识别的算法, 该算法主要包括两个步骤, 首先, 利用基于模糊决策树的方法进行广告牌图像帧的探测; 其次, 利用颜色特征和局部SIFT (Scale-invariant feature transform)特征来描述不同商标的外观, 并最终通过基于潜在语义分析(Latent semantic analysis, LSA)的SIFT词汇匹配来识别所给定的商标模板. 初步的实验表明了本文算法的有效性, 并且该算法能在实时情况下运行.  相似文献   

4.
This paper presents a classified review of soccer video analysis works. The existing approaches in the aspects of highlight event detection, video summarization and retrieval based on video stream, ball and player tracking for provision of match statistics, technical and tactical analysis and application of different sources in soccer video analysis have been surveyed. In addition, some major existing commercial softwares developed for video analysis are introduced and compared. With regard to the existing challenge for automatic and realtime provision of video analysis, different computer vision approaches are discussed and compared. Audio, video and text feature extraction methods have been investigated and the future trends for improvement of the reviewed systems have been introduced in terms of response time optimization, increase of precision and eliminating the need of human intervention for video analysis.  相似文献   

5.
Using Webcast Text for Semantic Event Detection in Broadcast Sports Video   总被引:1,自引:0,他引:1  
Sports video semantic event detection is essential for sports video summarization and retrieval. Extensive research efforts have been devoted to this area in recent years. However, the existing sports video event detection approaches heavily rely on either video content itself, which face the difficulty of high-level semantic information extraction from video content using computer vision and image processing techniques, or manually generated video ontology, which is domain specific and difficult to be automatically aligned with the video content. In this paper, we present a novel approach for sports video semantic event detection based on analysis and alignment of webcast text and broadcast video. Webcast text is a text broadcast channel for sports game which is co-produced with the broadcast video and is easily obtained from the web. We first analyze webcast text to cluster and detect text events in an unsupervised way using probabilistic latent semantic analysis (pLSA). Based on the detected text event and video structure analysis, we employ a conditional random field model (CRFM) to align text event and video event by detecting event moment and event boundary in the video. Incorporation of webcast text into sports video analysis significantly facilitates sports video semantic event detection. We conducted experiments on 33 hours of soccer and basketball games for webcast analysis, broadcast video analysis and text/video semantic alignment. The results are encouraging and compared with the manually labeled ground truth.   相似文献   

6.
Automatic sport video analysis has became one of the most attractive research fields in the areas of computer vision and multimedia technologies. In particular, there has been a boom in soccer video analysis research. This paper presents a new multi-step algorithm to automatically detect the soccer ball in image sequences acquired from static cameras. In each image, candidate ball regions are selected by analyzing edge circularity and then ball patterns are extracted representing locally affine invariant regions around distinctive points which have been highlighted automatically. The effectiveness of the proposed methodologies is demonstrated through a huge number of experiments using real balls under challenging conditions, as well as a favorable comparison with some of the leading approaches from the literature.  相似文献   

7.
足球视频的结构分析与概要   总被引:3,自引:0,他引:3  
该文描述了一种有效的框架对足球视频进行结构分析,根据电影特征和对象特征生成视频概要。由于足球视频的特殊性,本文在镜头边界检测中采用分层检测的方法:象素点对的比较、颜色直方图和对象分割和跟踪技术。我们在镜头分类中对中远镜头的区分提出了新的方法。以慢动作回放镜头为标志,通过分析镜头间的关联规则生成视频概要。  相似文献   

8.
针对视频高层语义分析问题,文章结合足球比赛的领域知识,按照足球比赛转播,视频编辑的一般规律,根据足球比赛语义事件随机性的特点,选择特定的视频物理特征,应用 HMM (隐马尔科夫模型) 分析视频的语义结构,确定视频和HMM 模型中各元素的对应关系,构建一个基于HMM 的视频语义分析框架,并通过进行足球视频 HMM 参数的训练,得到视频各语义事件的 HMM 模型,达到视频语义自动分析的目的.  相似文献   

9.
Automatic composition of broadcast sports video   总被引:1,自引:0,他引:1  
This study examines an automatic broadcast soccer video composition system. The research is important as the ability to automatically compose broadcast sports video will not only improve broadcast video generation efficiency, but also provides the possibility to customize sports video broadcasting. We present a novel approach to the two major issues required in the system’s implementation, specifically the camera view selection/switching module and the automatic replay generation module. In our implementation, we use multi-modal framework to perform video content analysis, event and event boundary detection from the raw unedited main/sub-camera captures. This framework explores the possible cues using mid-level representations to bridge the gap between low-level features and high-level semantics. The video content analysis results are utilized for camera view selection/switching in the generated video composition, and the event detection results and mid-level representations are used to generate replays which are automatically inserted into the broadcast soccer video. Our experimental results are promising and found to be comparable to those generated by broadcast professionals.  相似文献   

10.
《Real》1999,5(5):295-304
This paper reports on tracking of multiple objects using color histogram backprojection and motion cues. Four tasks which facilitate this are discussed. The first is an adaptive color histogram backprojection (which builds upon the works of Swain and Ballard) and its application to tracking of multiple objects in video sequences. The second task is designing efficient fast blob detectors for selecting regions of interest in video sequences. The third is motion detection based on color histogram backprojection. Achieving these tasks led to multi-objects tracking. Various video sequences were used to demonstrate effective tracking of multiple objects. Notably, we created an interactive multiple objects tracker (CLICK-IT) which in its present form is set at three objects but can be extended easily. CLICK-IT (CSIRO Laboratory for Imaging by Content and Knowledge—Interactive Television) is a PC-based system which provides the user with an intelligent highlighter pen for sports action replay. It is intended as a truly interactive improvement on the drawing pad technology currently used for video annotation in sports broadcasting. The system uses computer vision techniques to focus attention and track particular objects (player(s), ball, horse(s), …) and semi-automatically annotate the dynamic scene. This paper describes the system including the user interface, the tracking technology based on color and motion information, and system performance evaluation in applications to surveillance-like sequences, running, rugby league football, basketball and soccer. Finally, video scene detection based on color histogram is discussed.  相似文献   

11.
During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television(CCTV)data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, and machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, video event identification and detection are two prominent ones. Comparing with the most popular frame-to-frame processing mode of most of today's approaches and systems, this project reorganizes video data as a 3D volume structure that provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then highlights the volume array structure in a so-called "pre-suspicion" mechanism for a later process. The focus of this report is the development of an effective and efficient voxel-based segmentation technique suitable to the volumetric nature of video events and ready for deployment in 3D clustering operations. The paper is concluded with a performance evaluation of the devised technique and discussion on the future work for accelerating the pre-processing of the original video data.  相似文献   

12.
基于子窗口区域的足球视频镜头分类   总被引:1,自引:1,他引:0       下载免费PDF全文
为了对海量视频数据进行有效的管理和快速浏览,急需对数字视频进行基于内容的视频检索。镜头分类是足球视频处理与检索的重要部分,针对目前现有足球镜头分类方法存在算法准确性不高或运算量过大的问题,提出了一种新的基于子窗口区域的镜头分类方法。该方法采用在HSV颜色空间中计算足球视频帧子窗口区域球场色像素比率,并辅以边缘信息的检测,对足球视频中的主镜头、中镜头、特写镜头和其他镜头进行了分类,实验结果表明该方法切实可行,具有很高的检出率和准确率。  相似文献   

13.
This paper presents a state of the art review of features extraction for soccer video summarization research. The all existing approaches with regard to event detection, video summarization based on video stream and application of text sources in event detection have been surveyed. As regard the current challenges for automatic and real time provision of summary videos, different computer vision approaches are discussed and compared. Audio, video feature extraction methods and their combination with textual methods have been investigated. Available commercial products are presented to better clarify the boundaries in this domain and future directions for improvement of existing systems have been suggested.  相似文献   

14.
多模态体育视频语义分析   总被引:3,自引:0,他引:3  
以足球运动为例提出了一种体育视频语义结构,并提出相应的语义分析框架。视频被分解为纯视频流和音频流两种模态,每种模态均可依次提取和综合出低层内容和中层内容。视频流可根据低层(物理)内容分割为物理镜头,然后根据特定的中间层内容可以确定为语法镜头。音频也可以在物理特征的基础上形成有意义的中间层内容,如解说员兴奋时的声音。最后,根据视频流和音频流的中间层内容,按照足球比赛转播的规律,分析出比赛中的精彩事件,并选取相关的镜头作为反映此事件的序列组合。  相似文献   

15.
Tactic analysis is an exciting and challenging problem in sport video analysis. The trajectories of ball and players convey rich tactic information, so the trajectory extraction and analysis are important for the soccer tactic analysis. Previous research on tactic analysis was generally based on finding object's mosaic trajectory which does not capture the rich semantic information of the real-world trajectory. In this paper, we propose a complete framework to systematically analyze soccer tactics. Specifically, we first propose an efficient real-world trajectory extraction method based on field line detection. Secondly, we define and recognize six typical soccer attack patterns for tactic analysis. With experiments on user study, the proposed method can improve the tactic analysis in terms of the conciseness, clarity, and usability.  相似文献   

16.
足球是最具世界性的体育运动之一,球迷遍布五大洲,因此在体育视频节目中足球备受广大观众青睐.在分析了足球视频特点的基础上,提出了一种基于基本语义单元合成Petri网的足球视频查询描述模型.该模型首先定义了一种类似文本字词集合的足球视频基本语义单元集合,在此基础上采用基本语义单元合成Petri网模型建立了一种足球查询语义的描述模型,并分别构建了进球、进攻、角球、犯规、换人等足球语义.初步的实验结果验证了该模型的有效性,并能推广至球类视频和其他体育视频.  相似文献   

17.
One of the main tasks of mobile robotics is vision. Lighting independence, adaptivity and automated learning are still the main issues when it comes to applications. In this article, we present an image understanding system and its methods targeting automatic, lighting-independent and reliable color-based object recognition under real time conditions. Its application test bed is global vision robot soccer (i.e. FIRA MiroSot und RoboCup Small Size leagues) but it has many other applications in color-based supervision of moving objects. Under typical conditions, it learns the objects of recognition automatically, has zero setup time and tolerates environmental changes during run-time.  相似文献   

18.
Learning identity with radial basis function networks   总被引:11,自引:0,他引:11  
Radial basis function (RBF) networks are compared with other neural network techniques on a face recognition task for applications involving identification of individuals using low-resolution video information. The RBF networks are shown to exhibit useful shift, scale and pose (y-axis head rotation) invariance after training when the input representation is made to mimic the receptive field functions found in early stages of the human vision system. In particular, representations based on difference of Gaussian (DoG) filtering and Gabor wavelet analysis are compared. Extensions of the techniques to the case of image sequence analysis are described and a time delay (TD) RBF network is used for recognising simple movement-based gestures. Finally, we discuss how these techniques can be used in real-life applications that require recognition of faces and gestures using low-resolution video images.  相似文献   

19.
为了更好地满足用户浏览和检索视频的需要,提出一种融合文本的足球视频事件分析框架.分别从文本和视频中提取事件信息,采用动态规划的算法对2种信息进行全局匹配,对于未匹配的文本事件信息.采用一个全局概率模型估计其在视频中的事件边界.通过寻找文本与视频事件信息的最优全局匹配,有效地避免了局部匹配方法造成的漏检和误检.实验结果表明,文中方法能够快速、准确地检测事件,获得详尽的事件内容信息,性能优于局部匹配算法.  相似文献   

20.
HMM模型具有良好的适应性,可以自动学习,对预测随机时序数据性能良好。场景是足球视频的基本特征,场景的转换体现了足球视频的摄制、编辑模式,表现了足球视频的语义。提出了一种基于场景分析和HMM的视频语义分析框架,用于识别足球视频中的一些语义事件。为了克服以往基于主颜色和其他底层特征的视频场景分析中存在的较大误差,又提出基于视觉注意模型对足球视频中的场景进行分析。实验结果表明,基于场景分析和HMM的事件识别方法对足球视频中的任意球事件有良好的识别效果  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号