Found 20 similar articles (search time: 0 ms)
1.
In our earlier work, we proposed an architecture for a Web-based video database management system (VDBMS) providing an integrated support for spatiotemporal and semantic queries. In this paper, we focus on the task of spatiotemporal query processing and also propose an SQL-like video query language that has the capability to handle a broad range of spatiotemporal queries. The language is rule-based in that it allows users to express spatial conditions in terms of Prolog-type predicates. Spatiotemporal query processing is carried out in three main stages: query recognition, query decomposition, and query execution. Received: 11 October 2001, Accepted: 3 October 2003, Published online: 12 December 2003. Edited by: A. Buchmann
Correspondence to: Özgür Ulusoy. This work is supported by the Scientific and Research Council of Turkey (TÜBITAK) under Project Code 199E025. This work was done while the first author was at Bilkent University.
2.
Smart VideoText: a video data model based on conceptual graphs (Cited by: 2 total; self-citations: 0; citations by others: 2)
F. Kokkoras H. Jiang I. Vlahavas A.K. Elmagarmid E.N. Houstis W.G. Aref 《Multimedia Systems》2002,8(4):328-338
An intelligent annotation-based video data model called Smart VideoText is introduced. It utilizes the conceptual graph knowledge
representation formalism to capture the semantic associations among the concepts described in text annotations of video data.
The aim is to achieve more effective query, retrieval, and browsing capabilities based on the semantic content of video data.
Finally, a generic and modular video database architecture based on the Smart VideoText data model is described.
3.
The management of large collections of music data in a multimedia database has received much attention in the past few years. In the majority of current work, researchers extract features, such as melodies, rhythms, and chords, from the music data and develop indices that help to retrieve the relevant music quickly. Several reports have pointed out that these music features can be transformed and represented in the form of music feature strings or numeric values so that indices can be created for music retrieval. However, only a small number of existing approaches introduce multi-feature index structures for music queries, while most of the others develop single-feature indices. The existing music multi-feature index structures are memory-consuming and lack scalability. In this paper, we propose a two-tier music index structure, which is an efficient and scalable approach for multi-feature music indexing. Our experimental results show that this new approach outperforms existing multi-feature index schemes.
4.
Nicolas Moënne-Loccoz Bruno Janvier Stéphane Marchand-Maillet Eric Bruno 《Multimedia Tools and Applications》2006,31(3):309-325
Video document retrieval is now an active part of the domain of multimedia retrieval. However, unlike for other media, the management of a collection of video documents adds the problem of efficiently handling an overwhelming volume of temporal data. Challenges include balancing efficient content modeling and storage against fast access at various levels. In this paper, we detail the framework we have built to accommodate our developments in content-based multimedia retrieval. We show that our framework not only facilitates the development of processing and indexing algorithms but also opens the way to several other possibilities such as rapid interface prototyping or retrieval algorithm benchmarking. Here, we discuss our developments in relation to wider contexts such as MPEG-7 and the TREC Video Track. This work is funded by EU-FP6 IST-NoE SIMILAR and the Swiss NCCR IM2 (Interactive Multimodal Information Management).
5.
Guzen Erozel 《Information Sciences》2008,178(12):2534-2552
Video databases have become popular in various areas due to recent advances in technology. Video archive systems need user-friendly interfaces to retrieve video frames. In this paper, a user interface based on natural language processing (NLP) to a video database system is described. The video database is based on a content-based spatio-temporal video data model. The data model is focused on the semantic content, which includes objects, activities, and spatial properties of objects. Spatio-temporal relationships between video objects and also trajectories of moving objects can be queried with this data model. In this video database system, a natural language interface enables flexible querying. The queries, which are given as English sentences, are parsed using a link parser. The semantic representations of the queries are extracted from their syntactic structures using information extraction techniques. The extracted semantic representations are used to call the related parts of the underlying video database system to return the results of the queries. Not only exact matches but also similar objects and activities are returned from the database with the help of the conceptual ontology module. This module is implemented using a distance-based method of semantic similarity search on the semantic domain-independent ontology, WordNet.
6.
In this paper, we propose a multi-level abstraction mechanism for capturing the spatial and temporal semantics associated
with various objects in an input image or in a sequence of video frames. This abstraction can manifest itself effectively
in conceptualizing events and views in multimedia data as perceived by individual users. The objective is to provide an efficient
mechanism for handling content-based queries, with the minimum amount of processing performed on raw data during query evaluation.
We introduce a multi-level architecture for video data management at different levels of abstraction. The architecture facilitates
a multi-level indexing/searching mechanism. At the finest level of granularity, video data can be indexed based on mere appearance
of objects and faces. For management of information at higher levels of abstractions, an object-oriented paradigm is proposed
which is capable of supporting domain specific views.
7.
Yon Dohn Chung 《Information Sciences》2007,177(2):525-542
Wireless data broadcasting is a popular data delivery approach in mobile computing environments, where the broadcasting servers usually adopt indexing schemes for mobile clients to energy-efficiently access data on a wireless broadcast stream. However, conventional indexing schemes use primary key attribute values to construct tree structures. Therefore, these schemes do not support content-based retrieval queries such as partial-match queries and range-queries. This paper proposes an indexing method that supports content-based retrieval queries on a wireless data stream. The method uses a tree-structured index, called B2V-Tree, which is composed of bit-vectors that are generated from data records through multi-attribute hashing. Through analysis and experiments, the effectiveness of the proposed method is shown.
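The abstract above does not spell out how the bit-vectors are built, so the following is only a minimal sketch of the general idea of multi-attribute hashing for partial-match queries, not the B2V-Tree itself. The slice width, the use of SHA-256, and all function names are assumptions made for illustration.

```python
import hashlib

BITS_PER_ATTRIBUTE = 8  # hypothetical slice width, not taken from the paper


def attribute_bit(value, width=BITS_PER_ATTRIBUTE):
    """Hash one attribute value to a single set bit inside its own slice."""
    digest = hashlib.sha256(str(value).encode("utf-8")).digest()
    return 1 << (digest[0] % width)


def record_vector(record, width=BITS_PER_ATTRIBUTE):
    """Concatenate the per-attribute slices of a record into one bit-vector."""
    vector = 0
    for position, value in enumerate(record):
        vector |= attribute_bit(value, width) << (position * width)
    return vector


def query_vector(query, width=BITS_PER_ATTRIBUTE):
    """Build a query bit-vector; None marks an unspecified attribute."""
    vector = 0
    for position, value in enumerate(query):
        if value is not None:
            vector |= attribute_bit(value, width) << (position * width)
    return vector


def may_match(record_vec, query_vec):
    """True if every bit required by the query is set in the record.

    Hash collisions can cause false positives, never false negatives.
    """
    return (record_vec & query_vec) == query_vec
```

A partial-match query such as `("IBM", None, 2007)` leaves the second slice empty, so any record sharing the first and third attribute values passes the test; candidates still have to be checked against the actual records because of possible collisions.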
8.
9.
A compact multi-resolution index for variable length queries in time series databases (Cited by: 1 total; self-citations: 1; citations by others: 0)
We study the problem of searching similar patterns in time series data for variable length queries. Recently, a multi-resolution indexing technique (MRI) was proposed in (Kahveci and Singh, in proceedings of the international conference on data engineering, pp. 273–282, 2001; Kahveci and Singh, IEEE Trans Knowl Data Eng 16(4):418–433, 2004) to address this problem, which uses compression as an additional step to reduce the index size. In this paper, we propose an alternative technique, called compact MRI (CMRI), which uses the adaptive piecewise constant approximation (APCA) representation as a dimensionality reduction technique, and which occupies much less space without requiring compression. We implemented both MRI and CMRI, and conducted extensive experiments to evaluate and compare their performance on real stock data as well as synthetic data. Our results indicate that CMRI provides a much better precision ranging from 0.75 to 0.89 on real data, and from 0.80 to 0.95 on synthetic data, while for MRI, these ranges are from 0.16 to 0.34, and from 0.03 to 0.65, respectively. Compared to sequential scan, we found that CMRI is 4–30 times faster and the number of disk I/Os it required is close to minimal. In terms of storage utilization, CMRI occupies 1% of the memory occupied by MRI. These results and analysis show CMRI to be an efficient and scalable indexing technique for large time series databases.
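To make the dimensionality reduction step more concrete, here is a small sketch of a piecewise constant approximation. It uses equal-length segments, which is a simplification: APCA chooses segment boundaries adaptively, and the abstract does not describe how CMRI does this. The function name and the (mean, right endpoint) output layout are assumptions for illustration.

```python
def piecewise_constant(series, segments):
    """Approximate a series by the mean of each of `segments` equal-length parts.

    Returns a list of (segment_mean, right_end_index) pairs. Equal-length
    segments are a simplification; an adaptive method would pick the
    boundaries to minimise reconstruction error instead.
    """
    n = len(series)
    if not 1 <= segments <= n:
        raise ValueError("segments must be between 1 and len(series)")
    result = []
    for k in range(segments):
        start = k * n // segments
        end = (k + 1) * n // segments  # exclusive end of this segment
        chunk = series[start:end]
        result.append((sum(chunk) / len(chunk), end - 1))
    return result
```

For example, `piecewise_constant([1, 1, 5, 5, 9, 9], 3)` reduces six values to three pairs, one per flat stretch of the input.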
10.
11.
12.
Exploring video content structure for hierarchical summarization (Cited by: 4 total; self-citations: 0; citations by others: 4)
In this paper, we propose a hierarchical video summarization strategy that explores video content structure to provide the users with a scalable, multilevel video summary. First, video-shot-segmentation and keyframe-extraction algorithms are applied to parse video sequences into physical shots and discrete keyframes. Next, an affinity (self-correlation) matrix is constructed to merge visually similar shots into clusters (supergroups). Since video shots with high similarities do not necessarily imply that they belong to the same story unit, temporal information is adopted by merging temporally adjacent shots (within a specified distance) from the supergroup into each video group. A video-scene-detection algorithm is thus proposed to merge temporally or spatially correlated video groups into scenario units. This is followed by a scene-clustering algorithm that eliminates visual redundancy among the units. A hierarchical video content structure with increasing granularity is constructed from the clustered scenes, video scenes, and video groups to keyframes. Finally, we introduce a hierarchical video summarization scheme by executing various approaches at different levels of the video content hierarchy to statically or dynamically construct the video summary. Extensive experiments based on real-world videos have been performed to validate the effectiveness of the proposed approach. Published online: 15 September 2004
Correspondence to: Xingquan Zhu. This research has been supported by the NSF under grants 9972883-EIA, 9974255-IIS, 9983248-EIA, and 0209120-IIS, a grant from the state of Indiana 21st Century Fund, and by the U.S. Army Research Laboratory and the U.S. Army Research Office under grant DAAD19-02-1-0178.
13.
The need for content-based access to image and video information from media archives has captured the attention of researchers in recent years. Research efforts have led to the development of methods that provide access to image and video data. These methods have their roots in pattern recognition. The methods are used to determine the similarity in the visual information content extracted from low level features. These features are then clustered for generation of database indices. This paper presents a comprehensive survey on the use of these pattern recognition methods which enable image and video retrieval by content.
14.
15.
In a moving-object database system that supports continuous queries (CQ), an important problem is to keep the location data consistent with the actual locations of the entities being monitored, in order to produce correct query results. This goal is often difficult to achieve due to limited network resources. However, if an object is not required by any query, its value need not be refreshed. Based on this observation, we redefine the notion of temporal consistency of data items with respect to the query result, where only data items that are relevant to the CQs need to be fresh. To exploit this correctness definition, we develop an adaptive time-based update technique called query-result update (QRU). The advantage of this technique is that it identifies objects with different levels of significance to the correctness of query results. Locations of objects that have more impact on the query result are acquired more frequently than the ones that do not.
16.
17.
Most current video retrieval systems use shots as the basis for information organization and access. In cinematography, the scene is the basic story unit that directors use to compose and convey their ideas. This paper proposes a framework based on the concept of continuity to analyze video contents and extract scene boundaries. Starting from a set of shots, the framework successively applies the concept of visual, position, camera focal distance, motion, audio and semantic continuity to group shots that exhibit some form of continuity into scenes. The framework helps to explain the principles and the heuristics behind most cinematic rules. The idea is tested using the first three levels of continuity to extract the scenes defined using the most common cinematic rules. The method has been found to be effective.
18.
Luis Herranz 《Multimedia Systems》2007,13(2):103-118
Scalable video coding has become a key technology to deploy systems where the adaptation of content to diverse constrained
usage environments (such as PDAs, mobile phones and networks) is carried out in a simple and efficient way. Content-based
adaptation and summarization are fields that aim to provide improved adaptation to the user, trying to optimize the semantic
coverage in the adapted/summarized version. This paper proposes the integration of content analysis with the scalable video adaptation
paradigm. They must be fitted in such a way that the efficiency of scalable adaptation is not damaged. An integrated framework
is proposed for semantic video adaptation, as well as an adaptive skimming scheme that can use the results of semantic analysis.
They are described using the MPEG-21 DIA tools to provide the adaptation in a standard framework. Particularly, the case of
activity analysis is described to illustrate the integration of semantic analysis in the framework, and its use for online
content summarization and adaptation. Overall efficiency is achieved by means of computing activity using compressed domain
analysis with several metrics evaluated as measures of activity.
Work supported by the Ministerio de Ciencia y Tecnología of the Spanish Government under project TIN2004-07860 (MEDUSA) and
by the Comunidad de Madrid under project S-0505-TIC-0223 (PROMULTIDIS).
19.
This paper proposes a two-stage system for text detection in video images. In the first stage, text lines are detected based on the edge map of the image, resulting in a high recall rate at low computational cost. In the second stage, the result is refined using a sliding window and an SVM classifier trained on features obtained by a new Local Binary Pattern-based operator (eLBP) that describes the local edge distribution. The whole algorithm is used in a multiresolution fashion, enabling detection of characters for a broad size range. Experimental results, based on a new evaluation methodology, show the promising overall performance of the system on a challenging corpus, and prove the superior discriminating ability of the proposed feature set against the best features reported in the literature.
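For readers unfamiliar with Local Binary Patterns, the sketch below computes the classic 8-neighbour LBP code of a single 3x3 patch. It is not the eLBP operator proposed in the paper, whose definition the abstract does not give; the function name, the clockwise bit order, and the "greater than or equal" comparison are conventions assumed here.

```python
def lbp_code(patch):
    """Return the classic 8-bit LBP code of a 3x3 patch (list of 3 rows).

    Each neighbour contributes one bit, set when the neighbour is at least
    as bright as the centre pixel. Neighbours are visited clockwise starting
    at the top-left corner; the bit order is a convention, not a standard.
    """
    center = patch[1][1]
    neighbours = [
        patch[0][0], patch[0][1], patch[0][2],
        patch[1][2],
        patch[2][2], patch[2][1], patch[2][0],
        patch[1][0],
    ]
    code = 0
    for bit, value in enumerate(neighbours):
        if value >= center:
            code |= 1 << bit
    return code
```

A histogram of such codes over a sliding window is the usual kind of feature vector that would then be passed to a classifier such as an SVM.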
20.
Periodic broadcast is a cost-effective solution for large-scale distribution of popular videos. Regardless of the number of
video requests, this strategy guarantees a constant worst service latency to all clients, making it possible to serve a large
community with a minimal amount of broadcast bandwidth. Although many efficient periodic broadcast techniques have been proposed,
most of them impose rigid requirements on client receiving bandwidth. They either demand clients to have the same bandwidth
as the video server, or limit them to receive no more than two video streams at any one time. In our previous work, we addressed
this problem with a Client-Centric Approach (CCA). This scheme takes into consideration both server broadcast bandwidth and client receiving bandwidth and allows clients
to use all their receiving capability for prefetching broadcast data. As a result, given a fixed broadcast bandwidth, a shorter
broadcast period can be achieved with an improved client communication capability. In this paper, we present an enhanced version
of CCA to further leverage client bandwidth for more efficient video broadcast. The new scheme reduces the broadcast latency
by up to 50% as compared to CCA. We prove the correctness of this new technique and provide an analytical evaluation to show
its performance advantage as compared with some existing techniques.
Johnny Wong