共查询到20条相似文献,搜索用时 0 毫秒
1.
Xiangmin Zhou Lei Chen 《The VLDB Journal The International Journal on Very Large Data Bases》2014,23(3):381-400
In recent years, microblogs have become an important source for reporting real-world events. A real-world occurrence reported in microblogs is also called a social event. Social events may hold critical materials that describe the situations during a crisis. In real applications, such as crisis management and decision making, monitoring the critical events over social streams will enable watch officers to analyze a whole situation that is a composite event, and make the right decision based on the detailed contexts such as what is happening, where an event is happening, and who are involved. Although there has been significant research effort on detecting a target event in social networks based on a single source, in crisis, we often want to analyze the composite events contributed by different social users. So far, the problem of integrating ambiguous views from different users is not well investigated. To address this issue, we propose a novel framework to detect composite social events over streams, which fully exploits the information of social data over multiple dimensions. Specifically, we first propose a graphical model called location-time constrained topic (LTT) to capture the content, time, and location of social messages. Using LTT, a social message is represented as a probability distribution over a set of topics by inference, and the similarity between two messages is measured by the distance between their distributions. Then, the events are identified by conducting efficient similarity joins over social media streams. To accelerate the similarity join, we also propose a variable dimensional extendible hash over social streams. We have conducted extensive experiments to prove the high effectiveness and efficiency of the proposed approach. 相似文献
2.
Xingquan Zhu Wei Ding Philip S. Yu Chengqi Zhang 《Knowledge and Information Systems》2011,28(3):523-553
In this paper, we formulate a new research problem of concept learning and summarization for one-class data streams. The main
objectives are to (1) allow users to label instance groups, instead of single instances, as positive samples for learning,
and (2) summarize concepts labeled by users over the whole stream. The employment of the batch-labeling raises serious issues
for stream-oriented concept learning and summarization, because a labeled instance group may contain non-positive samples
and users may change their labeling interests at any time. As a result, so the positive samples labeled by users, over the
whole stream, may be inconsistent and contain multiple concepts. To resolve these issues, we propose a one-class learning
and summarization (OCLS) framework with two major components. In the first component, we propose a vague one-class learning
(VOCL) module for concept learning from data streams using an ensemble of classifiers with instance level and classifier level
weighting strategies. In the second component, we propose a one-class concept summarization (OCCS) module that uses clustering
techniques and a Markov model to summarize concepts labeled by users, with only one scanning of the stream data. Experimental
results on synthetic and real-world data streams demonstrate that the proposed VOCL module outperforms its peers for learning
concepts from vaguely labeled stream data. The OCCS module is also able to rebuild a high-level summary for concepts marked
by users over the stream. 相似文献
3.
Event summarization is a task to generate a single, concise textual representation of an event. This task does not consider multiple development phases in an event. However, news articles related to long and complicated events often involve multiple phases. Thus, traditional approaches for event summarization generally have difficulty in capturing event phases in summarization effectively. In this paper, we define the task of Event Phase Oriented News Summarization (EPONS). In this approach, we assume that a summary contains multiple timelines, each corresponding to an event phase. We model the semantic relations of news articles via a graph model called Temporal Content Coherence Graph. A structural clustering algorithm EPCluster is designed to separate news articles into several groups corresponding to event phases. We apply a vertex-reinforced random walk to rank news articles. The ranking results are further used to create timelines. Extensive experiments conducted on multiple datasets show the effectiveness of our approach. 相似文献
4.
《Expert systems with applications》2014,41(15):6904-6916
With the number of documents describing real-world events and event-oriented information needs rapidly growing on a daily basis, the need for efficient retrieval and concise presentation of event-related information is becoming apparent. Nonetheless, the majority of information retrieval and text summarization methods rely on shallow document representations that do not account for the semantics of events. In this article, we present event graphs, a novel event-based document representation model that filters and structures the information about events described in text. To construct the event graphs, we combine machine learning and rule-based models to extract sentence-level event mentions and determine the temporal relations between them. Building on event graphs, we present novel models for information retrieval and multi-document summarization. The information retrieval model measures the similarity between queries and documents by computing graph kernels over event graphs. The extractive multi-document summarization model selects sentences based on the relevance of the individual event mentions and the temporal structure of events. Experimental evaluation shows that our retrieval model significantly outperforms well-established retrieval models on event-oriented test collections, while the summarization model outperforms competitive models from shared multi-document summarization tasks. 相似文献
5.
Edith Cohen Nick Duffield Haim Kaplan Carstent Lund Mikkel Thorup 《Journal of Computer and System Sciences》2014
Statistical summaries of IP traffic are at the heart of network operation and are used to recover aggregate information on subpopulations of flows. It is therefore of great importance to collect the most accurate and informative summaries given the router's resource constraints. A summarization algorithm, such as Cisco's sampled NetFlow, is applied to IP packet streams that consist of multiple interleaving IP flows. We develop sampling algorithms and unbiased estimators which address sources of inefficiency in current methods. First, we design tunable algorithms whereas currently a single parameter (the sampling rate) controls utilization of both memory and processing/access speed (which means that it has to be set according to the bottleneck resource). Second, we make a better use of the memory hierarchy, which involves exporting partial summaries to slower storage during the measurement period. 相似文献
6.
Event detection and analysis from video streams 总被引:9,自引:0,他引:9
Medioni G. Cohen I. Bremond F. Hongeng S. Nevatia R. 《IEEE transactions on pattern analysis and machine intelligence》2001,23(8):873-889
We present a system which takes as input a video stream obtained from an airborne moving platform and produces an analysis of the behavior of the moving objects in the scene. To achieve this functionality, our system relies on two modular blocks. The first one detects and tracks moving regions in the sequence. It uses a set of features at multiple scales to stabilize the image sequence, that is, to compensate for the motion of the observer, then extracts regions with residual motion and uses an attribute graph representation to infer their trajectories. The second module takes as input these trajectories, together with user-provided information in the form of geospatial context and goal context to instantiate likely scenarios. We present details of the system, together with results on a number of real video sequences and also provide a quantitative analysis of the results 相似文献
7.
Lu Zhao Lin Yu-Ru Huang Xiaoxia Xiong Naixue Fang Zhijun 《Multimedia Tools and Applications》2017,76(8):10855-10879
Multimedia Tools and Applications - Nowadays, microblogging has become popular, with hundreds of millions of short messages being posted and shared every minute on a variety of topics in social... 相似文献
8.
Crisci Alfonso Grasso Valentina Nesi Paolo Pantaleo Gianni Paoli Irene Zaza Imad 《Multimedia Tools and Applications》2018,77(10):12203-12232
Multimedia Tools and Applications - The predictive capabilities of metrics based on Twitter data have been stressed in different fields: business, health, market, politics, etc. In specific cases,... 相似文献
9.
Keyframe-based video summarization using Delaunay clustering 总被引:1,自引:0,他引:1
Padmavathi Mundur Yong Rao Yelena Yesha 《International Journal on Digital Libraries》2006,6(2):219-232
Recent advances in technology have made tremendous amounts of multimedia information available to the general population.
An efficient way of dealing with this new development is to develop browsing tools that distill multimedia data as information
oriented summaries. Such an approach will not only suit resource poor environments such as wireless and mobile, but also enhance
browsing on the wired side for applications like digital libraries and repositories. Automatic summarization and indexing
techniques will give users an opportunity to browse and select multimedia document of their choice for complete viewing later.
In this paper, we present a technique by which we can automatically gather the frames of interest in a video for purposes
of summarization. Our proposed technique is based on using Delaunay Triangulation for clustering the frames in videos. We
represent the frame contents as multi-dimensional point data and use Delaunay Triangulation for clustering them. We propose
a novel video summarization technique by using Delaunay clusters that generates good quality summaries with fewer frames and
less redundancy when compared to other schemes. In contrast to many of the other clustering techniques, the Delaunay clustering
algorithm is fully automatic with no user specified parameters and is well suited for batch processing. We demonstrate these
and other desirable properties of the proposed algorithm by testing it on a collection of videos from Open Video Project.
We provide a meaningful comparison between results of the proposed summarization technique with Open Video storyboard and
K-means clustering. We evaluate the results in terms of metrics that measure the content representational value of the proposed
technique. 相似文献
10.
Do Heon Lee Myoung Ho Kim 《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》1997,27(4):671-680
Summary discovery is one of the major components of knowledge discovery in databases, which provides the user with comprehensive information for grasping the essence from a large amount of information in a database. In this paper, we propose an interactive top-down summary discovery process which utilizes fuzzy ISA hierarchies as domain knowledge. We define a generalized tuple as a representational form of a database summary including fuzzy concepts. By virtue of fuzzy ISA hierarchies where fuzzy ISA relationships common in actual domains are naturally expressed, the discovery process comes up with more accurate database summaries. We also present an informativeness measure for distinguishing generalized tuples that delivers much information to users, based on Shannon's information theory. 相似文献
11.
Applied Intelligence - Stock Price Prediction is one of the hot research topics in financial engineering, influenced by economic, social, and political factors. In the present stock market, the... 相似文献
12.
Comparative news summarization aims to highlight the commonalities and differences between two comparable news topics by using human-readable sentences. The summary ought to focus on the salient comparative aspects of both topics, and at the same time, it should describe the representative properties of each topic appropriately. In this study, we propose a novel approach for generating comparative news summaries. We consider cross-topic pairs of semantic-related concepts as evidences of comparativeness and consider topic-related concepts as evidences of representativeness. The score of a summary is estimated by summing up the weights of evidences in the summary. We formalize the summarization task as an optimization problem of selecting proper sentences to maximize this score and address the problem by using a mixed integer programming model. The experimental results demonstrate the effectiveness of our proposed model. 相似文献
13.
Luís Marujo Ricardo Ribeiro Anatole Gershman David Martins de Matos João P. Neto Jaime Carbonell 《Knowledge and Information Systems》2017,50(3):945-968
Event detection is a fundamental information extraction task, which has been explored largely in the context of question answering, topic detection and tracking, knowledge base population, news recommendation, and automatic summarization. In this article, we explore an event detection framework to improve a key phrase-guided centrality-based summarization model. Event detection is based on the fuzzy fingerprint method, which is able to detect all types of events in the ACE 2005 Multilingual Corpus. Our base summarization approach is a two-stage method that starts by extracting a collection of key phrases that will be used to help the centrality-as-relevance retrieval model. We explored three different ways to integrate event information, achieving state-of-the-art results in text and speech corpora: (1) filtering of nonevents, (2) event fingerprints as features, and (3) combination of filtering of nonevents and event fingerprints as features. 相似文献
14.
Database summarization using fuzzy ISA hierarchies 总被引:3,自引:0,他引:3
Do Heon Lee Myoung Ho Kim 《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》1997,27(1):68-78
Summary discovery is one of the major components of knowledge discovery in databases, which provides the user with comprehensive information for grasping the essence from a large amount of information in a database. We propose an interactive top down summary discovery process which utilizes fuzzy ISA hierarchies as domain knowledge. We define a generalized tuple as a representational form of a database summary including fuzzy concepts. By virtue of fuzzy ISA hierarchies where fuzzy ISA relationships common in actual domains are naturally expressed, the discovery process comes up with more accurate database summaries. We also present an informativeness measure for distinguishing generalized tuples that delivers much information to users, based on C. Shannon's (1948) information theory. 相似文献
15.
Flora Amato Aniello Castiglione Vincenzo Moscato Antonio Picariello Giancarlo Sperlì 《Multimedia Tools and Applications》2018,77(14):17803-17827
In this work, we propose a novel multimedia summarization technique from Online Social Networks (OSNs). In particular, we model each Multimedia Social Network (MSN)—i.e. an OSN focusing on the management and sharing of multimedia information—using an hypergraph based approach and exploit influence analysis methodologies to determine the most important multimedia objects with respect to one or more topics of interest. Successively, we obtain from the list of candidate objects a multimedia summary using a summarization model together with an heuristics that aims to generate summaries with priority (with respect to some user keywords), continuity, variety and not receptiveness features. The performed experiments on Flickr shows the effectiveness of proposed approach. 相似文献
16.
The Journal of Supercomputing - Depression is the most prevalent mental disorder that can lead to suicide. Due to the tendency of people to share their thoughts on social platforms, social data... 相似文献
17.
18.
19.
Dynamic video summarization using two-level redundancy detection 总被引:1,自引:0,他引:1
The mushroom growth of video information, consequently, necessitates the progress of content-based video analysis techniques.
Video summarization, aiming to provide a short video summary of the original video document, has drawn much attention these
years. In this paper, we propose an algorithm for video summarization with a two-level redundancy detection procedure. By
video segmentation and cast indexing, the algorithm first constructs story boards to let users know main scenes and cast (when
this is a video with cast) in the video. Then it removes redundant video content using hierarchical agglomerative clustering
in the key frame level. The impact factors of scenes and key frames are defined, and parts of key frames are selected to generate
the initial video summary. Finally, a repetitive frame segment detection procedure is designed to remove redundant information
in the initial video summary. Results of experimental applications on TV series, movies and cartoons are given to illustrate
the proposed algorithm.
相似文献
Wei-Bo Wang |
20.
Daniel Kroening Natasha Sharygina Stefano Tonetta Aliaksei Tsitovich Christoph M. Wintersteiger 《Formal Methods in System Design》2013,42(3):221-261
This paper presents algorithms for program abstraction based on the principle of loop summarization, which, unlike traditional program approximation approaches (e.g., abstract interpretation), does not employ iterative fixpoint computation, but instead computes symbolic abstract transformers with respect to a set of abstract domains. This allows for an effective exploitation of problem-specific abstract domains for summarization and, as a consequence, the precision of an abstract model may be tailored to specific verification needs. Furthermore, we extend the concept of loop summarization to incorporate relational abstract domains to enable the discovery of transition invariants, which are subsequently used to prove termination of programs. Well-foundedness of the discovered transition invariants is ensured either by a separate decision procedure call or by using abstract domains that are well-founded by construction. We experimentally evaluate several abstract domains related to memory operations to detect buffer overflow problems. Also, our light-weight termination analysis is demonstrated to be effective on a wide range of benchmarks, including OS device drivers. 相似文献