Time Aware Knowledge Extraction for microblog summarization on Twitter |
| |
Affiliation: | 1. Data Analysis and Modeling Lab Dept. Computer Science, Palacky University, Olomouc 17. listopadu 12, CZ-77146 Olomouc, Czech Republic;2. Institute of Computer Science, Faculty of Science Pavol Jozef Šafárik University in Košice Jesenná 5, 040 01 Košice, Slovakia;1. Department of Informatics and Statistics, Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil;2. Natural Language Processing Group, Department of Communication and Information Technologies, Pompeu Fabra University, Spain |
| |
Abstract: | Microblogging services like Twitter and Facebook collect millions of user generated content every moment about trending news, occurring events, and so on. Nevertheless, it is really a nightmare to find information of interest through the huge amount of available posts that are often noisy and redundant. In the era of Big Data, social media analytics services have caught increasing attention from both research and industry. Specifically, the dynamic context of microblogging requires to manage not only meaning of information but also the evolution of knowledge over the timeline. This work defines Time Aware Knowledge Extraction (briefly TAKE) methodology that relies on temporal extension of Fuzzy Formal Concept Analysis. In particular, a microblog summarization algorithm has been defined filtering the concepts organized by TAKE in a time-dependent hierarchy. The algorithm addresses topic-based summarization on Twitter. Besides considering the timing of the concepts, another distinguishing feature of the proposed microblog summarization framework is the possibility to have more or less detailed summary, according to the user’s needs, with good levels of quality and completeness as highlighted in the experimental results. |
| |
Keywords: | Microblog summarization Time awareness Fuzzy Formal Concept Analysis Big Data Social media analytics |
本文献已被 ScienceDirect 等数据库收录! |
|