Predicting the Lifespan and Retweet Times of Tweets Based on Multiple Feature Analysis |
| |
Authors: | Yongjin Bae Pum‐Mo Ryu Hyunki Kim |
| |
Affiliation: | 1. Yongjin Bae (phone: +82 10 2399 1036, yongjin@etri.re.kr) is with the SW·Content Research Laboratory, ETRI, Daejeon and also is a Master's student in the Department of Computer Software and Engineering, University of Science and Technology, Daejeon, Rep. of Korea.;2. Pum‐Mo Ryu (pmryu@etri.re.kr) and Hyunki Kim (hkkim@etri.re.kr) are with the SW·Content Research Laboratory, ETRI, Daejeon, Rep. of Korea. |
| |
Abstract: | In social network services, such as Facebook, Google+, Twitter, and certain postings attract more people than others. In this paper, we propose a novel method for predicting the lifespan and retweet times of tweets, the latter being a proxy for measuring the popularity of a tweet. We extract information from retweet graphs, such as posting times; and social, local, and content features, so as to construct prediction knowledge bases. Tweets with a similar topic, retweet pattern, and properties are sequentially extracted from the knowledge base and then used to make a prediction. To evaluate the performance of our model, we collected tweets on Twitter from June 2012 to October 2012. We compared our model with conventional models according to the prediction goal. For the lifespan prediction of a tweet, our model can reduce the time tolerance of a tweet lifespan by about four hours, compared with conventional models. In terms of prediction of the retweet times, our model achieved a significantly outstanding precision of about 50%, which is much higher than two of the conventional models showing a precision of around 30% and 20%, respectively. |
| |
Keywords: | Tweet lifespan retweet popularity prediction social network |
|
|