
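A Method for Identifying Spam Microblogs Based on Support Vector Machines
Cite this article: CHEN Xin, ZHENG Xiao, JIAO Yuanyuan, CHEN Huijuan. A Method for Identifying Spam Microblogs Based on Support Vector Machines [J]. Journal of Anhui University of Technology, 2013(4): 440-445.
Authors: CHEN Xin, ZHENG Xiao, JIAO Yuanyuan, CHEN Huijuan
Affiliations: [1] School of Computer Science and Technology, Anhui University of Technology, Ma'anshan, Anhui 243032; [2] School of Computer, Xidian University, Xi'an, Shaanxi 710071
Funding: National Natural Science Foundation of China (61003311); Open Project Fund of the Jiangsu Provincial Key Laboratory of Network and Information Security (BM2003201-201006)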
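Abstract: Targeting the characteristics of spam on Chinese microblogs, new classification features are extracted: Chinese text similarity based on the vector space model, similarity between long and short links, and the regularity of posting times. These features are added to the existing feature set, and a support vector machine is trained on them to obtain a classification model. Experimental results show that the method is an effective technique for identifying spam microblogs.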

Keywords: post features; user features; support vector machine; spam microblog identification

A Method of Identifying Microblog Spammers Based on Support Vector Machine
CHEN Xin, ZHENG Xiao, JIAO Yuanyuan, CHEN Huijuan. A Method of Identifying Microblog Spammers Based on Support Vector Machine [J]. Journal of Anhui University of Technology, 2013(4): 440-445.
Authors: CHEN Xin; ZHENG Xiao; JIAO Yuanyuan; CHEN Huijuan
Affiliation: School of Computer Science and Technology, Anhui University of Technology, Ma'anshan 243032, China; School of Computer, Xidian University, Xi'an 710071, China
Abstract: To address the characteristics of spam on Chinese microblogs, several new features are extracted, including Chinese text similarity based on the vector space model (VSM), similarity between long and short URLs, and posting-time regularity. These are combined with the existing feature set, and a support vector machine is then trained to obtain a classification model. The experimental results show that the proposed method is effective for identifying spammers.
Keywords: status feature; user profile feature; support vector machine; microblog spammers' identification
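
As an illustration of the VSM-based text similarity feature named in the abstract, the following is a minimal sketch in Python, not the authors' implementation. The abstract gives no details of tokenization or term weighting, so the character-bigram tokenizer, the raw term-frequency weights, and the function names (`char_bigrams`, `cosine_similarity`, `mean_pairwise_similarity`) are all assumptions made here for the example. The idea is that a user whose posts are near-duplicates of one another gets a high average similarity, which could then serve as one input feature to a classifier such as an SVM.

```python
import math
from collections import Counter
from itertools import combinations


def char_bigrams(text):
    """Split text into overlapping character bigrams.

    Assumption: bigrams stand in for a real Chinese word segmenter,
    which the abstract does not specify.
    """
    cleaned = "".join(text.split())  # drop all whitespace
    if len(cleaned) < 2:
        return [cleaned] if cleaned else []
    return [cleaned[i:i + 2] for i in range(len(cleaned) - 1)]


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty post is treated as similar to nothing
    return dot / (norm_a * norm_b)


def mean_pairwise_similarity(posts):
    """Average cosine similarity over all pairs of one user's posts.

    Assumption: averaging over pairs is one plausible way to turn
    per-pair similarities into a single per-user feature.
    """
    vectors = [Counter(char_bigrams(p)) for p in posts]
    pairs = list(combinations(vectors, 2))
    if not pairs:
        return 0.0  # fewer than two posts: no pair to compare
    return sum(cosine_similarity(x, y) for x, y in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical posts, invented for illustration only.
    repetitive = ["点击链接领取优惠券", "点击链接领取优惠券", "点击链接领取大礼包"]
    varied = ["今天天气不错", "晚上去看电影", "新书终于到了"]
    print(mean_pairwise_similarity(repetitive))
    print(mean_pairwise_similarity(varied))
```

The resulting score lies in [0, 1] for non-negative term counts, so it can be placed alongside other numeric features without further scaling of its range.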
This article is indexed in VIP (维普) and other databases.