首页 | 本学科首页   官方微博 | 高级检索  
     

基于粗糙集的微博用户性别识别
引用本文:黄发良 熊金波 黄添强 刘西蒙. 基于粗糙集的微博用户性别识别[J]. 计算机应用, 2014, 34(8): 2209-2211. DOI: 10.11772/j.issn.1001-9081.2014.08.2209
作者姓名:黄发良 熊金波 黄添强 刘西蒙
作者单位:1. 福建师范大学 软件学院,福州3500072. 西安电子科技大学 计算机学院,西安710071
基金项目:教育部人文社会科学研究青年基金资助项目;福建省教育厅科技项目
摘    要:针对微博消息往往会不同程度表现出性别倾向性的特点,从消息内容挖掘的角度出发提出了一种基于粗糙集的微博用户性别识别算法。设计了一种基于容差粗集的微博消息表示模型(TRSRM),有效地刻画微博消息的性别特征。实验结果表明,在1000个真实微博用户的微博消息的测试集下,所提模型的准确率比特征项频数表示模型平均提高了7%,取得了更好的识别效果。

收稿时间:2014-04-29
修稿时间:2014-05-02

Gender identification of microblog users based on rough set
HUANG Faliang XIONG Jinbo HUANG Tianqiang LIU Ximeng. Gender identification of microblog users based on rough set[J]. Journal of Computer Applications, 2014, 34(8): 2209-2211. DOI: 10.11772/j.issn.1001-9081.2014.08.2209
Authors:HUANG Faliang XIONG Jinbo HUANG Tianqiang LIU Ximeng
Affiliation:1. Faculty of Software, Fujian Normal University, Fuzhou Fujian 350007, China;
2. School of Computer Science and Technology, Xidian University, Xi'an Shaanxi 710071, China
Abstract:Concerning gender tendency hidden in microblog messages posted by microblog users, a novel approach based on rough set theory was proposed to identify microblog user gender. In the proposed approach, a new Representation Model based on Tolerance Rough Set (TRSRM) was devised, which can effectively represent gender characteristics of microblog messages. The experimental results show that the accuracy rate of the proposed approach is 7% higher than frequency model approach by testing messages of 1000 real microblog users, and so the TRSRM achieves better recognition performance.
Keywords:
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号