首页 | 本学科首页   官方微博 | 高级检索  
     

考虑用户特征的主题情感联合模型
引用本文:许银洁,孙春华,刘业政.考虑用户特征的主题情感联合模型[J].计算机应用,2018,38(5):1261-1266.
作者姓名:许银洁  孙春华  刘业政
作者单位:合肥工业大学 管理学院, 合肥 230009
基金项目:教育部人文社科基金资助项目(15YJC630111)。
摘    要:现有的主题情感联合(JST)模型能够同时识别文本中的主题和情感,但是现有的JST模型主要是对文本内容建模,没有考虑用户特征,导致情感分析结果出现用户人口统计偏差和行为事件偏差。提出了考虑用户特征的主题情感联合(JUST)模型,JUST模型的主要改进之处在于,将用户特征加入模型,以文档所对应的用户特征的线性函数作为文档-情感分布的先验,由此得到具有不同特征的用户群体的情感倾向。在汽车之家网站(www.autohome.com.cn)的13252条汽车评论数据集上,检验了JUST模型的有效性,实验结果表明,加入用户特征的JUST模型情感分类效果优于JST模型和TSMMF模型,同时比较了汽车之家网站上不同特征用户之间的关注主题情感差异。

关 键 词:情感分析  用户特征  主题模型  隐含狄利克雷分布  吉布斯采样  
收稿时间:2017-11-15
修稿时间:2017-12-22

Joint sentiment/topic model integrating user characteristics
XU Yinjie,SUN Chunhua,LIU Yezheng.Joint sentiment/topic model integrating user characteristics[J].journal of Computer Applications,2018,38(5):1261-1266.
Authors:XU Yinjie  SUN Chunhua  LIU Yezheng
Affiliation:School of Management, Hefei University of Technology, Hefei Anhui 230009, China
Abstract:The Joint Sentiment/Topic (JST) model can extract both the topic and the sentiment from the text, but the existing JST model mainly focuses on textual content, without considering the user characteristics, which may lead to demographic and event biases in sentiment mining reports. The Joint-User Sentiment/Topic (JUST) model was proposed. The main improvement of the JUST model was that the user characteristics were added to the model, a linear function of the user characteristics corresponding to the document was used as a priori of the document-emotional distribution, so the model could get emotional tendencies of different topics from customer with different characteristics. The validity of the JUST model was tested on the datasets of 13252 automobile review from autohome.com (www.autohome.com.cn). The experimental results show that the accuracy of the sentiment classification of the JUST model is higher than those of the JST model and TSMMF (Topic Sentiment Model based on Multi-feature Fusion) model. The topic and sentiment differences between users with different characteristics were also compared.
Keywords:sentiment analysis                                                                                                                        user characteristics                                                                                                                        topic model                                                                                                                        Latent Dirichlet Allocation (LDA)                                                                                                                        Gibbs sampling
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号