首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的融合关联词典的微博倾向性分析方法
引用本文:赵军王红朱华方.一种改进的融合关联词典的微博倾向性分析方法[J].数据采集与处理,2016,31(6):1220-1227.
作者姓名:赵军王红朱华方
作者单位:1.山东师范大学信息科学与工程学院,济南,250014; 2.山东省分布式计算软件新技术重点实验室,济南,250014
摘    要:大多数研究者对微博倾向性分析过多关注的是情感词、形容词和否定词,忽略了 关联词对其情感倾向的影响。为了提高微博情感倾向性分析的准确率,提出了融合关联词的微博倾向性分析方法,考虑微博文本中形容词、程度副词以及关联词之间的组合关系。 本文充分考虑了关联词的结构特点并在已有词典的基础上构建专门用于微博倾向性分析的微博词典、否定词词典和关联词词典,同时考虑到网络新词对微博倾向性的影响,还构建 了一个全新的网络新词词典。借助支持向量机(Support vector machine,SVM)将微博文本分为负向、正向和中性3 类,通过结合情感词典和SVM的方法提高微博文本倾向性分析的准确率。通过对COASE 2014 数据实验可以表明,本文方法对微博倾向性分析取得了较好的效果。

关 键 词:中文微博  倾向分析  支持向量机  关联词

Improved Method for Analyzing Microblog Orientation Based on Association Lexicon
Affiliation:1.School of Information Science and Engineering, Shandong Normal University, Jinan, 250014, China; 2.Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, 250014, China
Abstract:At present, a larger number of researchers focus on Micro-blog orientation on the emotional words, adverb and negative words without considering the impact of connectives. To improve the accuracy of orientation analysis, a method of analyzing Mico-blog orientation is proposed. In the paper, we sufficiently analyze the structure characteristics of associated words and consider the combination laws of negative words , adversative words and conjunctions in Microblog. In addition, a specific dictionary is created based on the existing resources, which contains a turning words lexicon, a connective lexicon and a negative words lexicon. At the same time, we take into account the impact of new network words and phrases of the microblog text, so we also build a new network words dictionary. Therefore, the Microblog texts are classified into three categories including negative, positive and neutral one by support vector machine (SVM). By combining Lexicon-based and SVM machine learning method, better accuracy of classification can be achieved. Experimental results verify that the method achieves higher classification accuracy through experiments using COASE 2014.
Keywords:Chinese microblog  orientation analysis  support vector machine (SVM)  connectives
点击此处可从《数据采集与处理》浏览原始摘要信息
点击此处可从《数据采集与处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号