一种改进的融合关联词典的微博倾向性分析方法 |
| |
引用本文: | 赵军王红朱华方.一种改进的融合关联词典的微博倾向性分析方法[J].数据采集与处理,2016,31(6):1220-1227. |
| |
作者姓名: | 赵军王红朱华方 |
| |
作者单位: | 1.山东师范大学信息科学与工程学院,济南,250014; 2.山东省分布式计算软件新技术重点实验室,济南,250014 |
| |
摘 要: | 大多数研究者对微博倾向性分析过多关注的是情感词、形容词和否定词,忽略了
关联词对其情感倾向的影响。为了提高微博情感倾向性分析的准确率,提出了融合关联词的微博倾向性分析方法,考虑微博文本中形容词、程度副词以及关联词之间的组合关系。
本文充分考虑了关联词的结构特点并在已有词典的基础上构建专门用于微博倾向性分析的微博词典、否定词词典和关联词词典,同时考虑到网络新词对微博倾向性的影响,还构建
了一个全新的网络新词词典。借助支持向量机(Support vector machine,SVM)将微博文本分为负向、正向和中性3
类,通过结合情感词典和SVM的方法提高微博文本倾向性分析的准确率。通过对COASE 2014
数据实验可以表明,本文方法对微博倾向性分析取得了较好的效果。
|
关 键 词: | 中文微博 倾向分析 支持向量机 关联词 |
Improved Method for Analyzing Microblog Orientation Based on Association Lexicon |
| |
Affiliation: | 1.School of Information Science and Engineering, Shandong Normal University, Jinan, 250014, China; 2.Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, 250014, China |
| |
Abstract: | At present, a larger number of researchers
focus on Micro-blog orientation on the emotional words, adverb and negative words without considering the impact of connectives. To improve the accuracy of orientation analysis, a method of analyzing Mico-blog orientation is proposed. In the paper, we sufficiently analyze the structure characteristics of associated words and consider the combination laws of negative words , adversative words and conjunctions in Microblog. In addition,
a specific dictionary is created based on the existing resources, which contains a turning words lexicon, a connective lexicon and a negative words lexicon. At
the same time, we take into account the impact of new network words and phrases
of the microblog text, so we also build a new network words dictionary. Therefore, the Microblog texts are classified into three categories including negative,
positive and neutral one by support vector machine (SVM). By combining Lexicon-based and SVM machine learning method, better accuracy of classification can be achieved. Experimental results verify that the method achieves higher classification accuracy through experiments using COASE 2014. |
| |
Keywords: | Chinese microblog orientation analysis support vector machine (SVM) connectives |
|
| 点击此处可从《数据采集与处理》浏览原始摘要信息 |
|
点击此处可从《数据采集与处理》下载全文 |
|