基于种子词的微博表情符情感倾向判定方法 |
| |
引用本文: | 王伟周咏梅阳爱民林江豪陈昱宏曾文俊. 基于种子词的微博表情符情感倾向判定方法[J]. 数据采集与处理, 2017, 32(1): 198-204 |
| |
作者姓名: | 王伟周咏梅阳爱民林江豪陈昱宏曾文俊 |
| |
作者单位: | 1.广东外语外贸大学思科信息学院,广州,510006; 2.广东外语外贸大学语言工程与计算实验室,广州,510006; 3.广东外语外贸大学财务处,广州,510420 |
| |
摘 要: | 情感倾向明显的表情符,容易通过人工进行标注。但是对于情感倾向不明显的表情符,多人手工的标注结果往往难以达成一致。因此,提出一种利用种子词自动判定表情符情感倾向的方法。该方法利用少量种子表情符自动标注情感倾向比较明显的表情符,生成表情符标注集;对于情感倾向不明显的表情符,利用种子情感词和已得到的表情符标注集构建模型,实现其情感倾向的自动判定。实验结果表明,本文方法在微博表情符情感倾向的自动判定上有很好的效果。
|
关 键 词: | 情感分类;机器学习;微博表情符;种子词;自动标注 |
Determination Method for Sentiment Orientation of Microblog Smileys Based on Seed Words |
| |
Abstract: | The smileys with obvious sentiment orientation are easily annotated manually. But the annotations of the smileys with unobvious sentiment orientation are difficult to reach a consensus. A method of automatically determining the sentiment orientation of the microblog smileys with the seed words is proposed. The method automatically annotates the corpus smileys with obvious sentiment orientation using a few seed emotions. Then these smileys are used to generate the labeled smiley set (LSS). Moreover, a model is built based on the seed emotional words and LSS to determine the smileys with unobvious sentiment orientation. Experimental results show that the presented method is effective. |
| |
Keywords: | sentiment classification machine learning microblog smileys seed words automatic labeling |
|
| 点击此处可从《数据采集与处理》浏览原始摘要信息 |
|
点击此处可从《数据采集与处理》下载全文 |