
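Sentiment Analysis of Twitter Text Using Convolutional Neural Networks
Cite this article: Wang Yuhan, Zhang Chunyun, Zhao Baolin, Xi Xiaoming, Geng Leilei, Cui Chaoran. Sentiment Analysis of Twitter Text Using Convolutional Neural Networks[J]. Journal of Data Acquisition & Processing, 2018, 33(5): 921-927.
Authors: Wang Yuhan  Zhang Chunyun  Zhao Baolin  Xi Xiaoming  Geng Leilei  Cui Chaoran
Affiliations: 1. School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, 250014; 2. Storage R&D Department, Inspur Electronic Information Industry Co., Ltd., Jinan, 250101
Funding: Supported by the Program for Fostering Talent Teams in Advantageous Disciplines of Shandong Higher Education Institutions; the Shandong Provincial Natural Science Foundation for Distinguished Young Scholars (JQ201316); the Shandong Provincial Natural Science Foundation (ZR2016FQ18); and the Science and Technology Program of Shandong Higher Education Institutions (J17KA065).
Abstract: As social networks become increasingly widespread, sentiment analysis of Twitter text has become an active research topic in recent years. The sentiment expressed in tweets is valuable for identifying user needs and for predicting major events. However, tweets are short and users write casually, and most existing sentiment classification methods rely on hand-crafted text features that struggle to capture the deeper semantics implicit in the text, so classification performance is difficult to improve. This paper proposes a Twitter sentiment classification model based on a convolutional neural network (CNN). The model initializes word vectors with word2vec and uses a CNN to learn deep semantic information from the text, thereby inferring the sentiment polarity of tweets. Experiments show that the model achieves a recall of 82.3%, a significant improvement over traditional classification methods.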

Keywords: Twitter text  sentiment analysis  word vector model  convolutional neural network
Received: 2017/6/7
Revised: 2017/6/26

Sentiment Analysis of Twitter Data Based on CNN
Wang Yuhan,Zhang Chunyun,Zhao Baolin,Xi Xiaoming,Geng Leilei,Cui Chaoran.Sentiment Analysis of Twitter Data Based on CNN[J].Journal of Data Acquisition & Processing,2018,33(5):921-927.
Authors:Wang Yuhan  Zhang Chunyun  Zhao Baolin  Xi Xiaoming  Geng Leilei  Cui Chaoran
Affiliation:1. School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, 250014, China;2. Storage R&D Department, Inspur Electronic Information Industry Co., Ltd., Jinan, 250101, China
Abstract:With the increasing popularity of social networks, sentiment analysis of Twitter text has become an active research topic in recent years. The sentiment expressed in tweets is valuable for mining user needs and predicting major events. However, most existing sentiment classification methods rely on hand-crafted text features and struggle to capture the implicit deep semantics of text. In addition, characteristics of tweets such as their short length and the arbitrariness of users' behavior make it even harder to improve classification performance. This paper presents a novel Twitter sentiment classification model based on a convolutional neural network (CNN). To infer the sentiment of tweets, the proposed model initializes its input word embeddings with the word2vec method and uses a dynamic CNN architecture to learn deep semantics from the text. Experimental results show that the proposed model achieves a recall of 82.3%, which is much higher than that of traditional classification methods.
Keywords:Twitter data  sentiment analysis  word embedding model  convolutional neural network (CNN)
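
The pipeline the abstract outlines (word embeddings fed into a convolutional layer, followed by a classifier) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the abstract does not give the architecture's details, so the sketch assumes a single convolutional layer with ReLU and plain max-over-time pooling rather than the dynamic CNN the paper mentions. The embeddings are random stand-ins for pretrained word2vec vectors, the weights are untrained, and every name here (`init_embeddings`, `conv_max_pool`, `classify`, the toy vocabulary, the filter widths) is hypothetical.

```python
import math
import random


def init_embeddings(vocab, dim, rng):
    # Assumption: random vectors stand in for pretrained word2vec embeddings.
    return {w: [rng.uniform(-0.5, 0.5) for _ in range(dim)] for w in vocab}


def conv_max_pool(sentence_vecs, filt, bias):
    # Slide one filter over windows of consecutive words, apply ReLU,
    # then keep the largest activation (max-over-time pooling).
    width = len(filt)
    best = 0.0
    for start in range(len(sentence_vecs) - width + 1):
        total = bias
        for k in range(width):
            total += sum(w * x for w, x in zip(filt[k], sentence_vecs[start + k]))
        best = max(best, max(0.0, total))
    return best


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def classify(tokens, emb, filters, biases, out_w, out_b):
    dim = len(next(iter(emb.values())))
    zero = [0.0] * dim
    # Out-of-vocabulary words map to a zero vector.
    vecs = [emb.get(t, zero) for t in tokens]
    # Pad short tweets so the widest filter fits at least once.
    max_width = max(len(f) for f in filters)
    while len(vecs) < max_width:
        vecs.append(zero)
    # One pooled feature per filter, then a linear layer and softmax.
    features = [conv_max_pool(vecs, f, b) for f, b in zip(filters, biases)]
    scores = [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(out_w, out_b)
    ]
    return softmax(scores)


rng = random.Random(0)
vocab = ["i", "love", "this", "phone", "hate", "battery"]  # toy vocabulary
dim = 4
emb = init_embeddings(vocab, dim, rng)
widths = [2, 3]  # each filter spans 2 or 3 consecutive words
filters = [
    [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(w)]
    for w in widths
]
biases = [0.0 for _ in filters]
out_w = [[rng.uniform(-0.5, 0.5) for _ in filters] for _ in range(2)]
out_b = [0.0, 0.0]

probs = classify("i love this phone".split(), emb, filters, biases, out_w, out_b)
print(probs)  # two class probabilities; not meaningful until the weights are trained
```

In a real experiment the embedding table would be loaded from a trained word2vec model and the filters and output weights would be learned by backpropagation; the sketch only shows how a tweet flows through the layers.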
