     

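A sentiment classification method for Chinese reviews based on naive Bayes
Cite this article:LU Ling, WANG Yue, YANG Wu. A sentiment classification method for Chinese reviews based on naive Bayes[J]. Journal of Shandong University (Engineering Science), 2013, 43(6): 7-11.
Authors:LU Ling  WANG Yue  YANG Wu
Affiliation:1. College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054; 2. Office of International Cooperation and Exchange, Chongqing University of Technology, Chongqing 400054
Abstract:A new naive Bayes method for sentiment classification of Chinese text is proposed. The method uses sentiment phrases as text features: sentiment phrases are extracted by combining a sentiment dictionary with negation adverbs, features are selected by applying a threshold to the CHI statistic, and a naive Bayes classifier then computes the sentiment class. Classification experiments compared different CHI thresholds, different corpora, and sentiment phrases versus sentiment words as features. The results show that naive Bayes classification with sentiment phrases as features achieves high precision and recall on reviews from different domains, which demonstrates the feasibility of the method.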

Keywords:sentiment dictionary  sentiment classification  Bayesian classification  CHI  sentiment phrases
Received:2013-06-28
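
The phrase-extraction step described in the abstract can be illustrated with a minimal sketch. It assumes the review is already word-segmented, and that a sentiment word directly preceded by a negation adverb is merged with it into one phrase. The lexicon, the adverb list, and the function name `extract_sentiment_phrases` are hypothetical stand-ins; the abstract does not give the paper's actual resources or matching rules.

```python
# Hypothetical toy resources; the paper's real sentiment dictionary and
# negation-adverb list are not given in the abstract.
SENTIMENT_WORDS = {"好", "满意", "差", "失望"}
NEGATION_ADVERBS = {"不", "没", "没有"}


def extract_sentiment_phrases(tokens):
    """Return sentiment phrases from a pre-segmented list of tokens.

    Assumed rule: a sentiment word directly preceded by a negation adverb
    is merged with that adverb into a single phrase; otherwise the
    sentiment word alone is the phrase. Other tokens are ignored.
    """
    phrases = []
    for i, token in enumerate(tokens):
        if token not in SENTIMENT_WORDS:
            continue
        if i > 0 and tokens[i - 1] in NEGATION_ADVERBS:
            phrases.append(tokens[i - 1] + token)
        else:
            phrases.append(token)
    return phrases


review = ["屏幕", "很", "好", ",", "但", "电池", "不", "满意"]
print(extract_sentiment_phrases(review))
```

Treating "不满意" as one feature, distinct from "满意", is what lets a bag-of-features classifier tell a negated opinion from a plain one.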

A method of sentiment classification for Chinese comments based on naive Bayesian
LU Ling, WANG Yue, YANG Wu. A method of sentiment classification for Chinese comments based on naive Bayesian[J]. Journal of Shandong University (Engineering Science), 2013, 43(6): 7-11.
Authors:LU Ling  WANG Yue  YANG Wu
Affiliation:1. College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China; 2. Office of International Cooperation, Chongqing University of Technology, Chongqing 400054, China
Abstract:A naive Bayes method for sentiment classification of Chinese reviews was presented. Sentiment phrases were used as the document features. The task was decomposed into three phases: identifying sentiment phrases with a sentiment dictionary combined with negation adverbs, selecting features according to a CHI-statistic threshold, and constructing a naive Bayes sentiment classifier. Experiments were conducted under different CHI thresholds, with different feature types (sentiment words versus sentiment phrases), and on training corpora from different domains. The results showed that the naive Bayes classifier with sentiment phrases as features achieved high precision and recall on reviews from different domains, which demonstrates the feasibility of the method.
Keywords:sentiment phrases  sentiment dictionary  naive Bayesian  sentiment classification  CHI  
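
The second and third phases (CHI-threshold feature selection, then naive Bayes classification) might look roughly like the following sketch. It assumes the standard chi-square statistic over document counts and a multinomial naive Bayes model with add-one smoothing. The abstract does not state the exact CHI formulation, the smoothing scheme, or the threshold values used in the paper, so all of these, along with every function name and the toy data, are assumptions made for illustration.

```python
import math
from collections import Counter


def chi_square(docs, labels, term, cls):
    """Standard CHI statistic of `term` for class `cls` (document counts)."""
    a = b = c = d = 0
    for feats, label in zip(docs, labels):
        has_term = term in feats
        if label == cls:
            a += has_term          # in class, contains term
            c += not has_term      # in class, lacks term
        else:
            b += has_term          # other class, contains term
            d += not has_term      # other class, lacks term
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - c * b) ** 2 / denom if denom else 0.0


def select_features(docs, labels, threshold):
    """Keep terms whose best per-class CHI score reaches the threshold."""
    vocab = {t for feats in docs for t in feats}
    classes = set(labels)
    return {t for t in vocab
            if max(chi_square(docs, labels, t, c) for c in classes) >= threshold}


def train_nb(docs, labels, features):
    """Multinomial naive Bayes with add-one smoothing (an assumed choice)."""
    classes = sorted(set(labels))
    prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for feats, label in zip(docs, labels):
        counts[label].update(f for f in feats if f in features)
    loglik = {}
    for c in classes:
        total = sum(counts[c].values()) + len(features)
        loglik[c] = {f: math.log((counts[c][f] + 1) / total) for f in features}
    return prior, loglik


def classify(model, feats):
    """Return the class with the highest log posterior; unknown terms are skipped."""
    prior, loglik = model
    scores = {c: prior[c] + sum(loglik[c][f] for f in feats if f in loglik[c])
              for c in prior}
    return max(scores, key=scores.get)


# Toy training data: each document is a list of sentiment phrases.
docs = [["好", "满意"], ["好"], ["不满意", "差"], ["差"]]
labels = ["pos", "pos", "neg", "neg"]

features = select_features(docs, labels, threshold=2.0)  # keeps "好" and "差"
model = train_nb(docs, labels, features)
print(classify(model, ["好"]), classify(model, ["不满意", "差"]))
```

On this toy data, "好" and "差" each score 4.0 and survive the threshold, while "满意" and "不满意" score about 1.33 and are dropped. Raising or lowering `threshold` trades vocabulary size against feature quality, which is the parameter the experiments in the abstract vary.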