首页 | 本学科首页   官方微博 | 高级检索  
     

基于多特征融合的汉语情感分类研究*
引用本文:钟将,邓时滔.基于多特征融合的汉语情感分类研究*[J].计算机应用研究,2012,29(1):98-100.
作者姓名:钟将  邓时滔
作者单位:重庆大学计算机学院,重庆,400044
基金项目:国家“211工程”三期建设项目(S-10218)
摘    要:中文情感分类一般分成基于情感词典和基于特征分类两种方法进行研究,但没有考虑过将两种方法得到的特征进行融合来提高分类效果。基于特征分类的方法忽视了特征词在情感词典的褒贬性以及词倾向性的强弱。用基于特征分类方法得到的文本特征建立朴素贝叶斯模型,根据特征词在情感词典中的褒贬性及其通过点对互信息方法得到的词性强弱调整情感词的正负后验概率权重,实现两种特征的融合,提高分类效果并降低了特征维数。

关 键 词:文本情感分类  情感词典  点对互信息  特征选择  朴素贝叶斯

Classification approach of Chinese texts sentiment based on integrated features
ZHONG Jiang,DENG Shi-tao.Classification approach of Chinese texts sentiment based on integrated features[J].Application Research of Computers,2012,29(1):98-100.
Authors:ZHONG Jiang  DENG Shi-tao
Affiliation:(College of Computer Science,Chongqing University,Chongqing 400044,China)
Abstract:Generally the approach of Chinese text sentiment classification was based on the sentiment lexicon or the feature-selection, rather than the integration of the both involved to improve the classification effects. Feature-selection method ignored the emotional tendencies and value of words in the sentiment dictionary. This paper adopted the feature from the method of feature-selection to construct the naive Bayesian model, according to the emotional tendency of the feature in the sentiment dictionary and its value from point mutual information.And adjusted the weights of the positive and negative emotion word posterior probability to achieve the integration, improved the classification results and reduced the feature dimension.
Keywords:text sentiment classification  semantic lexicon  point wise mutual information  feature-selection  naive Bayesian
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号