首页 | 本学科首页   官方微博 | 高级检索  
     

基于情感词属性和云模型的文本情感分类方法
引用本文:孙劲光,马志芳,孟祥福. 基于情感词属性和云模型的文本情感分类方法[J]. 计算机工程, 2013, 0(12): 211-215,222
作者姓名:孙劲光  马志芳  孟祥福
作者单位:[1]辽宁工程技术大学电子与信息工程学院,辽宁葫芦岛125105 [2]辽宁工程技术大学研究生学院,辽宁葫芦岛125105
基金项目:国家科技支撑计划基金资助项目(2013bah12f00)
摘    要:受语言固有的模糊性、随机性以及传统文本特征词权重值计算方法不适用于情感词等因素的影响,文本情感分类的正确率很难达到传统文本主题分类的水平。为此,提出一种基于情感词属性和云模型的情感分类方法。结合情感词属性和简单句法结构以确定情感词的权重值,并利用云模型对情感词进行定性定量表示的转换。实验结果表明,该方法对情感词权重值计算是有效的,召回率最高达到78.8%,且与基于词典的方法相比,其文本情感分类结果更精确,正确率最高达到68.4%,增加了约9%的精度。

关 键 词:观点挖掘  文本挖掘  情感分类  云模型  情感词属性  文本特征提取

Classification Method of Texts Sentiment Based on Sentiment Word Attributes and Cloud Model
SUN Jin-guanga,MA Zhi-fangb,MENG Xiang-fu. Classification Method of Texts Sentiment Based on Sentiment Word Attributes and Cloud Model[J]. Computer Engineering, 2013, 0(12): 211-215,222
Authors:SUN Jin-guanga  MA Zhi-fangb  MENG Xiang-fu
Affiliation:a (a. School of Electronics and Information Engineering; b. Institute of Graduate, Liaoning Technical University, Huludao 125105, China)
Abstract:In the era of big data, how to obtain valid information from the Web becomes a keen topic for business, government, and research workers. User's opinion mining becomes a research topic for the area of Natural Language Processing(NLP) and text mining. However, due to the inherent fuzziness and randomness of language, as well as the traditional term weight value calculation method is not suitable for the sentiment word and other factors, the text sentiment classification accuracy is difficult to achieve the performance of traditional text subject classification. To solve these problems, this paper proposes a sentiment classification method based on sentiment word attributes and cloud model. It calculates weight of sentiment words by combining attributes and syntactic structure of sentiment words, and converts qualitative and quantitative of sentiment words based on cloud model. Experimental results show that this method to calculate weights of sentiment words is valid, and the recall rate is up to 78.8%. Text sentiment classification results are more accurate than that based on dictionary, the correction rate is up to 68.4%, and the accuracy is increased by about 9%.
Keywords:opinion mining  text mining  sentiment classification  cloud model  sentiment word attributes  text feature extraction
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号