首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于混合特征的中文情感词典扩展方法
引用本文:谢松县,赵舒怡.一种基于混合特征的中文情感词典扩展方法[J].计算机工程与科学,2016,38(7):1502-1509.
作者姓名:谢松县  赵舒怡
作者单位:;1.国防科学技术大学计算机学院;2.国家电网技术学院
摘    要:覆盖面广且领域适应性好的情感词典可以有效提高文本情感分析效能。设计了基于连词语言特征和词性特征向量统计特征的中文情感词典扩展算法,提出了综合两种方法的混合特征算法。算法计算得到词语的细粒度的积极和消极情感极性值,并对通用情感词典在领域内进行扩展以提高覆盖度,对词典进行领域内调整以提高适应性。实验结果表明,算法在领域内扩展获得的词典比通用情感词典覆盖度和适应性更好,在情感分类任务中性能接近有监督方法。

关 键 词:情感分析  情感词典  语言特征  统计特征  混合特征
收稿时间:2015-05-05
修稿时间:2016-07-25

A Chinese sentiment lexicon extension method based on mixing features
XIE Song-xian,ZHAO Shu-yi.A Chinese sentiment lexicon extension method based on mixing features[J].Computer Engineering & Science,2016,38(7):1502-1509.
Authors:XIE Song-xian  ZHAO Shu-yi
Affiliation:(1.College of Computer,National University of Defense Technology,Changsha 410073; 2.State Grid of China Technology College,Tai’an 271000 China)
Abstract:The performance of sentiment analysis can be improved effectively with the help of a wide-coverage and good domain-adapting sentiment lexicon. We firstly design two Chinese sentiment lexicon extension algorithms, which base on conjunctions feature and POS-vector statistical feature respectively. We then propose an integrated mixing feature method that combines the two algorithms. Fine-grained positive and negative values can be calculated for opinion words, the coverage of the lexicon can be improved within a domain, and the adaption of the lexicon can be improved with adjustment in the domain. Experimental results show that the extension lexicon has wider coverage and better adaption than a general lexicon in a domain, and the proposal's performance of sentiment classification can approximate that of a supervised method.
Keywords:
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号