首页 | 本学科首页   官方微博 | 高级检索  
     

基于HowNet和PMI的词语情感极性计算
引用本文:王振宇,吴泽衡,胡方涛.基于HowNet和PMI的词语情感极性计算[J].计算机工程,2012,38(15):187-189,193.
作者姓名:王振宇  吴泽衡  胡方涛
作者单位:1. 华南理工大学软件学院,广州,510006
2. 华南理工大学计算机科学与工程学院,广州,510006
基金项目:广东省科技计划基金资助项目“基于情感极性分析的互联网敏感信息监控系统项目号”
摘    要:基于语料库的点互信息(PMI)计算方法依赖于语料库的完善性,基于HowNet的计算方法则依赖于知网相似度计算的准确性。为克服2种方法的局限性,提出一种HowNet和PMI相融合的词语极性计算方法,利用知网进行同义词扩展,降低情感词在语料库中出现频率低所带来的问题。实验结果表明,该方法的微平均和宏平均性能比传统方法提升约5%。

关 键 词:情感分析  点互信息  知网  同义词扩展  相似度
收稿时间:2011-08-09

Words Sentiment Polarity Calculation Based on HowNet and PMI
WANG Zhen-yu , WU Ze-heng , HU Fang-tao.Words Sentiment Polarity Calculation Based on HowNet and PMI[J].Computer Engineering,2012,38(15):187-189,193.
Authors:WANG Zhen-yu  WU Ze-heng  HU Fang-tao
Affiliation:a(a.School of Software;b.School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,China)
Abstract:The polarity calculation of word level is the basis of sentiment analysis of sentence level and discourse level.The traditional calculation methods based on Point Mutual Information(PMI) or HowNet have their own defects: methods of PMI depend on the perfection of the corpus,and methods of HowNet depend on accuracy of the similarity calculation based on HowNet.In order to improve these deficiencies,an improved method for calculating the polarity of words is proposed,combining HowNet with PMI.First of all,HowNet is used to expand the synonyms of the emotional words in order to reduce the impact of some emotional words which have low frequency in the corpus,and then,according to the similarity calculation based on HowNet,it integrates the similarity based on HowNet with that of PMI.Experimental results show the new method increases micro average and macro average by 5% compared with traditional methods.
Keywords:sentiment analysis  Point Mutual Information(PMI)  HowNet  synonym expansion  similarity
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号