首页 | 本学科首页   官方微博 | 高级检索  
     

基于复杂网络的情感分类特征选择
引用本文:张向阳,那日萨. 基于复杂网络的情感分类特征选择[J]. 计算机应用研究, 2017, 34(4)
作者姓名:张向阳  那日萨
作者单位:大连理工大学系统工程研究所,大连理工大学系统工程研究所
基金项目:国家自然科学基金面上项目(61471083);教育部人文社科研究规划(14YJA630044)。
摘    要:首先针对公共情感词典对专业领域适用性较低问题,以公共情感词典作为种子情感词典,以评论语料库中未出现在公共情感词典中的形容词作为候选情感词,在此基础之上利用点互信息理论构建专业领域的情感词典。其次针对在线评论情感分类问题,利用复杂网络理论提出了一种新的情感分类特征选择算法,改进了传统特征选择算法忽略特征语义相关信息,遗漏评论情感资源的问题。通过构建候选特征词关系网络,利用复杂网络节点重要性理论,考虑节点的局部和全局重要性,提出了利用网络节点的度中心性、介数中心性和接近中心性综合衡量节点重要性来选择情感分类特征的算法NTFS(Complex network feature selection)。最后以iPhone手机的在线评论为实验数据,利用SVM、NNET、NB分类器对比了NTFS、GI、CHI传统特征选择方法,实验证明NTFS在分类性能上优于GI,CHI算法。

关 键 词:复杂网络  特征选择  情感分类,情感词典
收稿时间:2016-03-09
修稿时间:2016-04-19

Emotional Classification Feature Selection Based On Complex Network
Zhang Xiang-yang and Zhao Na-ri-sa. Emotional Classification Feature Selection Based On Complex Network[J]. Application Research of Computers, 2017, 34(4)
Authors:Zhang Xiang-yang and Zhao Na-ri-sa
Affiliation:Institute of Systems Engineering,Dalian University of Technology,Institute of Systems Engineering,Dalian University of Technology
Abstract:Firstly, the sentiment word dictionary of professional field is created based on point mutual information theory, public domain dictionary emotion is used as seed semantic lexicon, and adjective words in review crops which is not contained the public domain dictionary emotion is used as candidate emotional characteristics, the reason what we do is the applicability of public domain dictionary emotion used for professional field is not good. Secondly, we propose a new algorithm for feature selection of emotional classification for online review based on complex network. It makes better for semantic relativity between feature words, so that more emotive information is got. The relational complex network is created for candidate feature by complex network theory. Then the part and overall important of nodes is considered. We use degree centrality, betweeness centrality, closeness centrality to measure important of nodes for select emotional classification feature, the algorithm is named NTFS(Complex network feature selection).At last ,online reviews of iphon is used for test data, SVM,NNET,NB are used for classifier, we compare NTFS with GI and CHI, the result shows that NTFS is better than GI,CHI for emotional classification on classification performance.
Keywords:complex network   feature selection   emotional classification   semantic lexicon
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号