首页 | 本学科首页   官方微博 | 高级检索  
     

基于情感倾向特征的文本情感聚类
引用本文:薛宾,王素格,张鹏,廖建.基于情感倾向特征的文本情感聚类[J].电脑开发与应用,2013(2):1-5.
作者姓名:薛宾  王素格  张鹏  廖建
作者单位:山西大学数学科学学院;山西大学计算机与信息技术学院;山西大学计算智能与中文信息处理教育部重点实验室
基金项目:国家自然科学基金(Nos.61175067,60970014);山西省自然科学基金(No.2010011021-1);山西省科技攻关基金资助项目(No.20110321027-02)
摘    要:利用领域本体对产品评论文本中的特征及其评价词进行抽取,并将特征评价词的情感倾向与特征所在句子的情感倾向进行特征表示,得到文本特征矩阵,在此基础上,利用K-means算法实现了文本的情感聚类。为了验证该方法的有效性,在真实汽车评论文本数据上进行实验,结果表明,基于特征的情感倾向表示的权重相比布尔权重和LDA特征权重的聚类结果,在聚类的纯度和F值上有明显提高。

关 键 词:文本聚类  特征表示  情感倾向

A Feature Representation Algorithm Based on Emotional Tendency for Texts Clustering
XUE Bin,WANG Su-ge,ZHANG Peng,LIAO Jian.A Feature Representation Algorithm Based on Emotional Tendency for Texts Clustering[J].Computer Development & Applications,2013(2):1-5.
Authors:XUE Bin  WANG Su-ge  ZHANG Peng  LIAO Jian
Affiliation:1.School of Mathematics Science,Shanxi University,Taiyuan 030006,China 2.School of Computer and Information Technology,Shanxi University,Taiyuan 030006,China 3.Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education,Shanxi University,Taiyuan 030006,China)
Abstract:Firstly,feature and evaluation word are extracted form product review texts based on domain ontology.On that basis,making use of the feature evaluation word emotional tendency with the sentence emotional tendency to carry out the feature representation gets the text feature matrix to use KMeans clustering,realize the evaluation object emotional clustering.In order to validate the effectiveness of the proposed method,in the real car comment text data on the experiment,the experiment results show that the feature of emotional tendency,than clustering results feature representation using Boolean and LDA,the clustering result have increased significantly in the purity and F-measure.
Keywords:text clustering  feature representation  emotional tendency
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号