首页 | 本学科首页   官方微博 | 高级检索  
     

基于语义理解的文本倾向性识别机制
引用本文:徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100.
作者姓名:徐琳宏  林鸿飞  杨志豪
作者单位:大连理工大学 计算机科学与工程系,辽宁 大连 116024
摘    要:文本倾向性识别在垃圾邮件过滤、信息安全和自动文摘等领域都有广泛的应用。本文提出了基于语义理解的文本倾向性识别机制。其主要思想是首先计算词汇与知网中已标注褒贬性的词汇间的相似度,获取词汇的倾向性;再选择倾向性明显的词汇作为特征值,用SVM分类器分析文本的褒贬性;最后采用否定规则匹配文本中的语义否定的策略提高分类效果,同时处理程度副词附近的褒义词和贬义词,以加强对文本褒贬义强度的识别。

关 键 词:计算机应用  中文信息处理  倾向性识别  知网    语义相似度    否定句  程度副词  
文章编号:1003-0077(2007)01-0096-05
收稿时间:2006-07-13
修稿时间:2006-07-132006-10-08

Text Orientation Identification Based on Semantic Comprehension
XU Lin-hong,LIN Hong-fei,YANG Zhi-hao.Text Orientation Identification Based on Semantic Comprehension[J].Journal of Chinese Information Processing,2007,21(1):96-100.
Authors:XU Lin-hong  LIN Hong-fei  YANG Zhi-hao
Affiliation:Dep. of Computer Science and Engineering, Dalian University of Technology, Dalian, Liaoning 116024, China
Abstract:At the fields of spam filtering,information security and automatic summarizations,text orientation identification is used widely. The paper presents the mechanism based on Semantic Comprehension for text orientation identification.Firstly,it acquires the semantic orientation through computing semantic similarity the vocabulary and tagged vocabulary in How-Net,and it adopts the derogatory or commendatory terms as features of classification.It utilizes Support Vector Machine classifier to identify the text orientation.Finally it deals with the negative sentence via matching negative rules.At the same time,it also identifies the derogatory or commendatory intensity through degree adverb in order to improve the accuracy of classification.
Keywords:computer application  Chinese information processing  orientation identification  HowNet  semantic similarity  negative sentence  degree adverb
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号