首页 | 本学科首页   官方微博 | 高级检索  
     

基于文本特征的短文本倾向性分析研究
引用本文:程南昌,侯敏,滕永林.基于文本特征的短文本倾向性分析研究[J].中文信息学报,2015,29(2):163-169.
作者姓名:程南昌  侯敏  滕永林
作者单位:1. 中国科学院自动化研究所 模式识别国家重点实验室,北京 100190;
2. 中国传媒大学 国家语言资源监测与研究中心有声媒体语言中心, 北京 100024)
基金项目:国家语委十二五规划重点项目(ZDI125-3)
摘    要:语篇倾向性分析是倾向性分析的较高层次领域。根据文本篇幅和结构可以将语篇分为短文本和长文本。该文以网络商品评论作为样本研究短文本倾向性分析的特点和策略。根据倾向极性在文中的决定性因素的不同表现,短文本可以分为含显性归总句、含隐性归总句、含特征词以及一般文本四类,针对不同类别文本采用不同的处理策略。在此基础上,运用词典、规则的方法构建了语篇倾向性分析系统CUCsas,该方法在第四届中文倾向性分析评测(COAE2012)中取得了较好成绩。

关 键 词:短文本  文本特征  归总句  倾向性分析  词典与规则  

Short Text Attitude Analysis Based on Textual Characteristics
CHENG Nanchang;HOU Min;TENG Yonglin.Short Text Attitude Analysis Based on Textual Characteristics[J].Journal of Chinese Information Processing,2015,29(2):163-169.
Authors:CHENG Nanchang;HOU Min;TENG Yonglin
Affiliation:1. National Laboratory of Pattern Recognition, Institute of Automation,
Chinese Academy of Sciences, Beijing 100190, China;
2. Broadcast Media Language Branch, National Langage Resources Monitoring and Research Center, Communication University of China, Beijing 100024, China
Abstract:This paper takes the online product reviews as samples to investigate the characteristics and strategies in the attitude analysis of short texts. According to different performances of decisive factors of attitude polarity, the online review texts can be divided into four categories: the text containing overt summery sentence, the texts containing covert summary sentence, the texts containing characteristic words and the normal texts. Different strategies are established to deal with different types of texts, and a text attitude analysis system CUCsas is constructed based on dictionaries and rules. The system generates promising results in the Fourth Chinese Opinion Analysis Evaluation- COAE2012.
Keywords:short text  textual characteristics  summary sentence  attitude analysis  dictionary and rules  
本文献已被 CNKI 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号