首页 | 本学科首页   官方微博 | 高级检索  
     

基于Tri-training的评价单元识别
引用本文:蒋 润,顾春华,阮 彤.基于Tri-training的评价单元识别[J].计算机应用,2014,34(4):1099-1104.
作者姓名:蒋 润  顾春华  阮 彤
作者单位:华东理工大学 信息科学与工程学院,上海 200237
摘    要:评价单元的识别是情感倾向性分析中重要的一步,但由于标注语料匮乏,大多数研究集中在用人工构建规则、模板来识别评价单元的方法上。为了减轻标注训练语料的工作,同时进一步挖掘未标记样本的信息,提出一种基于协同训练机制的评价单元识别算法,以利用少量的已标记样本和大量的未标记样本来提高识别性能。该算法利用Tri-training的思想,将支持向量机(SVM)、最大熵(MaxEnt)以及条件随机场(CRF)三个不同分类器组合成一个分类体系,对生成的评价单元候选集进行分类。将Tri-training的算法思想应用于实验来对比采用单一分类器的方法,结果表明,该算法能够有效地识别主观句中的评价单元。

关 键 词:半监督学习  协同训练  Tri-training  评价单元  依存分析  评价对象
收稿时间:2013-10-08
修稿时间:2013-12-08

Appraisal expression recognition based on Tri-training
JIANG Run GU Chunhua RUAN Tong.Appraisal expression recognition based on Tri-training[J].journal of Computer Applications,2014,34(4):1099-1104.
Authors:JIANG Run GU Chunhua RUAN Tong
Affiliation:School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
Abstract:Appraisal expression recognition is very important in sentiment analysis. Because of the lack of labeled corpus, most former works in appraisal expression recognition are focused on construction of rules and templates manually. In order to reduce the training work of labeling corpus and further mining information of unlabeled corpus, a new algorithm based on co-training was proposed, which mainly used massive unlabeled corpus and only a small number of labeled corpus. The proposed algorithm was based on Tri-training and combined Support Vector Machine (SVM), Maximum Entropy (MaxEnt) and Conditional Random Field (CRF) to build a new approach for candidate appraisal expression classification. By comparing the Tri-training based algorithm with the former single classifier based algorithms, the former can effectively improve the performance of appraisal expression recognition in subjective sentences.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号