基于规则的复句关系词的自动标识 Rule Based Identification of Compound Sentences Relation Words期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于规则的复句关系词的自动标识

引用本文：	贾遂民,雷利利,胡明生.基于规则的复句关系词的自动标识[J].中文信息学报,2015,29(1):44-48.

作者姓名：	贾遂民雷利利胡明生

作者单位：	1.郑州师范学院信息科学与技术学院,河南郑州 450044; 2. 河南财经税务高等专科学校综合实验实训中心,河南郑州 451464

基金项目：	国家自然科学基金(U1204703);中央高校基本科研业务费资助(HUST: 2012QN087, 2012QN088);河南省重点科技攻关项目(122102310004);郑州市创新型科技人才队伍建设工程(10LJRC190)

摘要：	关系词的自动标识是中文信息处理领域的基础性研究课题,该文利用规则实现其自动标识。首先通过语料的分析总结出关系词在使用过程中的12种特征,以这些特征建立规则的约束条件;然后提出包含匹配算法实现复句准关系词序列与规则索引词的匹配,以此获取目标规则,并根据目标规则约束条件与关系词所在语境的匹配结果得到匹配规则;最后利用匹配规则的结论实现关系词的自动标识。实验结果表明,该方法对关系词标识的正确率达到70.9%。
关键词：	关系词规则复句自动标识
Rule Based Identification of Compound Sentences Relation Words

JIA Suimin,LEI Lili,HU Mingsheng.Rule Based Identification of Compound Sentences Relation Words[J].Journal of Chinese Information Processing,2015,29(1):44-48.

Authors:	JIA Suimin LEI Lili HU Mingsheng

Affiliation:	1. College of Information Science & Technology, Zhengzhou Normal University, Zhengzhou, Henan 450044, China; 2. Comprehensive Experimental & Training Center, HeNan College of Finace & Taxation, Zhengzhou, Henan 451464, China

Abstract:	Automatic identifying the relation words of compound sentences is a fundamental issue in the field of Chinese information processing. This paper describe a rule based method for automatic identification of compound sentence relation words. To construct the rule, 12 featuresare summarized from the corpus. Then a match algorithm is described to obtaind the candidate relation word sequence. Finally the context of the relation words is employed to match with the rules. Experiment results show that this method achieves an accuracy of 70.9%.

Keywords:	relation words rule compound sentences auto-identifying

	点击此处可从《中文信息学报》浏览原始摘要信息
	点击此处可从《中文信息学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏