首页 | 本学科首页   官方微博 | 高级检索  
     

语义角色标注中句法特征的研究
引用本文:李军辉,王红玲,周国栋,朱巧明,钱培德.语义角色标注中句法特征的研究[J].中文信息学报,2009,23(6):11-19.
作者姓名:李军辉  王红玲  周国栋  朱巧明  钱培德
作者单位:苏州大学 计算机科学与技术学院, 江苏 苏州 215006
基金项目:国家863计划资助项目,国家自然科学基金资助项目,江苏省高校自然科学基础研究项目 
摘    要:描述了一个基于特征向量的语义角色标注系统,该系统以单一句法分析树作为输入。首先进行预处理,过滤掉极不可能是角色的成分,然后进行角色分类(包括NULL类),最后处理嵌套情况及对中心语义角色去重处理。在优化组合已有特征的基础上,从语法、句型以及搭配角度出发,制定了新的有效的特征;实验表明了新特征的有效性及健壮性。最终在CoNLL-2005 Shared Task开发集和WSJ测试集上分别获得了77.54%和78.75%的F1值,是目前已知的基于单一句法分析中取得的最好性能。

关 键 词:人工智能  自然语言处理  语义角色标注  语法驱动特征  句型特征  搭配特征
  

The Research on Syntactic Features in Semantic Role Labeling
LI Junhui,WANG Hongling,ZHOU Guodong,ZHU Qiaoming,QIAN Peide.The Research on Syntactic Features in Semantic Role Labeling[J].Journal of Chinese Information Processing,2009,23(6):11-19.
Authors:LI Junhui  WANG Hongling  ZHOU Guodong  ZHU Qiaoming  QIAN Peide
Affiliation:School of Computer Science & Technology, Soochow University, Suzhou, Jiangsu 215006, China
Abstract:A featurebased semantic role labeling system operated on signal syntactic parse is constructed. The system is divided into three sequential tasks: (1) filtering out constituents that represent no semantic arguments with high probabilities, (2) classifying constituents of candidate semantic arguments into the specific categories (including NULL class), and (3) dealing with overlap arguments and constituents all labeled as corearguments in the postprocessing step. Besides combining and optimizing the existing features presented in other work, the paper extracts new features according to knowledge of grammar, pattern and collocation. The experiments show the effectiveness and robustness of the new extracted features, with which the finally SRL system achieves F1 value 77.54% and 78.75% on the development and WSJ test set respectively. As far as we know, it is the best result based on single syntactic parsers on the CoNLL2005 Shared Task.
Keywords:artificial intelligence  natural language processing  semantic role labeling  grammardriven feature  pattern feature  collocation feature
本文献已被 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号