首页 | 本学科首页   官方微博 | 高级检索  
     

信息抽取的语义知识资源研究
引用本文:袁毓林.信息抽取的语义知识资源研究[J].中文信息学报,2002,16(5):10-16.
作者姓名:袁毓林
作者单位:北京大学中文系
基金项目:教育部人文社会科学研究“十五”规划第一批研究项目(01JB740006)
摘    要:本文讨论支持信息抽取的语义资源的建设问题, 举例说明了信息抽取至少需要三种层面的语义知识:(i)宏观的话语篇章知识, 籍此可以约束信息抽取的匹配模板的类型, 预测关键性的信息项目在文本中的分布位置;(ii)中观的论元结构知识, 籍此可以建立动词的论元成分跟事件模板的传递与继承关系, 帮助确定代词或空语类跟其先行语的回指关系, 进而确定其语义所指;(iii)微观的逻辑结构知识, 籍此可以确定否定词、量化词、模态词等逻辑算子跟其所约束的成分之间的逻辑关系(比如, 哪些成分处于否定的辖城之中, 其中哪个成分是否定的焦点, 在哪些语法条件下否定词是冗余的, 等等)。最后, 指出研究这三种语义知识所可利用的几种理论和方法。

关 键 词:信息抽取  语义资源  话语篇章  论元结构  逻辑结构  

On the Semantic Knowledge Resources for Information Extraction
YUAN Yu-lin.On the Semantic Knowledge Resources for Information Extraction[J].Journal of Chinese Information Processing,2002,16(5):10-16.
Authors:YUAN Yu-lin
Abstract:This paper discusses the matter with the semantic knowledge resources for information extraction (briefly, IE) via many examplescome from real Chinese texts. It demonstrates that a workable IE system at least needs following three levels of semantic knowledge as supporting resources: (i) the discourse structure knowledge of real text, by which the IE system can expect the type of information template and the distribution of the key information items; (ii) the argument structure knowledge of key sentences in real text, by which the propagation and inheritance relation between argument constituents and event template, and the anaphorical relation between the pronouns or empty categories and their precedents can be determined; (iii)logic structure knowledge,by which the IE system can decide the logic relation between the logic operators(e. g. ,negative word,quantifier and modal word,etc. )and their bound elements,e. g. ,the scope and focus of negative words,and the grammatical condition under which the negative word is redundant. Finally, it suggests briefly the available theories and methods to investigate the aforementioned semantic knowledge.
Keywords:information extraction  semantic knowledge resources  discourse structure  argument structure  logic structure
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号