首页 | 本学科首页   官方微博 | 高级检索  
     

基于语义依存关系的汉语语料库的构建
引用本文:尤昉,李涓子,王作英.基于语义依存关系的汉语语料库的构建[J].中文信息学报,2003,17(1):46-53.
作者姓名:尤昉  李涓子  王作英
作者单位:1.清华大学电子工程系2.清华大学计算机科学与技术系
摘    要:语料库是自然语言处理中用于知识获取的重要资源。本文以句子理解为出发点,讨论了在设计和建设一个基于语义依存关系的汉语大规模语料库过程中的几个基础问题,包括:标注体系的选择、标注关系集的确定,标注工具的设计,以及标注过程中的质量控制。该语料库设计规模100万词次,利用70个语义、句法依存关系,在已具有语义类标记的语料上进一步标注句子的语义结构。其突出特点在于将《知网》语义关系体系的研究成果和具体语言应用相结合,对实际语言环境中词与词之间的依存关系进行了有效的描述,它的建成将为句子理解或基于内容的信息检索等应用提供更强大的知识库支持。

关 键 词:计算机应用  中文信息处理  语料库  语义依存关系  《知网》  动态角色与属性  
文章编号:1003-0077(2003)01-0046-08
修稿时间:2002年4月8日

On Construction of a Chinese Corpus Bused on Semantic Dependency Relations
YOU Fang ,LI Juan-zi ,WANG Zuo-ying.On Construction of a Chinese Corpus Bused on Semantic Dependency Relations[J].Journal of Chinese Information Processing,2003,17(1):46-53.
Authors:YOU Fang  LI Juan-zi  WANG Zuo-ying
Affiliation:1.Dept. of Electronics Engineering ,Tsinghua University2.Dept. of Computer Science Technology ,Tsinghua University
Abstract:Corpora are important resources for knowledge acquisition in the field of natural language processing.For the purpose of sentence understanding,we are constructing a Chinese large-scale-corpus based on semantic dependency relations.This paper introduces the tagging formalisms we adopt,the tagging set we choose,the tagging tool we develop,and the method we use to guarantee the good consistency of tagging.The corpus under discussion is at a scale of 1 million words.Each sentence in the corpus,which already had annotations of sense,is further tagged with its semantic structure using 70 semantic-dependency-relations.The highlight of this corpus is its ability to effectively describe various relations between Chinese words.All of these profited from using for reference and the combination with specific use of language.The construction of this corpus can definitely provide more knowledge supports for sentence understanding,content-based information retrieval,and so on.
Keywords:computer application  Chinese information processing  corpus  semantic dependency relations  HowNet  Event Role & Features
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号