
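Research on Construction Techniques for a Chinese Reading Comprehension Corpus (中文阅读理解语料库构建技术研究)
Cite this article: HAO Xiao-yan, LI Ji-hong, YOU Li-ping, LIU Kai-ying. 中文阅读理解语料库构建技术研究[J]. 中文信息学报 (Journal of Chinese Information Processing), 2007, 21(6): 29-35.
Authors: HAO Xiao-yan  LI Ji-hong  YOU Li-ping  LIU Kai-ying
Affiliations: 1. College of Computer and Software, Taiyuan University of Technology, Taiyuan, Shanxi 030024;
2. Shanxi University, Taiyuan, Shanxi 030006
Funding: National High Technology Research and Development Program of China (863 Program)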
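Abstract: A reading comprehension question answering system automatically analyzes a natural-language passage and generates an answer to each question from the information in that passage; such systems have high research value. However, the lack of a Chinese reading comprehension corpus has become the main obstacle to the development of Chinese reading comprehension question answering systems. This paper describes in detail how a Chinese reading comprehension corpus was built, covering material selection, question writing, answer-sentence annotation, corpus processing, and the evaluation mechanism. In particular, it describes the deep-processing technique that annotates the corpus at three levels (frame elements, phrase types, and syntactic functions) based on the Chinese frame-semantic knowledge base (Chinese FrameNet).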

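Keywords: computer application  Chinese information processing  reading comprehension question answering system  Chinese reading comprehension corpus  Chinese frame-semantic knowledge base (Chinese FrameNet)
Article ID: 1003-0077(2007)06-0029-07
Received: 2007-03-12
Revised: 2007-07-26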

A Research on Building of Chinese Reading Comprehension Corpus
HAO Xiao-yan, LI Ji-hong, YOU Li-ping, LIU Kai-ying. A Research on Building of Chinese Reading Comprehension Corpus[J]. Journal of Chinese Information Processing, 2007, 21(6): 29-35.
Authors: HAO Xiao-yan  LI Ji-hong  YOU Li-ping  LIU Kai-ying
Affiliation: 1. Academe of Computer & Software Engineering, Taiyuan University of Technology, Taiyuan, Shanxi 030024, China;
2. Shanxi University, Taiyuan, Shanxi 030006, China
Abstract: A Question Answering System for Reading Comprehension (QARC) automatically analyzes a passage of natural-language text and generates an answer to each question based on the information in the passage. The reading comprehension task can be a valuable tool for evaluating the performance of a natural language understanding system. Unfortunately, the lack of a Chinese Reading Comprehension Corpus (CRCC) is the main obstacle to the research and development of Chinese QARC. This paper describes in detail the process of building a CRCC, including material selection, question writing, answer labeling, corpus processing, and evaluation methods. In particular, we annotated the texts on three layers (frame element, phrase type, and syntactic function) based on the Chinese FrameNet (CFN) knowledge base.
Keywords: computer application  Chinese information processing  question answering system for reading comprehension  Chinese reading comprehension corpus  Chinese FrameNet
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.