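Answer extraction method for Chinese question answering systems based on pattern learning
Cite this article: Yu Zheng-tao, Mao Cun-li, Deng Jin-hui, Zhang Cheng, Guo Jian-yi. Answer extraction method for Chinese question answering systems based on pattern learning [J]. Journal of Jilin University (Engineering and Technology Edition), 2008, 38(1): 142-147.
Authors: Yu Zheng-tao (余正涛), Mao Cun-li (毛存礼), Deng Jin-hui (邓锦辉), Zhang Cheng (章程), Guo Jian-yi (郭剑毅)
Affiliations: School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650051; Institute of Intelligent Information Processing, Yunnan Key Laboratory of Computer Technology Application, Kunming 650051; School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650051
Funding: National Natural Science Foundation of China; Specialized Research Fund for the Doctoral Program of Higher Education (Ministry of Education); Yunnan Province Reserve Talent Training Program for Young and Middle-aged Academic and Technical Leaders; project funded by the Yunnan Provincial Department of Education; Kunming University of Science and Technology research and teaching reform project
Abstract: Answer extraction is the key component of a Chinese question answering system. Answers are usually extracted with the help of answer sentence patterns for the question; because these patterns are distilled by language experts from linguistic rules, they depend heavily on expert experience. To address this limitation, a method is proposed that obtains Chinese answer sentence patterns through pattern learning. The method uses a search engine to retrieve texts related to the questions from the Internet; sentence segments containing the answers are extracted manually and annotated with question type and answer, forming question-answer training corpora for the various question types. Through statistical learning, candidate answer sentence patterns are extracted, their weights are computed, and the answer sentence patterns for each question type are selected according to these weights. Answer extraction results on factoid questions show that the proposed pattern-learning method works well: the answer extraction accuracy in the experiments reached 0.28, and the learned patterns largely cover the conventional answer sentence patterns.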

Keywords: computer software; question answering system; answer extraction; pattern learning; pattern matching
Article ID: 1671-5497(2008)01-0142-06
Received: 2006-10-24
Revised: 2006-10-24

Answer extraction scheme for Chinese question answering system based on pattern learning
Yu Zheng-tao, Mao Cun-li, Deng Jin-hui, Zhang Cheng, Guo Jian-yi. Answer extraction scheme for Chinese question answering system based on pattern learning [J]. Journal of Jilin University: Eng and Technol Ed, 2008, 38(1): 142-147.
Authors: Yu Zheng-tao, Mao Cun-li, Deng Jin-hui, Zhang Cheng, Guo Jian-yi
Abstract: Answer extraction is the key component of a Chinese question answering system. Answer extraction normally depends on answer sentence patterns. Because these patterns are derived by experts from language rules, they rely strongly on the experts' linguistic knowledge. To overcome this limitation, a scheme is proposed that obtains answer sentence patterns by pattern learning. The scheme uses a search engine to retrieve documents related to each question. From these documents, the sentences that contain the answers are extracted. The question types and answers are then annotated to form a question-answer training corpus for each question type. Using a statistical learning method, candidate sentence patterns are extracted and their weights are calculated; based on these weights, the answer sentence patterns for each question type are obtained. Answer extraction results on factoid questions show that the experimental MRAR reaches 0.28, which indicates the effectiveness of the proposed pattern learning scheme. The learned patterns largely cover the conventional answer sentence patterns.
Keywords: computer software; question answering system; answer extraction; pattern learning; pattern matching
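
The abstract outlines a pipeline (annotated answer sentences, candidate pattern extraction, pattern weighting, selection by weight) without giving its details. The following is a minimal sketch of that general idea, not the authors' implementation. Everything specific in it is an assumption made for illustration: the `<Q>`/`<A>` slot notation, the function names, the relative-frequency weight, the `min_weight` threshold, and the toy English training sentences (the paper works on Chinese text and does not state its weighting formula here).

```python
import re
from collections import Counter, defaultdict

# Hypothetical slot markers for the question term and the answer.
Q_SLOT, A_SLOT = "<Q>", "<A>"


def candidate_pattern(sentence, q_term, answer):
    """Cut out the span from question term to answer and replace both with slots."""
    qi, ai = sentence.find(q_term), sentence.find(answer)
    if qi < 0 or ai < 0:
        return None  # the sentence does not contain both pieces
    start = min(qi, ai)
    end = max(qi + len(q_term), ai + len(answer))
    span = sentence[start:end]
    return span.replace(q_term, Q_SLOT).replace(answer, A_SLOT)


def learn_patterns(corpus, min_weight=0.3):
    """Weight each candidate by its relative frequency within its question type
    (an assumed weighting) and keep those at or above min_weight."""
    counts = defaultdict(Counter)
    for q_type, q_term, answer, sentence in corpus:
        pattern = candidate_pattern(sentence, q_term, answer)
        if pattern is not None:
            counts[q_type][pattern] += 1
    learned = {}
    for q_type, counter in counts.items():
        total = sum(counter.values())
        learned[q_type] = {
            p: c / total for p, c in counter.items() if c / total >= min_weight
        }
    return learned


def extract_answer(patterns, q_term, sentence):
    """Match each learned pattern against a sentence; return (answer, weight)
    for the highest-weighted pattern that matches, or None."""
    best = None
    for pattern, weight in patterns.items():
        regex = re.escape(pattern)
        regex = regex.replace(re.escape(Q_SLOT), re.escape(q_term))
        regex = regex.replace(re.escape(A_SLOT), r"(\w+)")
        match = re.search(regex, sentence)
        if match and (best is None or weight > best[1]):
            best = (match.group(1), weight)
    return best


# Toy annotated corpus: (question type, question term, answer, answer sentence).
corpus = [
    ("BIRTH_YEAR", "Mozart", "1756", "Mozart was born in 1756 in Salzburg."),
    ("BIRTH_YEAR", "Gandhi", "1869", "Gandhi was born in 1869 in Porbandar."),
    ("BIRTH_YEAR", "Newton", "1643", "Newton was born in 1643 at Woolsthorpe."),
    ("BIRTH_YEAR", "Darwin", "1809", "Darwin (1809 - 1882) was a naturalist."),
]
learned = learn_patterns(corpus)
print(learned)
print(extract_answer(learned["BIRTH_YEAR"], "Einstein",
                     "Einstein was born in 1879 in Ulm."))
```

In this sketch the pattern `<Q> was born in <A>` occurs in three of the four training sentences and is kept, while the parenthetical pattern seen once falls below the assumed threshold and is discarded. A weight based on how often a pattern yields the correct answer when matched against held-out text would be a reasonable alternative to raw relative frequency.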
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).