首页 | 本学科首页   官方微博 | 高级检索  
     

多段落中文阅读理解模型
引用本文:赵峻瑶,庞亮,苏立新,兰艳艳,郭嘉丰,程学旗.多段落中文阅读理解模型[J].模式识别与人工智能,2019,32(2):161-168.
作者姓名:赵峻瑶  庞亮  苏立新  兰艳艳  郭嘉丰  程学旗
作者单位:1.中国科学院计算技术研究所 网络数据科学与技术重点实验室 北京 100190;
2.中国科学院大学 计算机与控制学院 北京 100190
基金项目:国家重点研发计划(2016QY02D0405)、国家自然科学基金项目(No.61425016,61472401,61722211,61872338,61773362,20180290)、中国青年创新协会CAS项目(No.20144310,20160280)资助
摘    要:解决多段落中文阅读理解任务需要考虑证据段落的稀疏性、中文语义的多样性和答案片段的有效性.基于此种情况,文中设计多段落中文阅读理解模型,利用数据增强的方式学习不包含答案的段落,利用字级别编码和中文词性标注丰富中文的语义表示,通过答案片段的特征训练答案有效性验证模型.将文中模型应用到CIPS-SOGOU事实类问答数据中,实验表明,完全匹配率和F1分数的平均分均有所提高.

关 键 词:阅读理解  智能问答  数据增强
收稿时间:2018-10-21

Chinese Multi-paragraph Reading Comprehension Model
ZHAO Junyao,PANG Liang,SU Lixin,LAN Yanyan,GUO Jiafeng,CHENG Xueqi.Chinese Multi-paragraph Reading Comprehension Model[J].Pattern Recognition and Artificial Intelligence,2019,32(2):161-168.
Authors:ZHAO Junyao  PANG Liang  SU Lixin  LAN Yanyan  GUO Jiafeng  CHENG Xueqi
Affiliation:1.Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190;
2.School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 100190
Abstract:In the Chinese multi-paragraph reading comprehension task, three properties should be taken into account: the sparsity of evidence paragraph, the diversity of Chinese semantic and the validity of answer snippet. To solve these problems, a Chinese multi-paragraph reading comprehension model, CMPReader, is proposed. In CMReader, data augmentation is exploited to learn the paragraphs with no answer. Word level encoding and Chinese word tag are added to enrich the Chinese semantic representation, and the features of answer snippet are employed by the answer verifier model to choose the right answer. CMPReader is applied to the CIPS-SOGOU factoid question answer dataset, and the results show that the average of exact match score and F1 score are increased.
Keywords:Reading Comprehension  Question Answer  Data Augmentation  
本文献已被 维普 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号