首页 | 本学科首页   官方微博 | 高级检索  
     

U-Net:用于包含无答案问题的机器阅读理解的轻量级模型
引用本文:孙付,李林阳,邱锡鹏,刘扬,黄萱菁. U-Net:用于包含无答案问题的机器阅读理解的轻量级模型[J]. 中文信息学报, 2021, 35(2): 99-106
作者姓名:孙付  李林阳  邱锡鹏  刘扬  黄萱菁
作者单位:1.复旦大学 计算机学院,上海 201210; 2.流利说 硅谷人工智能实验室,美国 旧金山 94104
基金项目:国家自然科学基金(61672162)
摘    要:处理机器阅读理解任务时,识别其中没有答案的问题是自然语言处理领域的一个新的挑战.该文提出U-Net模型来处理这个问题,该模型包括3个主要成分:答案预测模块、无答案判别模块和答案验证模块.该模型用一个U节点将问题和文章拼接为一个连续的文本序列,该U节点同时编码问题和文章的信息,在判断问题是否有答案时起到重要作用,同时对于...

关 键 词:机器阅读理解  SQuAD  注意力机制
收稿时间:2019-11-22

U-Net: Lightweight Model for Machine Reading Comprehension with Unanswerable Questions
SUN Fu,LI Linyang,QIU Xipeng,LIU Yang,HUANG Xuanjing. U-Net: Lightweight Model for Machine Reading Comprehension with Unanswerable Questions[J]. Journal of Chinese Information Processing, 2021, 35(2): 99-106
Authors:SUN Fu  LI Linyang  QIU Xipeng  LIU Yang  HUANG Xuanjing
Affiliation:1. School of Computer Science, Fudan University, Shanghai 201210, China; 2. Liulishuo Silicon Valley AI Lab, San Francisco 94104, USA
Abstract:Machine reading comprehension with unanswerable questions is a challenging task. In this paper, we propose a unified model, called U-Net, with three important components: answer pointer, no-answer pointer, and answer verifier. We introduce a universal node which processes the question and its context passage as a single contiguous sequence of tokens. The universal node encodes the fused information from both the question and passage, and plays an important role to predict whether the question is answerable and also greatly improves the conciseness of the U-Net. Different from the models based on pre-trained BERT, universal node fuses information from passage and question in a variety of ways and avoids the huge computation. The single U-Net model achieves the F1 score of 72.6 and the EM score of 69.3 on SQuAD 2.0, and the ensemble version, 74.9 and 71.4, respectively. Both version of U-Net models rank top among the models without a large scale pre-trained language model.
Keywords:machine reading comprehension  SQuAD  attention mechanism  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号