首页 | 本学科首页   官方微博 | 高级检索  
     

N-Reader:基于双层Self-attention的机器阅读理解模型
引用本文:梁小波,任飞亮,刘永康,潘凌峰,侯依宁,张熠,李妍. N-Reader:基于双层Self-attention的机器阅读理解模型[J]. 中文信息学报, 2018, 32(10): 130-137
作者姓名:梁小波  任飞亮  刘永康  潘凌峰  侯依宁  张熠  李妍
作者单位:东北大学 计算机科学与工程学院,辽宁 沈阳 110169
基金项目:国家自然科学基金(61572120,61672138,61432013,61671070)
摘    要:机器阅读理解是自然语言处理和人工智能领域的重要前沿课题,近年来受到学术界和工业界的广泛关注。为了提升机器阅读理解系统在多文档中文数据集上的处理能力,我们提出了N-Reader,一个基于神经网络的端到端机器阅读理解模型。该模型的主要特点是采用双层self-attention机制对单个文档以及所有输入文档进行编码。通过这样的编码,不仅可以获取单篇文档中的关键信息,还可以利用多篇文档中的相似性信息。另外,我们还提出了一种多相关段落补全算法来对输入文档进行数据预处理。这种补全算法可以帮助模型识别出具有相关语义的段落,进而预测出更好的答案序列。使用N-Reader模型,我们参加了由中国中文信息学会、中国计算机学会和百度公司联手举办的“2018机器阅读理解技术竞赛”,取得了第3名的成绩。

关 键 词:机器阅读理解  神经网络  注意力机制  

N-Reader: Machine Reading Comprehension Model Based on Double Layers of Self-attention
LIANG Xiaobo,REN Feiliang,LIU Yongkang,PAN Lingfeng,HOU Yining,ZHANG Yi,LI Yan. N-Reader: Machine Reading Comprehension Model Based on Double Layers of Self-attention[J]. Journal of Chinese Information Processing, 2018, 32(10): 130-137
Authors:LIANG Xiaobo  REN Feiliang  LIU Yongkang  PAN Lingfeng  HOU Yining  ZHANG Yi  LI Yan
Affiliation:School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning 110169, China
Abstract:Machine reading comprehension (MRC) is an important task in natural language processing and artificial intelligence. To improve the Chinese multi-document MRC, this paper proposes N-Reader, an end-to-end MRC model based on neural network. It applies a two-layer self-attention mechanism to encode the input documents to ultilize both the information from a single document and the similarity information from multiple documents. Besides, this paper also proposes a multi-paragraph completion algorithm to preprocess the input documents. This preprocessing method can further recognize the semantics-related paragraphs among input documents, and contribute to a better answer sequence. In the “2018 NLP Challenge on Machine Reading Comprehension” jointly organized by Chinese Information Processing Society of China (CIPS), Chinese Computer Federation (CCF), and Baidu Inc., our model ranks No.3 among the hugely competitive models.
Keywords:machine reading comprehension    neural network    attention mechanism  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号