Lexical Sememe Prediction by Dictionary Definitions and Local Semantic Correspondence
Original title (Chinese): 基于局部语义相关性的定义文本义原预测
Citation: DU Jiaju, QI Fanchao, SUN Maosong, LIU Zhiyuan. Lexical Sememe Prediction by Dictionary Definitions and Local Semantic Correspondence[J]. Journal of Chinese Information Processing, 2020, 34(5): 1-9
Authors: DU Jiaju, QI Fanchao, SUN Maosong, LIU Zhiyuan
Affiliations: 1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;
2. Institute of Artificial Intelligence, Tsinghua University, Beijing 100084, China;
3. State Key Laboratory of Intelligent Technology and Systems, Tsinghua University, Beijing 100084, China
Funding: National Natural Science Foundation of China (61661146007)
Abstract: Sememes, defined in linguistics as the minimum semantic units of human languages, have proven useful in many NLP tasks. Because manually constructing and updating sememe knowledge bases (KBs) is costly, automatic sememe prediction has been used to assist sememe annotation. In this paper, we explore how to use dictionary definitions to predict sememes for unannotated words. We find that the sememes of a word are usually semantically related to different words in its dictionary definition, and we call this matching relationship local semantic correspondence. Accordingly, we propose the Sememe Correspondence Pooling (SCorP) model, which captures this kind of matching to predict sememes. Evaluated on HowNet, SCorP achieves state-of-the-art sememe prediction performance, and extensive quantitative analyses further show that it properly learns the local semantic correspondence between sememes and the words in dictionary definitions.

Keywords: sememe prediction; HowNet; semantic relevance
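
The idea of local semantic correspondence can be illustrated with a toy sketch. This is not the authors' SCorP implementation: it assumes each definition word and each sememe is already represented by a small vector, scores every sememe against every definition word with a dot product, and max-pools over the words so that a sememe is predicted when at least one word matches it well. The function name `predict_sememes`, the threshold, and all vectors below are hypothetical and chosen only for illustration.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def predict_sememes(definition_vectors, sememe_vectors, threshold=0.5):
    """Score each sememe by its best-matching definition word (max pooling).

    definition_vectors: list of vectors, one per word in the definition.
    sememe_vectors: dict mapping a sememe name to its vector.
    Both representations are assumptions of this sketch.
    """
    if not definition_vectors:
        raise ValueError("the definition must contain at least one word")

    scores = {}
    for sememe, s_vec in sememe_vectors.items():
        # Local scores: how well this sememe matches each definition word.
        local_scores = [dot(s_vec, w_vec) for w_vec in definition_vectors]
        # Max pooling: one strongly matching word is enough.
        pooled = max(local_scores)
        # Squash the pooled score into a probability-like value.
        scores[sememe] = 1.0 / (1.0 + math.exp(-pooled))

    predicted = sorted(s for s, p in scores.items() if p > threshold)
    return scores, predicted


# Toy data (hypothetical): a two-word definition and two candidate sememes.
definition = [[1.0, 0.0], [0.0, 1.0]]
sememes = {"human": [2.0, 0.0], "tool": [-1.0, -1.0]}

scores, predicted = predict_sememes(definition, sememes)
print(predicted)
```

In this toy example, "human" matches the first definition word strongly and is predicted, while "tool" matches no word and is not. A trained model would learn the word and sememe representations rather than take them as given.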