首页 | 本学科首页   官方微博 | 高级检索  
     

基于上下文信息的中文命名实体消岐方法研究
引用本文:王旭阳,姜喜秋.基于上下文信息的中文命名实体消岐方法研究[J].计算机应用研究,2018,35(4).
作者姓名:王旭阳  姜喜秋
作者单位:兰州理工大学 计算机与通信学院,中通快递总部
基金项目:国家自然科学(61563030)
摘    要:在语义标注过程中,为了消除文本中给定的命名实体与知识库中实体映射过程中出现的歧义问题,提出了一种基于上下文信息相似度值排序的命名实体消歧方法。消岐方法包括实体表示预处理、候选实体列表构建和相似度值排序算法三部分。针对命名实体指称多样性问题,使用实体表示预处理方法抽取标准实体。然后利用中文在线百科构建语义知识库,得到标准实体的语义列表。同时提出利用相似度值排序方法解决标准实体与语义列表映射的指称歧义性问题,对于在知识库中未找到语义的实体采用HAC聚类算法进行消岐处理。实验结果表明,本文提出的方法能够有效的把中文网页真实数据集中文本的实体映射到知识库中对应无歧义的实体上。

关 键 词:命名实体  语义知识库  聚类  语义列表
收稿时间:2016/12/14 0:00:00
修稿时间:2018/2/27 0:00:00

Chinese Named Entity Disambiguation Method Research Based on Context Information
Wang Xuyang and Jiang Xiqiu.Chinese Named Entity Disambiguation Method Research Based on Context Information[J].Application Research of Computers,2018,35(4).
Authors:Wang Xuyang and Jiang Xiqiu
Affiliation:College of Computer and Communication,LanZhou University of Technology,
Abstract:In the process of semantic annotation, in order to eliminate the ambiguity problem of the text in a given named entity and the mapping of the knowledge base entities.A context based semantic similarity value of the sorted named entity disambiguation method is put forward.Disambiguation method included three sections that entity preprocessing,constructing candidate list of entities and similarity value ranking algorithms. In view of the problem of the named entity reference multiplicity, the new entity was used to represent the preprocess method to extract the standard entity. Then used the online encyclopedia in Chinese to construct the semantic knowledge base, and got the semantic list of standard entities. At the same time, this paper also put forward using the similarity value ranking method for solving standard substance and semantic list mapping referential ambiguity problem, for in the knowledge base not found semantic entity disambiguation processing by clustering algorithm.The results of the experiment show that the proposed method can effectively reflect the real data set of Chinese web pages to the corresponding non-ambiguous entities in the knowledge base.
Keywords:named entity  semantic knowledge base  clustering  semantic list
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号