首页 | 本学科首页   官方微博 | 高级检索  
     

一种准确而高效的领域知识图谱构建方法
引用本文:杨玉基,许斌,胡家威,仝美涵,张鹏,郑莉. 一种准确而高效的领域知识图谱构建方法[J]. 软件学报, 2018, 29(10): 2931-2947
作者姓名:杨玉基  许斌  胡家威  仝美涵  张鹏  郑莉
作者单位:清华大学 计算机科学与技术系知识工程实验室, 北京 海淀 100084,清华大学 计算机科学与技术系知识工程实验室, 北京 海淀 100084,清华大学 计算机科学与技术系知识工程实验室, 北京 海淀 100084,清华大学 计算机科学与技术系知识工程实验室, 北京 海淀 100084,清华大学 计算机科学与技术系知识工程实验室, 北京 海淀 100084,清华大学 计算机科学与技术系知识工程实验室, 北京 海淀 100084
基金项目:国家高技术研究发展计划(863)
摘    要:作为语义网的数据支撑,知识图谱在知识问答、语义搜索等领域起着至关重要的作用,一直以来也是研究领域和工程领域的一个热点问题,但是构建一个质量较高、规模较大的知识图谱往往需要花费巨大的人力和时间成本.如何平衡准确率和效率,快速地构建出一个高质量的领域知识图谱,是知识工程领域的一个重要挑战.本文对领域知识图谱构建方法做了系统研究,提出了一种准确高效的领域知识图谱构建方法——“四步法”,我们将此方法应用到中国基础教育九门学科知识图谱的构建中,在较短时间构建出了准确率较高的学科知识图谱,证明了该方法构建领域知识图谱的有效性.以地理学科知识图谱为例,使用“四步法”共得到67万个实例,1421万条三元组,其中标注数据的学科知识覆盖率和知识准确率均在99%以上.

关 键 词:语义网  知识图谱  本体  语义标注  实体集扩充  关系抽取
收稿时间:2017-07-22
修稿时间:2017-11-08

Accurate and Efficient Method for Constructing Domain Knowledge Graph
YANG Yu-Ji,XU Bin,HU Jia-Wei,TONG Mei-Han,ZHANG Peng and ZHENG Li. Accurate and Efficient Method for Constructing Domain Knowledge Graph[J]. Journal of Software, 2018, 29(10): 2931-2947
Authors:YANG Yu-Ji  XU Bin  HU Jia-Wei  TONG Mei-Han  ZHANG Peng  ZHENG Li
Affiliation:Knowledge Engineering Group, Tsinghua University, Beijing 100084, China,Knowledge Engineering Group, Tsinghua University, Beijing 100084, China,Knowledge Engineering Group, Tsinghua University, Beijing 100084, China,Knowledge Engineering Group, Tsinghua University, Beijing 100084, China,Knowledge Engineering Group, Tsinghua University, Beijing 100084, China and Knowledge Engineering Group, Tsinghua University, Beijing 100084, China
Abstract:As the supporting data of semantic web, knowledge graphs have played a vital role in knowledge QA, semantic search and so on. Therefore, they have been a hot topicin the field of research and engineering. However, it is often costly to build a large-scale knowledge graph withhigh accuracy. How to balance the accuracy and efficiency, and quickly build a high-quality domain knowledge graph, is a big challenge in the field of knowledge engineering. This paper makes a systematic study on the construction of domain knowledge graphs, and puts forward an accurate and efficient method of constructing domain knowledge graphs, "Four-steps". We have applied this method to the construction of knowledge graphs of nine subjects in the Chinese k12 education, and developed the nine subject knowledge graphs with high accuracy, which demonstrates that our method is effective. For example, the geographical knowledge graph, which is constructed using the "Four-steps" method, has got 670 thousand instances and 14.21 million triples. And as part of it, the annotation data''s knowledge coverage and knowledge accuracyare both above 99%.
Keywords:semantic web  knowledge graph  ontology  semantic annotation  entity set expansion  relation extraction
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号