A Method of Semantic Web Data Division and Parallel Loading Based on OWL
Citation: CHENG Jia, ZHU Min, BAI Wen-yang. A Method of Semantic Web Data Division and Parallel Loading Based on OWL[J]. Microcomputer Development, 2014(2): 19-24.
Authors: CHENG Jia, ZHU Min, BAI Wen-yang
Affiliations: [1] State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, Jiangsu, China; [2] Department of Computer Science and Technology, Nanjing University, Nanjing 210023, Jiangsu, China
Funding: National Social Science Foundation of China (11AZD121); National High Technology Research and Development Program of China ("863" Program) (2011AA01A202)
Abstract: With the explosive growth of semantic Web data, storing and retrieving massive data sets faces increasingly severe challenges, and distributed databases and parallel computing have become the main solutions. This paper designs a multi-table storage model for semantic Web data on HBase, a column-oriented distributed database, and defines a mapping from OWL ontology definitions to that storage model. Using the OWL ontology definitions, the semantic Web data is partitioned by class, and each triple is stored in the two tables of the class to which its subject belongs. The partitioning and loading tasks run in parallel on the MapReduce framework. Finally, the feasibility of the method is verified on a Hadoop cluster.

Keywords: semantic Web; Web Ontology Language (OWL); column store; parallel computing
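
The partition-by-class step described in the abstract can be illustrated with a small in-memory sketch. This is not the authors' implementation: it uses plain Python in place of Hadoop and HBase, the function names (`build_class_index`, `map_phase`, `reduce_phase`) are hypothetical, and the abstract does not say how the two tables of a class differ, so the subject-keyed `_SP` table and object-keyed `_OP` table below are assumptions made only for illustration.

```python
from collections import defaultdict

RDF_TYPE = "rdf:type"


def build_class_index(triples):
    """Map each subject to a class taken from its rdf:type triple.

    Simplification: if a subject has several types, the last one wins.
    """
    return {s: o for s, p, o in triples if p == RDF_TYPE}


def map_phase(triples, class_of):
    """Emit (class, triple) pairs, playing the role of a MapReduce mapper."""
    for s, p, o in triples:
        # Subjects without a declared type fall back to owl:Thing (assumption).
        yield class_of.get(s, "owl:Thing"), (s, p, o)


def reduce_phase(pairs):
    """Write each triple into the two tables of its subject's class."""
    tables = defaultdict(dict)
    for cls, (s, p, o) in pairs:
        # Assumed layout: one table keyed by subject, one keyed by object.
        tables[f"{cls}_SP"].setdefault(s, {}).setdefault(p, []).append(o)
        tables[f"{cls}_OP"].setdefault(o, {}).setdefault(p, []).append(s)
    return dict(tables)
```

Calling `reduce_phase(map_phase(triples, build_class_index(triples)))` on a list of `(subject, predicate, object)` tuples returns a dictionary of nested dictionaries, one pair of tables per class. In a real deployment the grouping would be done by the MapReduce shuffle and the writes would go to HBase tables.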
This article is indexed in VIP and other databases.