首页 | 本学科首页   官方微博 | 高级检索  
     

一种支持高效并行处理的矢量数据索引方法
引用本文:褚龙现,李晓英,陈 旭,楚纯洁.一种支持高效并行处理的矢量数据索引方法[J].计算机工程与应用,2017,53(11):79-84.
作者姓名:褚龙现  李晓英  陈 旭  楚纯洁
作者单位:1.平顶山学院 软件学院,河南 平顶山 467000 2.桂林理工大学 南宁分校,南宁 530001 3.武汉大学 软件工程国家重点实验室,武汉 430072 4.平顶山学院 资源与环境科学学院,河南 平顶山 467000
摘    要:分析了HBase的存储模型和Spark的并行处理机制,提出一种矢量空间数据的分布式存储、索引和并行区域查询方法。设计了基于空间对象中心点的行键存储方案,将中心点的Hilbert编码与经纬度小数位结合实现行键的唯一性,保证地理位置接近的要素在表中存储在相邻的行。实现了基于Spark的空间索引并行构建和区域查询方法,借助空间对象中心点的Hilbert编码快速构建索引,通过多边形区域的最小外接矩形过滤查询结果。实验结果表明,索引并行构建可靠性好速度快,区域查询并行处理算法可行且效率高。

关 键 词:spark  hilbert  矢量数据  空间索引  分布式存储  

Vector data index method supporting efficient parallel compute
CHU Longxian,LI Xiaoying,CHEN Xu,CHU Chunjie.Vector data index method supporting efficient parallel compute[J].Computer Engineering and Applications,2017,53(11):79-84.
Authors:CHU Longxian  LI Xiaoying  CHEN Xu  CHU Chunjie
Affiliation:1.School of Software, Pingdingshan University, Pingdingshan, Henan 467000, China 2.Campus of Nanning, Guilin University of Technology, Nanning 530001, China 3.State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, China 4.School of Resources and Environmental Science, Pingdingshan University, Pingdingshan, Henan 467000, China
Abstract:By analyzing the HBase storage model and the parallel compute mechanism of Spark, a distributed storage, index and parallel regional query method of vector spatial data is proposed. A row key storage scheme which combines the Hilbert code of central point and decimal place of longitude and latitude is designed. This scheme reaches the uniqueness of row key and guarantees the effect that the most nearest elements in geographical position are stored in the adjacent rows. A spatial index parallel build and regional query method based on Spark is realized, which generates index quickly by using the Hilbert code of spatial central points, and filters the query result by the minimum bounding rectangle of polygon regions. Simulation results show that the parallel build of index is reliability and fast, and the parallel compute algorithm based on regional query is feasible and efficient.
Keywords:hilbert  vector data  spatial index  distributed storage  
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号