首页 | 本学科首页   官方微博 | 高级检索  
     

基于CUDA的并行K-近邻连接算法实现
引用本文:潘茜,张育平,陈海燕.基于CUDA的并行K-近邻连接算法实现[J].计算机科学,2016,43(10):190-192, 219.
作者姓名:潘茜  张育平  陈海燕
作者单位:南京航空航天大学计算机科学与技术学院 南京210016,南京航空航天大学计算机科学与技术学院 南京210016,南京航空航天大学计算机科学与技术学院 南京210016
基金项目:本文受国家重点基础研究发展计划(973计划)(2014CB744900),南京航空航天大学研究生创新基地开放基金(KFJJ201460)资助
摘    要:针对大规模空间数据的K-近邻连接查询问题,设计了一种CUDA编程模型下K-近邻连接算法的并行优化方法。将K-近邻连接算法的并行过程分两个阶段:1)对参与查询的数据集P和Q分别建立R-Tree索引;2)基于R-Tree索引进行KNNJ查询。首先根据结点所在位置划分最小外包框,在CUDA下基于递归网格排序算法创建R-Tree索引。然后在CUDA下基于R-Tree索引进行KNNJ查询,其中涉及并行求距离和并行距离排序两个阶段:求距离阶段利用每一个线程计算任意两点之间的距离,点与点之间距离的求取无依赖并行;排序阶段将快速排序基于CUDA以实现并行化。实验结果表明,随着样本量的不断增大,基于R-Tree索引的并行K-近邻连接算法的优势更加明显,具有高效性和可扩展性。

关 键 词:CUDA  K-近邻连接  空间查询  并行计算  R-Tree索引
收稿时间:7/7/2015 12:00:00 AM
修稿时间:2015/12/13 0:00:00

Implementation of Parallel K-Nearest Neighbor Join Algorithm Based on CUDA
PAN Qian,ZHANG Yu-ping and CHEN Hai-yan.Implementation of Parallel K-Nearest Neighbor Join Algorithm Based on CUDA[J].Computer Science,2016,43(10):190-192, 219.
Authors:PAN Qian  ZHANG Yu-ping and CHEN Hai-yan
Affiliation:School of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China,School of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China and School of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China
Abstract:In order to solve the problem of K-nearest neighbor join query in large scale spatial data,a parallel optimization method of K-nearest neighbor join algorithm based on CUDA programming model was designed.The parallel process of K-nearest neighbor join algorithm is divided into two stages.One is to establish the R-Tree index for the data set Q and P participate in the query,and the other is to carry out the KNNJ query based on R-Tree index.Firstly,MBR is created according to the location of nodes,and the R-Tree index is created based on SRT by CUDA.Then,the KNNJ query is made based on the R-Tree index,including parallel computing and parallel sorting.The distance between two points can be calculated by each thread on the parallel,and quicksort is executed in parallel on the CUDA.Experimental results show that with the increase of sample size,the advantages of parallel K-nearest neighbor algorithm are more obvious,which has high efficiency and scalability.
Keywords:CUDA  K-neighbor join  Spatial query  Parallel computing  R-Tree index
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号