首页 | 本学科首页   官方微博 | 高级检索  
     


BC-iDistance: an optimized high-dimensional index for KNN processing
Authors:LIANG Jun-jie  FENG Yu-cai
Affiliation:[1]Faculty of Mathematics and Computer Science, Hubei University, Wuhan 430062, China; [2]College of Computer Science and Technology, Huazhong University of Scienec and Technology, Wuhan 430074, China
Abstract:To facilitate high-dimensional KNN queries, based on techniques of approximate vector presentation and one-dimensional transformation, an optimal index is proposed, namely Bit-Code based iDistance ( BC-iDis-tance). To overcome the defect of much information loss for iDistance in one-dimensional transformation, the BC-iDistance adopts a novel representation of compressing a d-dimensional vector into a two-dimensional vector, and employs the concepts of bit code and one-dimensional distance to reflect the location and similarity of the data point relative to the corresponding reference point respectively. By employing the classical B + tree, this representation realizes a two-level pruning process and facilitates the use of a single index structure to further speed up the processing. Experimental evaluations using synthetic data and real data demonstrate that the BC-iDistance outperforms the iDistance and sequential scan for KNN search in high-dimensional spaces.
Keywords:high-dimensional index  KNN seareh  bit code  approximate vector
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号