共查询到20条相似文献,搜索用时 15 毫秒
1.
Lee K.C.K. Baihua Zheng Wang-Chien Lee 《Knowledge and Data Engineering, IEEE Transactions on》2008,20(7):894-910
Given a set of data points P and a query point q in a multidimensional space, reverse nearest neighbor (RNN) query finds data points in P whose nearest neighbors are q. Reverse k-nearest neighbor (RkNN) query (where k ges 1) generalizes RNN query to find data points whose kNNs include q. For RkNN query semantics, q is said to have influence to all those answer data points. The degree of q's influence on a data point p (isin P) is denoted by kappap where q is the kappap-th NN of p. We introduce a new variant of RNN query, namely, ranked reverse nearest neighbor (RRNN) query, that retrieves t data points most influenced by q, i.e., the t data points having the smallest kappa's with respect to q. To answer this RRNN query efficiently, we propose two novel algorithms, kappa-counting and kappa-browsing that are applicable to both monochromatic and bichromatic scenarios and are able to deliver results progressively. Through an extensive performance evaluation, we validate that the two proposed RRNN algorithms are superior to solutions derived from algorithms designed for RkNN query. 相似文献
2.
为解决动态环境中移动点的连续反向最近邻查询问题,将连续反向最近邻查询分为单色和双色2种情况进行研究。利用移动点Voronoi图,分别给出单色连续反向最近邻查询算法、双色连续反向最近邻查询算法以及相关定理,对算法正确性和可终止性进行证明,分析算法时间复杂性。按照移动点Voronoi图的拓扑结构是否改变分为2种情况,分析每种情况下候选所在区域的变化,在变化区域内进行Voronoi图的重构,得到对应的解决方法。在多数情况下,该算法只需生成局部移动点的Voronoi图即可找到结果,减小了连续反向最近邻查询的代价。 相似文献
3.
随着无线通讯技术的发展,移动对象的查询有广阔的应用空间.针对现有反向最近邻算法很多都是基于静态对象的情况,提出了一种新的基于移动对象的反向最近邻的算法--以TPR-tree为索引结构,对原有的半平面修剪策略进行了改进,使其性能优化,并采用过滤验证这两个处理步骤来获取移动查询点的反向最近邻,实现了移动对象的动态反向最近邻的查询. 相似文献
4.
5.
论文提出一种等和值块扩展最近邻矢量量化码字搜索算法。该算法将码书按和值大小排序分块,并将每一块中间或中间附近的码字的和值作为本码书块的特征和值。编码时,查找与输入矢量和值距离最近的码书块并作为初始匹配码书块。然后在该码书块附近上下扩展搜索相邻码书块中距输入矢量最近的码字。该算法具有无复杂运算的特点,易于VLSI技术实现。仿真结果表明,该算法是一种有效的码字搜索算法。 相似文献
6.
随着Wi-Fi、RFID等室内定位技术的发展,产生了越来越多的基于室内空间的位置服务需求。目前已有文献提出了针对室内环境的范围查询和最近邻查询,而双色反向最近邻(bichromatic reverse nearest neighbor,BRNN)查询作为常见的空间查询类型,在室内空间中尚未有相关的研究。为此,提出了基于兴趣点集合的兴趣点融合图模型,并提出了基于路径、基于楼层和基于单元的3种剪枝策略,用于在查询处理时削减搜索空间。在兴趣点融合图和剪枝策略的基础上,提出了室内双色反向最近邻(indoor bichromatic reverse nearest neighbor, IBRNN)查询算法Smart。Smart算法通过对兴趣点融合图中的图元素的检查,从而判断与该图元素关联的移动对象是否有可能属于结果集。最后通过实验,对所提算法的有效性和高效性进行了验证。 相似文献
7.
8.
Batch Nearest Neighbor Search for Video Retrieval 总被引:2,自引:0,他引:2
Jie Shao Zi Huang Heng Tao Shen Xiaofang Zhou Ee-Peng Lim Yijun Li 《Multimedia, IEEE Transactions on》2008,10(3):409-420
To retrieve similar videos to a query clip from a large database, each video is often represented by a sequence of high- dimensional feature vectors. Typically, given a query video containing m feature vectors, an independent nearest neighbor (NN) search for each feature vector is often first performed. After completing all the NN searches, an overall similarity is then computed, i.e., a single content-based video retrieval usually involves m individual NN searches. Since normally nearby feature vectors in a video are similar, a large number of expensive random disk accesses are expected to repeatedly occur, which crucially affects the overall query performance. Batch nearest neighbor (BNN) search is stated as a batch operation that performs a number of individual NN searches. This paper presents a novel approach towards efficient high-dimensional BNN search called dynamic query ordering (DQO) for advanced optimizations of both I/O and CPU costs. Observing the overlapped candidates (or search space) of a pervious query may help to further reduce the candidate sets of subsequent queries, DQO aims at progressively finding a query order such that the common candidates among queries are fully utilized to maximally reduce the total number of candidates. Modelling the candidate set relationship of queries by a candidate overlapping graph (COG), DQO iteratively selects the next query to be executed based on its estimated pruning power to the rest of queries with the dynamically updated COG. Extensive experiments are conducted on real video datasets and show the significance of our BNN query processing strategy. 相似文献
9.
《计算机科学与探索》2017,(6):908-920
为了弥补现有的研究成果无法有效地处理路网环境下基于线段的反k最近邻问题的不足,提出了在路网环境下线段反k最近邻查询方法。该查询方法主要应用于评估查询对象的影响范围。根据路网及Voronoi图的特点提出了网络线段Voronoi图的概念。在静态数据集情况下利用网络线段Voronoi图的性质提出了STA_RVLRk NN算法,查询包括过滤过程和精炼过程两大部分。进一步,在动态数据集的情况下提出了DYN_RVLRk NN算法,查询分为空间线段对象增加和删除两种情况,并对不同的情况给出了相应的算法,得到查询结果集。理论研究和实验表明,所提算法能有效地处理路网中基于线段的反k最近邻问题。 相似文献
10.
反向最近邻查询是空间数据库空间查询的研究热点。目前反向最近邻查询的查询粒度都是基于一维的点,在一些空间物体不能抽象为点的情况下将其抽象为点进行反向最近邻查询,查询结果不能达到一定的精度。该文在分析基于平面线段的最近邻查询和R树结构的基础上提出了一种改进的R树—Rcd树,并给出了基于Rcd树的平面线段反向最近邻查询算法,该方法能实现平面线段的反向最近邻查询。 相似文献
11.
World Wide Web - Reverse k Nearest Neighbor (RkNN) queries retrieve all objects that consider the query as one of their k most influential objects. Given a set of user U, a set of facilities F and... 相似文献
12.
反向最近邻查询是空间数据库空间查询的研究热点。目前反向最近邻查询的查询粒度都是基于一维的点.在一些空间物体不能抽象为点的情况下将其抽象为点进行反向最近邻查询,查询结果不能达到一定的精度。该文在分析基于平面线段的最近邻查询和R树结构的基础上提出了一种改进的R树-Rcd树,并给出了基于Rcd树的平面线段反向最近邻查询算法.该方法能实现平面线段的反向最近邻查询。 相似文献
13.
Many data centers have archived a tremendous amount of data and begun to publish them on the Web. Due to limited resources
and large amount of service requests, data centers usually do not directly support high-cost queries. On the other hand, users
are often overwhelmed by the huge data volume and cannot afford to download the whole data sets and search them locally. To
support high-dimensional nearest neighbor searches in this environment, the paper develops a multi-level approximation scheme.
The coarsest-level approximations are stored locally and searched first. The result is then refined gradually via accesses
to remote data centers. Data centers need only to deliver data items or their precomputed finer level approximations by their
identifiers.
The searching process is usually long in this environment, since it involves remote sites. This paper describes an online
search process: the system periodically reports a data item and a positive integer M. The reported item is guaranteed to be one of the M nearest neighbors of the query one. The paper proposes two algorithms to minimize M in each period. Experiments show that one of them performs similarly as a theoretical a posteriori algorithm and significantly
outperforms the online extensions of two state-of-the-art nearest neighbor search methods.
Received 25 July 2000 / Revised 25 July 2001 / Accepted in revised form 16 October 2001
Correspondence and offprint requests to: Xiaoyang Sean Wang, Department of Information and Software Engineering, George Mason University, Fairfax, VA 22030, USA.
Email: xywang@gmu.eduau 相似文献
14.
本文针对大规模高维数据近邻检索中的瓶颈问题,提出基于向量量化的一种检索方法—簇内乘积量化树方法.该方法运用向量量化和乘积量化的多层树状结构高效表征大规模高维数据集,与现有方法相比降低了索引表空桶率;其次提出基于贪心队列的近邻簇筛选方法减小了计算复杂度,加快了近邻检索速度;最后提出面量化方法用于近似计算候选数据集向量与查询向量间的距离,与点量化和线量化方法相比量化误差更小,提高了近邻查询准确率.本文提出的簇内乘积量化树算法在算子Sift和Gist描述的大规模高维数据集上与乘积量化树技术相比,首次召回准确率提高了57.7%,索引表空桶率降低幅度在50%以上,与局部优化乘积量化技术相比,查全率高达97%,而查询时间却仅需原来的1/9.实验结果表明本文提出的基于簇内乘积量化的近邻方法提升了近邻检索性能,为大规模高维数据集近邻检索提供了理论支持. 相似文献
15.
Nearest Neighbor (NN) search has been in the core of spatial and spatiotemporal database research during the last decade.
The literature on NN query processing algorithms so far deals with either stationary or moving query points over static datasets
or future (predicted) locations over a set of continuously moving points. With the increasing number of Mobile Location Services
(MLS), the need for effective k-NN query processing over historical trajectory data has become the vehicle for data analysis, thus improving existing or
even proposing new services. In this paper, we investigate mechanisms to perform NN search on R-tree-like structures storing
historical information about moving object trajectories. The proposed (depth-first and best-first) algorithms vary with respect
to the type of the query object (stationary or moving point) as well as the type of the query result (historical continuous
or not), thus resulting in four types of NN queries. We also propose novel metrics to support our search ordering and pruning
strategies. Using the implementation of the proposed algorithms on two members of the R-tree family for trajectory data (namely,
the TB-tree and the 3D-R-tree), we demonstrate their scalability and efficiency through an extensive experimental study using
large synthetic and real datasets.
相似文献
Yannis Theodoridis (Corresponding author)Email: URL: http://dke.cti.gr http://isl.cs.unipi.gr/db |
16.
17.
基于VAR树的反向最近邻查询技术的研究 总被引:1,自引:0,他引:1
在空间数据库中,反向最近邻查询技术是最重要的查询技术之一,它是在最近邻查询技术的基础上提出的,如何有效地实现反向最近邻查询一直是人们研究的热点.以往都是基于类似R树索引结构的查询,在高维的情况下,使查询的速度急剧下降,形成"维数灾难".因此引用了一种新的索引结构--VAR树,并对VAR树进行了改进,引进了性能优越的SR树,并给出了基于这种索引结构的最近邻和反最近邻查询的算法.经实验验证基于VAR树的反向最近邻查询算法,在高维空间中的查询效率有了较大的提高. 相似文献
18.
Reverse Nearest Neighbors Search in Ad Hoc Subspaces 总被引:1,自引:0,他引:1
Man Lung Yiu Nikos Mamoulis 《Knowledge and Data Engineering, IEEE Transactions on》2007,19(3):412-426
Given an object q, modeled by a multidimensional point, a reverse nearest neighbors (RNN) query returns the set of objects in the database that have q as their nearest neighbor. In this paper, we study an interesting generalization of the RNN query, where not all dimensions are considered, but only an ad hoc subset thereof. The rationale is that 1) the dimensionality might be too high for the result of a regular RNN query to be useful, 2) missing values may implicitly define a meaningful subspace for RNN retrieval, and 3) analysts may be interested in the query results only for a set of (ad hoc) problem dimensions (i.e., object attributes). We consider a suitable storage scheme and develop appropriate algorithms for projected RNN queries, without relying on multidimensional indexes. Given the significant cost difference between random and sequential data accesses, our algorithms are based on applying sequential accesses only on the projected atomic values of the data at each dimension, to progressively derive a set of RNN candidates. Whether these candidates are actual RNN results is then validated via an optimized refinement step. In addition, we study variants of the projected RNN problem, including RkNN search, bichromatic RNN, and RNN retrieval for the case where sequential accesses are not possible. Our methods are experimentally evaluated with real and synthetic data 相似文献
19.
Charu C. Aggarwal 《Data mining and knowledge discovery》2006,13(1):89-117
Nearest Neighbor search is an important and widely used technique in a number of important application domains. In many of
these domains, the dimensionality of the data representation is often very high. Recent theoretical results have shown that
the concept of proximity or nearest neighbors may not be very meaningful for the high dimensional case. Therefore, it is often
a complex problem to find good quality nearest neighbors in such data sets. Furthermore, it is also difficult to judge the
value and relevance of the returned results. In fact, it is hard for any fully automated system to satisfy a user about the
quality of the nearest neighbors found unless he is directly involved in the process. This is especially the case for high
dimensional data in which the meaningfulness of the nearest neighbors found is questionable. In this paper, we address the
complex problem of high dimensional nearest neighbor search from the user perspective by designing a system which uses effective
cooperation between the human and the computer. The system provides the user with visual representations of carefully chosen
subspaces of the data in order to repeatedly elicit his preferences about the data patterns which are most closely related
to the query point. These preferences are used in order to determine and quantify the meaningfulness of the nearest neighbors.
Our system is not only able to find and quantify the meaningfulness of the nearest neighbors, but is also able to diagnose
situations in which the nearest neighbors found are truly not meaningful.
相似文献
Charu C. AggarwalEmail: |
20.
近年来,基于位置的服务获得了越来越广泛的关注,其中最近邻查询是最常用的一种查询方式.测量手段的不准确性以及数据本身的性质导致不确定性在位置数据中普遍存在,这种不确定性会对最近邻查询结果产生影响.空间中障碍物的存在也给空间数据查询带来了挑战.文中研究存在障碍物的空间中不确定对象连续最近邻查询的处理方法,设计了一种剪枝策略大幅降低需要计算的不确定对象数目,并进一步提出了障碍空间中不确定对象最近邻查询安全区域的概念及安全区域生成算法.设计了安全区域的索引存储方法.实验结果表明,文章所提出的方法具有良好的效率和可扩展性. 相似文献