Similar literature
1.
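Density-based detection of outlier noise points   Total citations: 1, self-citations: 0, citations by others: 1
Zhang Yi (张毅), Liu Xumin (刘旭敏), Guan Yong (关永). 《计算机应用》 (Journal of Computer Applications), 2010, 30(3): 802-805
For point cloud data acquired by 3D scanners that contain noise and outliers, a denoising algorithm based on the concept of local outliers is proposed. A k-nearest neighbor (KNN) search establishes the topological relationships among the scattered points; the local outlier factor of each measured point is then computed to quantify how far it deviates from its neighbors, so that noise is suppressed and outliers are removed. The work focuses on identifying low-density outlier noise points distributed around high-density scanned point clouds. Experimental results show that the algorithm effectively detects noise points lying immediately next to the model boundary while preserving the model boundary as far as possible.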
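The local outlier factor that this abstract relies on can be illustrated with a minimal brute-force sketch. It follows the standard LOF definition (k-distance, reachability distance, local reachability density), not the authors' point-cloud implementation; the function names `knn` and `lof_scores` are hypothetical, and the sketch assumes the input contains no duplicate points.

```python
import math

def knn(points, i, k):
    """Return the k nearest neighbours of points[i] as (distance, index) pairs."""
    dists = sorted(
        (math.dist(points[i], q), j) for j, q in enumerate(points) if j != i
    )
    return dists[:k]

def lof_scores(points, k):
    """Brute-force local outlier factor for every point (assumes distinct points)."""
    n = len(points)
    neigh = [knn(points, i, k) for i in range(n)]
    k_dist = [nb[-1][0] for nb in neigh]  # distance to the k-th neighbour

    # Local reachability density: inverse of the mean reachability distance.
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], d) for d, j in neigh[i]]
        lrd.append(len(reach) / sum(reach))

    # LOF: mean ratio of the neighbours' density to the point's own density.
    return [
        sum(lrd[j] for _, j in neigh[i]) / (len(neigh[i]) * lrd[i])
        for i in range(n)
    ]
```

Scores close to 1 indicate a point whose density matches that of its neighbours; a point far from a dense cluster receives a score well above 1.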

2.
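Yang Hao (杨昊), Chen Leiting (陈雷霆), Qiu Hang (邱航). 《计算机应用》 (Journal of Computer Applications), 2016, 36(10): 2826-2831
When removing random-valued impulse noise (RVIN) from images, order-statistics-based denoising algorithms identify noise inaccurately at edges and in detailed regions and restore those regions with blur. To address this, WSLOM-EPR, an impulse noise denoising algorithm based on the weighted spatial local outlier measure (SLOM), is proposed. Building on an optimized spatial distance difference, the algorithm introduces the mean and standard deviation of the image neighborhood to construct a noise detection method that reflects local edge and detail features, improving noise identification accuracy at edges and details. Using the precise detection results, it then optimizes the edge-preserving regularization (EPR) function, which improves execution efficiency and strengthens the algorithm's ability to retain edge details. Simulation results show that at noise densities of 40% to 60%, WSLOM-EPR outperforms the compared algorithms in its combined false-detection and missed-detection performance and keeps a good balance between the two; the peak signal-to-noise ratio (PSNR) after denoising is better than that of the compared algorithms in most cases, and edge details are visually clearer and more continuous. The results indicate that WSLOM-EPR improves noise detection accuracy and effectively preserves the edge detail information of the restored image.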

3.
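LDC-mine: an outlier mining algorithm based on the local deviation coefficient   Total citations: 1, self-citations: 0, citations by others: 1
Outlier detection has long been an active area in knowledge discovery (KDD), with applications such as credit card fraud detection and intrusion detection. In these application areas, studying the anomalous behavior of outliers can uncover more valuable knowledge hidden in the data set. A new measure, the LDC (local deviation coefficient) factor, and an LDC-based outlier mining algorithm, LDC-mine, are proposed. Experiments show that the algorithm detects outliers effectively.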

4.
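Improvement and implementation of a local spatial outlier algorithm   Total citations: 1, self-citations: 0, citations by others: 1
LOF is a well-known method for finding local outliers; it assigns each spatial point a value that characterizes its degree of deviation. However, LOF suffers from low efficiency and poor performance, so the algorithm is improved in two respects. First, a two-step improvement that reduces the algorithm's time complexity is proposed, and the time complexity of both steps is analyzed in detail. Second, the algorithm is made to consider non-spatial attributes as well as spatial attributes when searching for local outliers. In addition, experiments measure the time efficiency of LOF and of the improved method, and their effectiveness at finding outliers on both simulated and real data. The experimental results show that the improved method has better time efficiency and performance.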

5.
In what follows, we introduce the notion of representational information (information conveyed by sets of dimensionally defined objects about their superset of origin) as well as an original deterministic mathematical framework for its analysis and measurement. The framework, based in part on categorical invariance theory [30], unifies three key constructs of universal science: invariance, complexity, and information. From this unification we define the amount of information that a well-defined set of objects R carries about its finite superset of origin S, as the rate of change in the structural complexity of S (as determined by its degree of categorical invariance), whenever the objects in R are removed from the set S. The measure captures deterministically the significant role that context and category structure play in determining the relative quantity and quality of subjective information conveyed by particular objects in multi-object stimuli.

6.
Spatial indexing on flash-based Solid State Drives (SSDs) has become a core aspect in spatial database applications, and has been carried out by flash-aware spatial indices. Although there are some flash-aware spatial indices proposed in the literature, they do not exploit all the benefits of SSDs, leading to loss of efficiency and durability. In this article, we propose eFIND, a new generic and efficient framework for flash-aware spatial indexing. eFIND takes into account the intrinsic characteristics of SSDs by employing (i) a write buffer to avoid expensive random writes, (ii) a flushing algorithm that smartly picks modifications to be flushed in batch to the SSD, (iii) a read buffer to decrease the overhead of random reads, (iv) a temporal control to avoid interleaved reads and writes, and (v) a log-structured approach to provide data durability. Performance tests showed the efficiency of eFIND. Compared to the state of the art, eFIND improved the construction of spatial indices from 43% to 77%, and the spatial query processing from 4% to 23%.

7.
Authenticated indexing for outsourced spatial databases   Total citations: 1, self-citations: 0, citations by others: 1
In spatial database outsourcing, a data owner delegates its data management tasks to a location-based service (LBS), which indexes the data with an authenticated data structure (ADS). The LBS receives queries (ranges, nearest neighbors) originating from several clients/subscribers. Each query initiates the computation of a verification object (VO) based on the ADS. The VO is returned to the client that can verify the result correctness using the public key of the owner. Our first contribution is the MR-tree, a space-efficient ADS that supports fast query processing and verification. Our second contribution is the MR*-tree, a modified version of the MR-tree, which significantly reduces the VO size through a novel embedding technique. Finally, whereas most ADSs must be constructed and maintained by the owner, we outsource the MR- and MR*-tree construction and maintenance to the LBS, thus relieving the owner from this computationally intensive task.

8.
Spatial databases are essential to applications in a wide variety of domains. One of the main privacy concerns when answering statistical queries, such as range counting queries, over a spatial database is that an adversary observing changes in query answers may be able to determine whether or not a particular geometric object is present in the database. Differential privacy addresses this concern by guaranteeing that the presence or absence of a geometric object has little effect on query answers. Most of the current differentially private mechanisms for spatial databases ignore the fact that privacy is personal and, thus, provide the same privacy protection for all geometric objects. However, some particular geometric objects may be more sensitive to privacy issues than others, requiring stronger differential privacy guarantees. In this paper, we introduce the concept of spatial personalized differential privacy for spatial databases where different geometric objects have different privacy protection requirements. Also, we present SPDP-PCE, a novel spatial personalized differentially private mechanism to answer range counting queries over spatial databases that fully considers the privacy protection requirements of geometric objects in the underlying geometric space in both steps of noise addition and consistency enforcement. Our experimental results on real datasets demonstrate the effectiveness of SPDP-PCE under various total privacy budgets, query shapes, and privacy level distributions.

9.
Haibin Sun. 《Knowledge》, 2009, 22(6): 403-409
The problem of spatial configuration information retrieval is a constraint satisfaction problem (CSP), which can be solved using traditional CSP algorithms. But the spatial data can be reorganized using index techniques like R-tree and the spatial data are approximated by their minimum bounding rectangles (MBRs), so the spatial configuration information retrieval is actually based on the MBRs and some special techniques can be studied. This paper studies the mapping relationships among the spatial relations for real spatial objects, the corresponding spatial relations for their MBRs and the corresponding spatial relations between the intermediate nodes and the MBRs in R-tree. Three algorithms are designed and studied, and their performances are compared.

10.
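IncLOF: an incremental algorithm for mining local outliers in dynamic environments   Total citations: 12, self-citations: 1, citations by others: 12
Outlier detection is one of the most fundamental problems in data mining, with wide application in fraud detection, loan approval, weather forecasting, customer classification, and other areas. Earlier outlier detection algorithms are suited only to static environments and must recompute everything when the data are updated. Building on the density-based local outlier detection algorithm LOF, IncLOF, an incremental algorithm for mining local outliers in dynamic environments, is proposed. When data in the database are updated, only the affected points are recomputed, which greatly speeds up outlier mining. Experiments show that in a dynamic environment the running time of IncLOF is far less than that of LOF, and that the smaller the ratio of the user-defined minimum number of objects in the neighborhood to the number of records, the more pronounced the effect.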

11.
A new local control spline based on shape parameters with G^3 continuity, called BLC-spline, is proposed. Not only is BLC-spline very smooth, but the spline curve's characteristic polygon also has only three control vertices, and the characteristic polyhedron has only nine control vertices. The local control behavior of BLC-spline is better than that of other splines such as cubic Bezier, B-spline, and Beta-spline. The three shape parameters β0, β1 and β2 of BLC-spline, which are independent of the control vertices, may be altered to change the shape of the curve or surface. It is shown that BLC-spline may be used to construct a space arc spline for DNC machining directly. It is a powerful tool for the design and manufacture of curves and surfaces in integrated CAD/CAM systems.

12.
A fuzzy clustering problem consists of assigning a set of patterns to a given number of clusters with respect to some criteria such that each of them may belong to more than one cluster with different degrees of membership. In order to solve it, we first propose a new local search heuristic, called Fuzzy J-Means, where the neighbourhood is defined by all possible centroid-to-pattern relocations. The “integer” solution is then moved to a continuous one by an alternate step, i.e., by finding centroids and membership degrees for all patterns and clusters. To alleviate the difficulty of being stuck in local minima of poor value, this local search is then embedded into the Variable Neighbourhood Search metaheuristic. Results on five standard test problems from the literature are reported and compared with those obtained with the well-known Fuzzy C-Means heuristic. It appears that solutions of substantially better quality are obtained with the proposed methods than with this former one.
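The alternate step named in this abstract (finding centroids and membership degrees in turn) is the core of the Fuzzy C-Means heuristic it compares against. The following is a minimal sketch of one such alternation using the standard Fuzzy C-Means update formulas; it is not the Fuzzy J-Means relocation search itself, and the function names and the fuzzifier default `m=2.0` are assumptions made for illustration.

```python
import math

def memberships(points, centroids, m=2.0):
    """Fuzzy C-Means membership degrees u[j][i] of point j in cluster i."""
    exponent = 2.0 / (m - 1.0)
    u = []
    for x in points:
        d = [math.dist(x, c) for c in centroids]
        if 0.0 in d:
            # The point coincides with a centroid: give it crisp membership.
            hits = d.count(0.0)
            u.append([1.0 / hits if di == 0.0 else 0.0 for di in d])
            continue
        u.append([1.0 / sum((di / dk) ** exponent for dk in d) for di in d])
    return u

def update_centroids(points, u, m=2.0):
    """Centroids as means of the points weighted by membership ** m."""
    dim = len(points[0])
    centroids = []
    for i in range(len(u[0])):
        w = [row[i] ** m for row in u]
        total = sum(w)
        centroids.append(
            tuple(sum(wj * x[d] for wj, x in zip(w, points)) / total
                  for d in range(dim))
        )
    return centroids
```

Calling `memberships` and `update_centroids` repeatedly until the centroids stop moving gives the usual Fuzzy C-Means iteration; each row of the membership matrix sums to 1.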

13.
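To address the difficulty of choosing parameters when constructing neighborhoods and the high time complexity of processing high-dimensional data in traditional spatial outlier detection algorithms, a spatial outlier detection algorithm based on geostatistics is proposed. The algorithm introduces spatial autocorrelation theory into spatial outlier detection: it first identifies global outliers with the 3σ rule, then constructs spatial neighborhoods using a Delaunay triangulation and replaces each global outlier with the mean of its neighborhood nodes, and finally uses the local Moran's I as the measure of spatial anomaly. Simulation results show that the method requires no parameter selection, is robust, and achieves a high detection rate with a low false alarm rate.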
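Two of the ingredients named in this abstract, the 3σ rule and the local Moran's I, can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' algorithm: the neighbour lists are passed in by hand instead of being built from a Delaunay triangulation, the weights are row-standardised, the replacement of global outliers by the neighbourhood mean is omitted, and both function names are hypothetical.

```python
import statistics

def three_sigma_outliers(values):
    """Indices whose value lies more than 3 standard deviations from the mean."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)
    return [i for i, v in enumerate(values) if abs(v - mu) > 3 * sigma]

def local_morans_i(values, neighbours):
    """Local Moran's I with row-standardised weights over given neighbour lists."""
    n = len(values)
    mu = statistics.fmean(values)
    z = [v - mu for v in values]
    m2 = sum(zi * zi for zi in z) / n  # second moment of the deviations
    return [
        z[i] / m2 * statistics.fmean(z[j] for j in neighbours[i])
        for i in range(n)
    ]
```

A negative local Moran's I marks a site whose value differs in sign from the average deviation of its neighbours, which is the pattern a spatial outlier produces; positive values indicate a site that resembles its surroundings.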

14.
This paper investigates an improved local update scheme for cellular automata (CA) applied to structural design. Local analysis and design rules are derived for equilibrium and minimum compliance design. The new update scheme consists of repeating analysis and optimality-based design rules locally. The benefits of this approach are demonstrated through a series of systematic experiments. Truss topology design problems of various sizes are used based on the Gauss–Seidel and the Jacobi iteration modes. Experiments show the robust convergence of the approach as compared to an earlier CA implementation. The approach is also extended to a plate problem.

15.
Besides traditional domains (e.g., resource allocation, data mining applications), algorithms for medoid computation and related problems will play an important role in numerous emerging fields, such as location based services and sensor networks. Since the k-medoid problem is NP-hard, all existing work deals with approximate solutions on relatively small datasets. This paper aims at efficient methods for very large spatial databases, motivated by: (1) the high and ever increasing availability of spatial data, and (2) the need for novel query types and improved services. The proposed solutions exploit the intrinsic grouping properties of a data partition index in order to read only a small part of the dataset. Compared to previous approaches, we achieve results of comparable or better quality at a small fraction of the CPU and I/O costs (seconds as opposed to hours, and tens of node accesses instead of thousands). In addition, we study medoid-aggregate queries, where k is not known in advance, but we are asked to compute a medoid set that leads to an average distance close to a user-specified value. Similarly, medoid-optimization queries aim at minimizing both the number of medoids k and the average distance. We also consider the max version for the aforementioned problems, where the goal is to minimize the maximum (instead of the average) distance between any object and its closest medoid. Finally, we investigate bichromatic and weighted medoid versions for all query types, as well as maximum capacity and dynamic medoids.

16.
Owing to various manufacturing strategies and demands of markets, chemical processes often involve multiple operating modes. How to identify modes from multimode process data collected under both stable and transitional modes is an important issue. This paper proposes a novel mode identification algorithm, recursive local outlier factor (RLOF), based on the sequential information in the time scale and the density information in the spatial scale. In this algorithm, not only does the number of modes not need to be determined in advance, but details of mode switching can also be acquired. In addition, the principal components (PCs) chosen by the variance of the overall dataset in principal component analysis (PCA) cannot guarantee that all variables express information as completely as possible. Using the defined cumulative percent expression (CPE), this study chooses key PCs (KPCs) according to each variable. Moreover, fault diagnosis is realized via the contribution of every variable to key PCs. Finally, the monitoring performance is evaluated under the Tennessee Eastman (TE) benchmark and the continuous stirred tank reactor (CSTR) process.

17.
A new measure of consistency for positive reciprocal matrices   Total citations: 4, self-citations: 0, citations by others: 4
The analytic hierarchy process (AHP) provides a decision maker with a way of examining the consistency of entries in a pairwise comparison matrix and the hierarchy as a whole through the consistency ratio measure. It has always seemed to us that this commonly used measure could be improved upon. The purpose of this paper is to present an alternative consistency measure and demonstrate how it might be applied in different types of matrices.
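For context, the commonly used consistency ratio that this abstract sets out to improve upon can be sketched as follows. The code computes the classical measure CR = CI / RI with CI = (λmax − n) / (n − 1); it does not implement the paper's alternative measure, which the abstract does not specify. The function names are hypothetical, the principal eigenvalue is estimated by power iteration, and the random index values are the commonly tabulated ones for n = 3 to 9.

```python
# Commonly tabulated random index (RI) values, keyed by matrix size n.
RANDOM_INDEX = {3: 0.58, 4: 0.90, 5: 1.12, 6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}

def principal_eigenvalue(a, iters=100):
    """Estimate the largest eigenvalue of a positive matrix by power iteration."""
    n = len(a)
    v = [1.0 / n] * n
    lam = 0.0
    for _ in range(iters):
        w = [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = sum(w)  # v sums to 1, so sum(A v) estimates lambda_max
        v = [x / lam for x in w]
    return lam

def consistency_ratio(a):
    """Classical AHP consistency ratio of a pairwise comparison matrix (n = 3..9)."""
    n = len(a)
    ci = (principal_eigenvalue(a) - n) / (n - 1)
    return ci / RANDOM_INDEX[n]
```

A perfectly consistent matrix, where every entry equals the ratio of two underlying weights, has λmax = n and therefore a ratio of 0; by the usual rule of thumb, values above 0.1 suggest that the judgments should be revisited.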

18.
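Particle swarm optimization algorithm based on a local-search inertia weight   Total citations: 2, self-citations: 1, citations by others: 1
The performance of the particle swarm optimization algorithm is governed mainly by its parameters, above all the inertia weight; choosing a suitable ω balances the algorithm's global and local search abilities. The learning factors are adjusted according to the function value of the current particle, and the inertia weight is determined by a local search method, which improves the robustness of the algorithm. Finally, the algorithm is verified on several standard test functions, and experimental analysis shows that it performs well.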
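To show where the inertia weight ω enters the method, here is a minimal sketch of the standard particle swarm velocity update. It keeps ω and the learning factors constant, so it is a baseline and not the adaptive scheme of this paper, whose details the abstract does not give. The function name `pso`, the search range of -5 to 5, and all default parameter values are assumptions chosen for illustration.

```python
import random

def pso(f, dim, n_particles=20, iters=200, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimise f with a basic particle swarm using a constant inertia weight w."""
    rng = random.Random(seed)
    xs = [[rng.uniform(-5, 5) for _ in range(dim)] for _ in range(n_particles)]
    vs = [[0.0] * dim for _ in range(n_particles)]
    pbest = [x[:] for x in xs]
    pbest_val = [f(x) for x in xs]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia term + pull towards personal best + pull towards global best.
                vs[i][d] = (w * vs[i][d]
                            + c1 * r1 * (pbest[i][d] - xs[i][d])
                            + c2 * r2 * (gbest[d] - xs[i][d]))
                xs[i][d] += vs[i][d]
            val = f(xs[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = xs[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = xs[i][:], val
    return gbest, gbest_val
```

A larger ω keeps particles moving along their previous direction and favours global exploration, while a smaller ω lets the attraction terms dominate and favours local refinement, which is the trade-off the abstract refers to.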

19.
Many validity indices have been proposed for quantitatively assessing the performance of clustering algorithms. One limitation of existing indices is their lack of generalizability, due to their dependence on the specific algorithms and structures of the data space. To handle large-scale datasets with arbitrary structures, this research study proposes a new cluster separation measure for improving the effectiveness of existing validity indices. This is achieved by partitioning the original data space into a grid-based structure which allows the introduction of a new measurement for assessing the true data distribution between any two clusters instead of the distance between the two cluster prototypes. To validate the effectiveness of the proposed separation measure, we adopt two commonly used validity indices, the Davies-Bouldin’s function (DB) and Tibshirani’s Gap statistic (GS). These indices are denoted as R-DB-1 and R-GS-1 for clusters with sphere-shaped structures and R-DB-2 and R-GS-2 for irregular-shaped structures. This integration enables the indices to evaluate both partitional algorithms and hierarchical algorithms. Partitional algorithms including C-Means (CM), Fuzzy C-Means (FCM), and hierarchical algorithms, including DBSCAN and CLIQUE, are used to test the performance of the new indices. Two synthetic datasets with spherical structures and four synthetic datasets with irregular shapes are first compared. Five real datasets from the UCI machine learning repository are then used to further test the measure’s performance. The experimental results provide evidence that the new indices outperform the original indices.
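The original Davies-Bouldin index that this abstract builds on can be sketched as below. This is the classical form, in which separation is the distance between cluster prototypes (here, centroids); it is exactly the term that the proposed grid-based measure is meant to replace, and the revised R-DB indices are not implemented here. The function names are hypothetical.

```python
import math

def centroid(cluster):
    """Arithmetic mean of the points in one cluster."""
    dim = len(cluster[0])
    return tuple(sum(p[d] for p in cluster) / len(cluster) for d in range(dim))

def davies_bouldin(clusters):
    """Classical Davies-Bouldin index; lower values mean better separation."""
    cents = [centroid(c) for c in clusters]
    # Within-cluster scatter: mean distance of the points to their centroid.
    scatter = [
        sum(math.dist(p, ct) for p in c) / len(c)
        for c, ct in zip(clusters, cents)
    ]
    k = len(clusters)
    total = 0.0
    for i in range(k):
        # Worst-case similarity of cluster i to any other cluster.
        total += max(
            (scatter[i] + scatter[j]) / math.dist(cents[i], cents[j])
            for j in range(k) if j != i
        )
    return total / k
```

Because the denominator only sees the two centroids, the index cannot tell whether the space between two irregularly shaped clusters is empty or densely populated, which is the limitation the abstract describes.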

20.
In this paper, we propose a modified version of the k-nearest neighbor (kNN) algorithm. We first introduce a new affinity function for distance measure between a test point and a training point which is an approach based on local learning. A new similarity function using this affinity function is proposed next for the classification of the test patterns. The widely used convention of k, i.e., k = [√N] is employed, where N is the number of data used for training purpose. The proposed modified kNN algorithm is applied on fifteen numerical datasets from the UCI machine learning data repository. Both 5-fold and 10-fold cross-validations are used. The average classification accuracy obtained from our method is found to exceed that of some well-known clustering algorithms.
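The convention k = [√N] mentioned in this abstract can be illustrated with a plain kNN classifier. This sketch uses ordinary Euclidean distance and majority voting, not the affinity and similarity functions proposed in the paper (the abstract does not define them), and it assumes that the brackets denote the integer part of √N. The function name `knn_predict` is hypothetical.

```python
import math
from collections import Counter

def knn_predict(train, labels, query):
    """Classify query by majority vote among its k = floor(sqrt(N)) nearest points."""
    # Assumption: [sqrt(N)] is read as the integer part, with a minimum of 1.
    k = max(1, math.isqrt(len(train)))
    nearest = sorted(
        range(len(train)), key=lambda i: math.dist(train[i], query)
    )[:k]
    return Counter(labels[i] for i in nearest).most_common(1)[0][0]
```

With N = 9 training points, for example, the rule gives k = 3, so each prediction is decided by the three closest training points.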
