首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
K‐Nearest neighbor (K‐NN) algorithm is a classification algorithm widely used in machine learning, statistical pattern recognition, data mining, etc. Ordered weighted averaging (OWA) distance based CxK nearest neighbor algorithm is a kind of K‐NN algorithm based on OWA distance. In this study, the aim is two‐fold: i) to perform the algorithm with two different fuzzy metric measures, which are Diamond distance, and weighted dissimilarity measure composed by spread distances and center distances, and ii) to evaluate the effects of different metric measures. K neighbors are searched for each class, and OWA distance is used to aggregate the information. The OWA distance can behave as intercluster distance approaches single, complete, and average linkages by using different weights. The experimental study is performed on well‐known three classification data sets (iris, glass, and wine). N‐fold cross‐validation is used for the evaluation of performances. It is seen that single linkage approach by using two different metric measures has significant different results.  相似文献   

2.
《控制论与系统》2012,43(1):27-58
Abstract

The aim of the paper is to develop new aggregation operators using Bonferroni means, ordered weighted averaging (OWA) operators and some distance measures. We introduce the Bonferroni-Hamming weighted distance (BON-HWD), Bonferroni OWA distance (BON-OWAD), Bonferroni OWA adequacy coefficient (BON-OWAAC) and Bonferroni distances with OWA operators and weighted averages (BON-IWOWAD). The main advantages of using these operators are that they allow the consideration of different aggregations contexts to be considered and multiple comparison between each argument and distance measures in the same formulation. An application is developed using these new algorithms in combination with Pichat algorithm to solve a group decision-making problem. Creative personality is taken as an example for forming creative groups. The results show fuzzy dissimilarity relations in order to establish the maximum similarity subrelations and find groups according to each individual’s creative personality similarities.  相似文献   

3.
Step‐like or Hurwicz‐like ordered weighted averaging (OWA) (S‐H OWA) operators connect two fundamental OWA operators, step OWA operators and Hurwicz OWA operators. S‐H OWA operators also generalize them and some other well‐know OWA operators such as median and centered OWA operators. Generally, there are two types of determination methods for S‐H OWA operators: One is from the motivation of some existed mathematical results; the other is by a set of “nonstrict” definitions and often via some intermediate elements. For the second type, in this study we define two sets of strict definitions for Hurwitz/step degree, which are more effective and necessary for theoretical studies and practical usages. Both sets of definitions are useful in different situations. In addition, they are based on the same concept moment of OWA operators proposed in this study, and therefore they become identical in limit forms. However, the Hurwicz/step degree (HD/SD) puts more concerns on its numerical measure and physical meaning, whereas the relative Hurwicz/step degree (rHD/rSD), still being accurate numerically, sometimes is more reasonable intuitively and has larger potential in further studies and practical applications.  相似文献   

4.
An important research issue in multimedia databases is the retrieval of similar objects. For most applications in multimedia databases, an exact search is not meaningful. Thus, much effort has been devoted to develop efficient and effective similarity search techniques. A recent approach that has been shown to improve the effectiveness of similarity search in multimedia databases resorts to the usage of combinations of metrics (i.e., a search on a multi-metric space). In this approach, the desirable contribution (weight) of each metric is chosen at query time. It follows that standard metric indexes cannot be directly used to improve the efficiency of dynamically weighted queries, because they assume that there is only one fixed distance function at indexing and query time. This paper presents a methodology for adapting metric indexes to multi-metric indexes, that is, to support similarity queries with dynamic combinations of metric functions. The adapted indexes are built with a single distance function and store partial distances to estimate the dynamically weighed distances. We present two novel indexes for multimetric space indexing, which are the result of the application of the proposed methodology.  相似文献   

5.
6.
New quantum distance is introduced as a half-sum of several singular values of difference between two density operators. This is, up to factor, the metric induced by so-called Ky Fan norm. The partitioned trace distances enjoy similar properties to the standard trace distance, including the unitary invariance, the strong convexity and the close relations to the classical distances. The partitioned distances cannot increase under quantum operations of certain kind including bistochastic maps. All the basic properties are re-formulated as majorization relations. Possible applications to quantum information processing are briefly discussed.  相似文献   

7.
A very important class of queries in GIS applications is the class of K-nearest neighbor queries. Most of the current studies on the K-nearest neighbor queries utilize spatial index structures and hence are based on the Euclidean distances between the points. In real-world road networks, however, the shortest distance between two points depends on the actual path connecting the points and cannot be computed accurately using one of the Minkowski metrics. Thus, the Euclidean distance may not properly approximate the real distance. In this paper, we apply an embedding technique to transform a road network to a high dimensional space in order to utilize computationally simple Minkowski metrics for distance measurement. Subsequently, we extend our approach to dynamically transform new points into the embedding space. Finally, we propose an efficient technique that can find the actual shortest path between two points in the original road network using only the embedding space. Our empirical experiments indicate that the Chessboard distance metric (L) in the embedding space preserves the ordering of the distances between a point and its neighbors more precisely as compared to the Euclidean distance in the original road network.  相似文献   

8.
The ordered weighted averaging (OWA) operator was introduced by Yager. 1 The fundamental aspect of the OWA operator is a reordering step in which the input arguments are rearranged in descending order. In this article, we propose two new classes of aggregation operators called ordered weighted geometric averaging (OWGA) operators and study some desired properties of these operators. Some methods for obtaining the associated weighting parameters are discussed, and the relationship between the OWA and DOWGA operators is also investigated. © 2002 Wiley Periodicals, Inc.  相似文献   

9.
In this study, we propose the concept of piled ordered weighted averaging (OWA) operators, which generalize the centered OWA operators and also connect the step OWA operators with the Hurwicz OWA operators with given the orness degree. We propose a controllable algorithm to generate the family of piled OWA operators depending on their predefined three parameters: orness degree, step‐like or Hurwicz‐like degree, and the numbers of “supporting” vectors. By these preferences, we can generate infinite more piled OWA operators with miscellaneous forms, and each of them is similar to the well‐known binomial OWA operator, which is very useful but only has one form corresponding to one given orness degree.  相似文献   

10.
This paper deals with OWA (ordered weighted average) operators defined on an arbitrary finite lattice endowed with a t-norm and a t-conorm. A qualitative orness measure for any OWA operator is suggested, based on its proximity to the OR operator that yields the maximum of the given data. In the particular case of a finite distributive lattice, considering the t-norm given by the meet and the t-conorm given by the join, this qualitative measure agrees with the value that some discrete Sugeno integral takes on the vector consisting of all the members of the lattice. Some applications of the qualitative orness of OWA operators to decision-making problems are shown. In addition, OWA operators defined on a finite product lattice are also applied in image processing. We analyze the effect of several OWA operators with respect to their orness.  相似文献   

11.
One of the most common techniques to find the adequate weights in ordered weighted averaging (OWA) operators is based on the orness concept, where the weights are determined by maximizing the entropy (variation) for a fixed orness value. But such an entropy represents a dispersion measure for nominal variables, while weights in an OWA operator are essentially ordinal rather than nominal. Hence, in this paper, we propose a novel way to determine OWA weights based upon ordinal dispersion measures instead of an standard entropy measure. From this approach, we find an explicit formula for the weights, and we illustrate differences by means some multicriteria decision-making examples.  相似文献   

12.
We discuss the OWA (Ordered Weighted Averaging) operators which provide for aggregation operations lying between the and and the or, and the BADD (BAsic Defuzzification Distribution) transformation. This transformation plays a central role in the development of defuzzification procedures, the aggregation of elements in a fuzzy subset to obtain one representative element. We suggest the use of the BADD transformation to generate a class of OWA like aggregation operators, denoted BADD-OWA operators. We show that while these BADD-OWA operators have some interesting properties, they fail to be mono-tonic. We next use another family of defuzzification operators, called SLIDE, to generate another class of OWA operators. These S-OWA operators provide two subfamilies of OWA operators, one Or-like and the other And-Like.  相似文献   

13.
Patterns are formalized as operators, which may be compared on the basis of an equivalence relation, of a metricd, and of probability measures. The general pattern recognition problem for metric and probabilistic patterns is then formulated, referring to the formalism of U. Grenander; identification and detection are shown to be special cases hereof. A list of possible distancesd between patterns is then given, including a suggested distance between line patterns; wider applications of some of these distances are suggested. This paper has a theoretical concern, but it nevertheless aims at letting pattern recognition draw benefit from techniques which are similar in theory, but which are curiously still considered as distinct in practice.  相似文献   

14.
We study the properties of OWA multiplication monoid. By introducing and‐accumulation vectors of OWA operators, we consider the set of all n‐dimensional OWA operators as a lattice. Then, we analyze some monotonicity properties of OWA operators based on an ordering induced by and‐accumulation vectors. We also show that the lattice‐theoretical operations are a kind of counterpart of the OWA multiplication monoid (and its dual, additive monoid). An example of using the OWA multiplication monoid and the lattice‐theoretical structure in decision making problem is provided.  相似文献   

15.
In many advanced applications, data are described by multiple high-dimensional features. Moreover, different queries may weight these features differently; some may not even specify all the features. In this paper, we propose our solution to support efficient query processing in these applications. We devise a novel representation that compactly captures f features into two components. The first component is a 2D vector that reflects a distance range (minimum and maximum values) of the f features with respect to a reference point (the center of the space) in a metric space and the second component is a bit signature, with two bits per dimension, obtained by analyzing each feature's descending energy histogram. This representation enables two levels of filtering: the first component prunes away points that do not share similar distance ranges, while the bit signature filters away points based on the dimensions of the relevant features. Moreover, the representation facilitates the use of a single index structure to further speed up processing. We employ the classical B/sup +/-tree for this purpose. We also propose a KNN search algorithm that exploits the access orders of critical dimensions of highly selective features and partial distances to prune the search space more effectively. Our extensive experiments on both real-life and synthetic data sets show that the proposed solution offers significant performance advantages over sequential scan and retrieval methods using single and multiple VA-files.  相似文献   

16.
Existing clustering-based methods for segmentation and fiber tracking of diffusion tensor magnetic resonance images (DT-MRI) are based on a formulation of a similarity measure between diffusion tensors, or measures that combine translational and diffusion tensor distances in some ad hoc way. In this paper we propose to use the Fisher information-based geodesic distance on the space of multivariate normal distributions as an intrinsic distance metric. An efficient and numerically robust shooting method is developed for computing the minimum geodesic distance between two normal distributions, together with an efficient graph-clustering algorithm for segmentation. Extensive experimental results involving both synthetic data and real DT-MRI images demonstrate that in many cases our method leads to more accurate and intuitively plausible segmentation results vis-à-vis existing methods.  相似文献   

17.
An Index Structure for Data Mining and Clustering   总被引:2,自引:0,他引:2  
In this paper we present an index structure, called MetricMap, that takes a set of objects and a distance metric and then maps those objects to a k-dimensional space in such a way that the distances among objects are approximately preserved. The index structure is a useful tool for clustering and visualization in data-intensive applications, because it replaces expensive distance calculations by sum-of-square calculations. This can make clustering in large databases with expensive distance metrics practical. We compare the index structure with another data mining index structure, FastMap, recently proposed by Faloutsos and Lin, according to two criteria: relative error and clustering accuracy. For relative error, we show that (i) FastMap gives a lower relative error than MetricMap for Euclidean distances, (ii) MetricMap gives a lower relative error than FastMap for non-Euclidean distances (i.e., general distance metrics), and (iii) combining the two reduces the error yet further. A similar result is obtained when comparing the accuracy of clustering. These results hold for different data sizes. The main qualitative conclusion is that these two index structures capture complementary information about distance metrics and therefore can be used together to great benefit. The net effect is that multi-day computations can be done in minutes. Received February 1998 / Revised July 1999 / Accepted in revised form September 1999  相似文献   

18.
We consider the problem of measuring the similarity or distance between two finite sets of points in a metric space, and computing the measure. This problem has applications in, e.g., computational geometry, philosophy of science, updating or changing theories, and machine learning. We review some of the distance functions proposed in the literature, among them the minimum distance link measure, the surjection measure, and the fair surjection measure, and supply polynomial time algorithms for the computation of these measures. Furthermore, we introduce the minimum link measure, a new distance function which is more appealing than the other distance functions mentioned. We also present a polynomial time algorithm for computing this new measure. We further address the issue of defining a metric on point sets. We present the metric infimum method that constructs a metric from any distance functions on point sets. In particular, the metric infimum of the minimum link measure is a quite intuitive. The computation of this measure is shown to be in NP for a broad class of instances; it is NP-hard for a natural problem class. Received: 1 July 1994 / 9 November 1995  相似文献   

19.
Searching in metric spaces by spatial approximation   总被引:5,自引:0,他引:5  
We propose a new data structure to search in metric spaces. A metric space is formed by a collection of objects and a distance function defined among them which satisfies the triangle inequality. The goal is, given a set of objects and a query, retrieve those objects close enough to the query. The complexity measure is the number of distances computed to achieve this goal. Our data structure, called sa-tree (“spatial approximation tree”), is based on approaching the searched objects spatially, that is, getting closer and closer to them, rather than the classic divide-and-conquer approach of other data structures. We analyze our method and show that the number of distance evaluations to search among n objects is sublinear. We show experimentally that the sa-tree is the best existing technique when the metric space is hard to search or the query has low selectivity. These are the most important unsolved cases in real applications. As a practical advantage, our data structure is one of the few that does not need to tune parameters, which makes it appealing for use by non-experts. Edited by R. Sacks-Davis Received: 17 April 2001 / Accepted: 24 January 2002 / Published online: 14 May 2002  相似文献   

20.
Ordered weighted averaging (OWA) operators have been widely used in decision making these past few years. An important issue facing the OWA operators' users is the determination of the OWA weights. This paper introduces an OWA determination method based on truncated distributions that enable intuitive generation of OWA weights according to a certain level of risk and trade‐off. These two dimensions are represented by the two first moments of the truncated distribution. We illustrate our approach with the well‐known normal distribution and the definition of a continuous parabolic decision‐strategy space. We finally study the impact of the number of criteria on the results.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号