首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
The consensus clustering technique combines multiple clustering results without accessing the original data. Consensus clustering can be used to improve the robustness of clustering results or to obtain the clustering results from multiple data sources. In this paper, we propose a novel definition of the similarity between points and clusters. With an iterative process, such a definition of similarity can represent how a point should join or leave a cluster clearly, determine the number of clusters automatically, and combine partially overlapping clustering results. We also incorporate the concept of “clustering fragment” into our method for increased speed. The experimental results show that our algorithm achieves good performances on both artificial data and real data.  相似文献   

2.
This paper presents a robust Real-coded evolutionary algorithm. Real-coded evolutionary algorithms (RCEAs), such as real-coded genetic algorithms and evolution strategies, are known as effective methods for function optimization. However, they often fail to find the optimum in case the objective function is multimodal, discrete or high-dimensional. It is also reported that most crossover (or recombination) operators for RCEAs has sampling bias that prevents to find the optimum near the boundary of search space. They like to search the center of search space much more than the other. Therefore, they will not work on functions that have their optima near the boundary of search space. In this paper, we apply two methods, genetic algorithm with search area adaptation (GSA) and toroidal search space conversion (TSC), to the function optimization for improving the robustness of RCEAs. The former method searches adaptively and the latter one removes the sampling bias. Through several experiments, we have confirmed that GSA works adaptively and it shows higher performance, and RCEAs with TSC show effectiveness to find the optima near the boundary of search space.  相似文献   

3.
针对现有鲁棒图形模糊聚类算法难以满足强噪声干扰下大幅面图像快速分割的需要,提出一种快速鲁棒核空间图形模糊聚类分割算法。该算法将欧氏空间样本通过核函数映射至高维空间;采用待分割图像中像素邻域的灰度和空间等信息构建线性加权滤波图像,对其进行鲁棒核空间图形模糊聚类;并引入当前聚类像素与其邻域像素均值所对应的二维直方图信息,获得鲁棒核空间图形模糊聚类快速迭代表达式。对大幅面图像添加高斯和椒盐噪声进行分割测试,实验结果表明:本文算法相比基于图形模糊聚类等分割算法的分割性能、抗噪鲁棒性和实时性有了显著提高。  相似文献   

4.
A robust information clustering algorithm   总被引:1,自引:0,他引:1  
Song Q 《Neural computation》2005,17(12):2672-2698
We focus on the scenario of robust information clustering (RIC) based on the minimax optimization of mutual information (MI). The minimization of MI leads to the standard mass-constrained deterministic annealing clustering, which is an empirical risk-minimization algorithm. The maximization of MI works out an upper bound of the empirical risk via the identification of outliers (noisy data points). Furthermore, we estimate the real risk VC-bound and determine an optimal cluster number of the RIC based on the structural risk-minimization principle. One of the main advantages of the minimax optimization of MI is that it is a nonparametric approach, which identifies the outliers through the robust density estimate and forms a simple data clustering algorithm based on the square error of the Euclidean distance.  相似文献   

5.
鉴于文本数据具有方向性数据的特征,可利用方向数据的知识完成对文本数据聚类,提出了模糊方向相似性聚类算法FDSC,继而从竞争学习角度,通过引入隶属度约束函数,并根据拉格朗日优化理论推导出鲁棒的模糊方向相似性聚类算法RFDSC.实验结果表明RFDSC算法能够快速有效地对文本数据集进行聚类.  相似文献   

6.
This paper addresses three major issues associated with conventional partitional clustering, namely, sensitivity to initialization, difficulty in determining the number of clusters, and sensitivity to noise and outliers. The proposed robust competitive agglomeration (RCA) algorithm starts with a large number of clusters to reduce the sensitivity to initialization, and determines the actual number of clusters by a process of competitive agglomeration. Noise immunity is achieved by incorporating concepts from robust statistics into the algorithm. RCA assigns two different sets of weights for each data point: the first set of constrained weights represents degrees of sharing, and is used to create a competitive environment and to generate a fuzzy partition of the data set. The second set corresponds to robust weights, and is used to obtain robust estimates of the cluster prototypes. By choosing an appropriate distance measure in the objective function, RCA can be used to find an unknown number of clusters of various shapes in noisy data sets, as well as to fit an unknown number of parametric models simultaneously. Several examples, such as clustering/mixture decomposition, line/plane fitting, segmentation of range images, and estimation of motion parameters of multiple objects, are shown  相似文献   

7.
Zhao  Jiaxing  Bo  Ren  Hou  Qibin  Cheng  Ming-Ming  Rosin  Paul 《计算可视媒体(英文)》2018,4(4):333-348
Computational Visual Media - In this paper, we reconsider the clustering problem for image over-segmentation from a new perspective. We propose a novel search algorithm called “active...  相似文献   

8.
In the “first-order reliability method” (FORM), the HL-RF iterative algorithm is a recommended and widely used one to locate the design point and calculate the reliability index. However it may fail to converge if the limit state surface at the design point is highly nonlinear. In this paper, an easy iterative algorithm, which introduces a “new” step length to control the convergence of the sequence and can be named as finite-step-length iterative algorithm, is present. It is proved that the HL-RF method is a special case of this proposed algorithm when the step length tends to infinity and the reason why the HL-RF diverges is illustrated. This proposed algorithm is much easier than other optimization schemes, especially than the modified HL-RF algorithm, because the process of line search for obtaining the step length is not needed. Numerical results indicate that the proposed algorithm is effective and as simple as the HL-RF but more robust.  相似文献   

9.
针对随机节点故障所引发的连锁故障问题,为了尽可能地降低连锁故障对无线传感器网络所造成的损害,提出了一种无线传感器网络无标度容错拓扑的连锁故障诊断算法,该算法基于单一节点故障时负载重新分配给相邻节点的情况,提出一种连锁故障下的负载再分配模型,分析了单一节点故障时所产生的连锁故障规模。采用相邻节点的连锁故障诊断算法来研究传感器网络的负载参数和连锁故障规模之间的关系,尽可能地减少连锁故障所带来的节点损失。仿真结果表明,该算法有效地抑制了由于负载过大所引发的连锁负载效应,在减少网络节点的损失上起到了较好的效果。  相似文献   

10.
A concept-driven algorithm for clustering search results   总被引:3,自引:0,他引:3  
Without search engines, the Internet would be an enormous amount of disorganized information that would certainly be interesting but perhaps not very useful. Search engines help us in all kinds of tasks and are constantly improving result relevance. The Lingo algorithm combines common phrase discovery and latent semantic indexing techniques to separate search results into meaningful groups. It looks for meaningful phrases to use as cluster labels and then assigns documents to the labels to form groups.  相似文献   

11.
We develop a new algorithm for clustering search results. Differently from many other clustering systems that have been recently proposed as a post-processing step for Web search engines, our system is not based on phrase analysis inside snippets, but instead uses latent semantic indexing on the whole document content. A main contribution of the paper is a novel strategy – called dynamic SVD clustering – to discover the optimal number of singular values to be used for clustering purposes. Moreover, the algorithm is such that the SVD computation step has in practice good performance, which makes it feasible to perform clustering when term vectors are available. We show that the algorithm has very good classification performance, and that it can be effectively used to cluster results of a search engine to make them easier to browse by users. The algorithm has being integrated into the Noodles search engine, a tool for searching and clustering Web and desktop documents.  相似文献   

12.
为了解决聚类算法容易陷入局部最优的问题,以及增强聚类算法的全局搜索能力,基于KHM算法以及改进的引力搜索算法,本文提出一种混合K-调和均值聚类算法(G-KHM)。G-KHM算法具有KHM算法收敛速度快的优点,但同时针对KHM算法容易陷入局部最优解的问题,在初始化后数据开始搜索聚类中心时采用了一种基于对象多样性及收敛性增强的引力搜索算法,该方法改进了引力搜索算法容易失去种群多样性的缺点,并同时具有引力搜索算法较强的全局搜索能力,可以使算法收敛到全局最优解。仿真结果表明,G-KHM算法能有效地避免陷入局部极值,具有较强的全局搜索能力以及稳定性,并且相比KHM算法、K-mean聚类算法、C均值聚类算法以及粒子群算法,在分类精度和运行时间上表现出了更好地效果。  相似文献   

13.
Evolutionary algorithms (EAs), which have been widely used to solve various scientific and engineering optimization problems, are essentially stochastic search algorithms operating in the overall solution space. However, such random search mechanism may lead to some disadvantages such as a long computing time and premature convergence. In this study, we propose a space search optimization algorithm (SSOA) with accelerated convergence strategies to alleviate the drawbacks of the purely random search mechanism. The overall framework of the SSOA involves three main search mechanisms: local space search, global space search, and opposition-based search. The local space search that aims to form new solutions approaching the local optimum is realized based on the concept of augmented simplex method, which exhibits significant search abilities realized in some local space. The global space search is completed by Cauchy searching, where the approach itself is based on the Cauchy mutation. This operation can help the method avoid of being trapped in local optima and in this way alleviate premature convergence. An opposition-based search is exploited to accelerate the convergence of space search. This operator can effectively reduce a substantial computational overhead encountered in evolutionary algorithms (EAs). With the use of them SSOA realizes an effective search process. To evaluate the performance of the method, the proposed SSOA is contrasted with a method of differential evolution (DE), which is a well-known space concept-based evolutionary algorithm. When tested against benchmark functions, the SSOA exhibits a competitive performance vis-a-vis performance of some other competitive schemes of differential evolution in terms of accuracy and speed of convergence, especially in case of high-dimensional continuous optimization problems.  相似文献   

14.
为解决无线传感器网络中质心算法对锚节点密度要求较高和定位精度过度依赖锚节点分布的问题,提出了一种多节点协作迭代求精的WSNs加权质心定位算法.该算法采用加权质心估算初始坐标,以请求二跳锚节点的方式增加可用锚节点,由锚节点以多边测距方式估算待定位节点的实际坐标与估算坐标的差值,迭代调整估算坐标,提高定位精度.实验结果表明,与普通加权质心算法相比较,该算法具有更高的定位精度和定位覆盖度.  相似文献   

15.
Clustering is a useful tool for finding structure in a data set. The mixture likelihood approach to clustering is a popular clustering method, in which the EM algorithm is the most used method. However, the EM algorithm for Gaussian mixture models is quite sensitive to initial values and the number of its components needs to be given a priori. To resolve these drawbacks of the EM, we develop a robust EM clustering algorithm for Gaussian mixture models, first creating a new way to solve these initialization problems. We then construct a schema to automatically obtain an optimal number of clusters. Therefore, the proposed robust EM algorithm is robust to initialization and also different cluster volumes with automatically obtaining an optimal number of clusters. Some experimental examples are used to compare our robust EM algorithm with existing clustering methods. The results demonstrate the superiority and usefulness of our proposed method.  相似文献   

16.
A practical method of three-dimensional feature space iterative clustering (3D-FSIC) for image classification has been introduced, in which the clustering iteration is performed in three-dimensional feature space rather than scanning the image pixel by pixel. This method permits the cluster size and pixel frequency to be taken into account so that a more advanced decision rule, the optimal multiple point reassignment (OMPR) can be applied. The paper also provides a simple technique for splitting a cluster based on the first principal component without performing principal component transformation. Finally, a classification example using hue images as well as a discussion of the advantages of using hue images in the 3D-FSIC classification is given.  相似文献   

17.
针对传统的模糊C均值聚类算法在进行图像分割时对孤立点、噪声点敏感性较强,聚类耗时随图像变大而快速增长等缺陷,基于临近元素空间距离的模糊C均值聚类算法即SFGFCM算法,采用核化的空间距离公式,计算出空间临近像素与考察像素的相似度Sij,然后用邻近像素灰度加权和计算出邻近信息制约图像,并进一步在邻近信息制约图像的灰度级统计的基础上进行聚类。该算法考察了临近像素灰度和位置等信息,并且它们之间取得了很好的平衡;不仅表现出较强的鲁棒性且很好地保留了原图像边缘等细节信息,提高了聚类精度,同时大大缩短了大幅图像的聚类时间。通过在合成图像、医学图像及自然图像上的大量实验,与传统算法对比该算法聚类性能明显提高,在图像分割上体现出了较好的分割效果。  相似文献   

18.
软硬件划分问题是软硬件协同设计的重要问题之一,它涉及到系统建模,划分算法和划分方案评价等问题,其中划分算法设计是关键点。以提高系统时间性能为目标,利用任务流图构造系统模型,在其上实现了基于优先权的评价函数,提出了搜索空间平滑技术与离散粒子群算法相结合的软硬件划分算法,并且解决了两者的融合问题,并能根据系统信息动态适应调整算法参数。实验结果表明,算法时间开销稳定,求解质量较高。  相似文献   

19.
针对布谷鸟搜索(CS)算法后期收敛速度慢,传统K-均值算法对初始簇中心选择比较敏感,提出了一种自适应调整的布谷鸟搜索及优化初始K-均值聚类算法(CSSA-OIKM)。首先,由“集群度”与距离均衡优化选择初始簇中心。其次,融合粒子群算法思想,遵循自适应优化学习策略以均衡CS算法全局与局部精细搜索能力。最后,在改进CS算法的基础上引入自适应度调节步长因子与动态变化发现概率,增强算法收敛性能。通过对经典数据集的仿真实验分析,相比K-均值算法、PSO-K-均值算法及CS-K-均值算法来说,提出的CSSA-OIKM算法能有效提高聚类精确性,且算法稳定性好。  相似文献   

20.
In this paper, a genetic clustering algorithm based on dynamic niching with niche migration (DNNM-clustering) is proposed. It is an effective and robust approach to clustering on the basis of a similarity function relating to the approximate density shape estimation. In the new algorithm, a dynamic identification of the niches with niche migration is performed at each generation to automatically evolve the optimal number of clusters as well as the cluster centers of the data set without invoking cluster validity functions. The niches can move slowly under the migration operator which makes the dynamic niching method independent of the radius of the niches. Compared to other existing methods, the proposed clustering method exhibits the following robust characteristics: (1) robust to the initialization, (2) robust to clusters volumes (ability to detect different volumes of clusters), and (3) robust to noise. Moreover, it is free of the radius of the niches and does not need to pre-specify the number of clusters. Several data sets with widely varying characteristics are used to demonstrate its superiority. An application of the DNNM-clustering algorithm in unsupervised classification of the multispectral remote sensing image is also provided.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号