Similar literature
 Found 20 similar documents; search time: 15 ms
1.
While reducing the dimensionality of a corpus, concept decomposition (CD) based on fuzzy K-means (FKM) clustering provides a better approximation than CD based on spherical k-means clustering. However, the performance of the FKM algorithm is limited by its distance metric, and it has been shown that assigning feature weights can improve it. Our work builds upon this analysis and proposes two approaches to feature weight selection. Using four test document collections, we demonstrate that CD based on the proposed feature-weighted FKM provides a better approximation than CD based on FKM while maintaining retrieval quality.
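
The abstract above does not say how its two weight-selection approaches work, so the following is only a minimal sketch of the general idea: a per-feature weight vector enters the distance used by the standard fuzzy K-means membership update. The function names, the weight vector `w`, and the fuzzifier default `m=2.0` are illustrative assumptions, not the authors' method.

```python
def weighted_sq_dist(x, c, w):
    """Squared Euclidean distance with one non-negative weight per feature."""
    return sum(wj * (xj - cj) ** 2 for xj, cj, wj in zip(x, c, w))


def fuzzy_memberships(x, centers, w, m=2.0, eps=1e-12):
    """Standard fuzzy K-means membership of point x in each cluster.

    The weights w are assumed to be given; how to choose them is exactly
    what the abstract's two (unspecified) approaches address.
    """
    # eps keeps the division defined when x coincides with a center.
    d = [weighted_sq_dist(x, c, w) + eps for c in centers]
    power = 1.0 / (m - 1.0)
    inv = [(1.0 / di) ** power for di in d]
    total = sum(inv)
    return [v / total for v in inv]  # memberships sum to 1
```

For the point (0, 0) and centers (1, 0) and (0, 2), equal weights (1, 1) give memberships of about 0.8 and 0.2, while down-weighting the second feature to 0.25 makes the two clusters equally plausible (0.5 each), which illustrates how feature weights reshape the partition.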

2.
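Research on image coding based on fuzzy vector quantization   Total citations: 4, self-citations: 0, citations by others: 4
This paper analyzes the principle of image coding based on fuzzy vector quantization (FVQ) and gives the three key elements of FVQ design. An exponential fuzzy vector quantization algorithm (FVQE) for image coding is proposed. Experimental results show that the image coding performance of FVQE is comparable to that of FVQ, while its convergence is slightly faster than that of the FVQ algorithm.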

3.
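Dynamic fuzzy vector quantization algorithm   Total citations: 2, self-citations: 0, citations by others: 2
When used for vector quantization, the traditional K-means algorithm depends strongly on the choice of the initial codebook; if the initial codebook is poorly chosen, the algorithm easily becomes trapped in a local minimum. Bezdek's fuzzy K-means algorithm, in turn, is rarely used for codebook design in vector quantization because of its heavy computational cost. Researchers have therefore long sought algorithms that perform well in both convergence speed and convergence quality. Building on a study of the fuzzy vector quantization (FVQ) algorithm proposed by Nicolaos et al., and addressing the problems in the convergence process of the FVQ algorithm from the standpoint of convergence structure and convergence strategy, the paper proposes a dynamic ... method ... in convergence speed ...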

4.
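An adaptive algorithm for fuzzy kernel clustering   Total citations: 2, self-citations: 2, citations by others: 2
Li Kan, Liu Yushu, 《控制与决策》 (Control and Decision), 2004, 19(5): 595-597
Fuzzy clustering algorithms perform poorly when sample features are not distinctive, and existing fuzzy clustering algorithms require the number of clusters to be fixed in advance, are strongly affected by randomness, and easily fall into local optima. To address these weaknesses, a kernel function and a validity function are introduced into fuzzy clustering, yielding an adaptive algorithm for fuzzy kernel clustering. The method substantially improves on classical clustering algorithms and achieves better clustering results. Experimental results confirm its effectiveness and feasibility.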

5.
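Pattern matching plays an important role in a speaker recognition system as a whole, and the method adopted directly affects the system's recognition rate. This paper introduces a fuzzy vector quantization (FVQ) method. Based on an analysis of the fuzzy C-means (FCM) clustering algorithm, it proposes a speaker recognition method that combines subtractive clustering with an improved fuzzy C-means clustering algorithm. Experiments show that the method raises the recognition rate and is an effective approach to speaker recognition.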

6.
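Affinity propagation clustering image segmentation method based on fuzzy connectedness   Total citations: 1, self-citations: 0, citations by others: 1
Du Yanxin, Ge Hongwei, Xiao Zhiyong, 《计算机应用》 (Journal of Computer Applications), 2014, 34(11): 3309-3313
To address the low segmentation accuracy of existing affinity propagation clustering methods for image segmentation, an affinity propagation clustering image segmentation algorithm based on fuzzy connectedness (FCAP) is proposed. Because the traditional fuzzy connectedness algorithm cannot produce the fuzzy connectedness between arbitrary pairs of points, a whole fuzzy connectedness algorithm is proposed that makes use of the maximum spanning tree. FCAP first performs superpixel segmentation with the Normalized Cut superpixel technique, and the resulting superpixels are treated as data points. It then uses the proposed whole fuzzy connectedness algorithm to compute the fuzzy connectedness between superpixels, and computes superpixel similarity from the fuzzy connectedness and spatial information. Finally, the affinity propagation (AP) clustering algorithm completes the segmentation. Experimental results show that FCAP clearly outperforms applying AP clustering directly after superpixel processing, and also outperforms unsupervised image segmentation methods.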
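
The abstract above does not spell out its whole fuzzy connectedness algorithm. As a hedged illustration of why a maximum spanning tree helps, the sketch below uses the usual definition (the connectedness of two nodes is the best path strength, where a path is as strong as its weakest edge) and a Kruskal-style construction: when the strongest remaining edge joins two components, its affinity is the connectedness of every pair split across them. The function name and the edge-list input format are assumptions made for this example.

```python
def all_pairs_fuzzy_connectedness(n, edges):
    """All-pairs max-min path strength on an undirected affinity graph.

    n     -- number of nodes (for example, superpixels), labelled 0..n-1
    edges -- iterable of (u, v, affinity) with affinity in [0, 1]
    Nodes in different connected components keep connectedness 0.
    """
    conn = [[0.0] * n for _ in range(n)]
    for i in range(n):
        conn[i][i] = 1.0
    members = {i: [i] for i in range(n)}  # component root -> its nodes
    root = list(range(n))                 # node -> component root

    # Kruskal order for a MAXIMUM spanning tree: strongest edges first.
    for u, v, a in sorted(edges, key=lambda e: e[2], reverse=True):
        ru, rv = root[u], root[v]
        if ru == rv:
            continue  # edge is not part of the maximum spanning tree
        # Every earlier tree edge is at least as strong as a, so a is the
        # bottleneck of the tree path between any cross-component pair.
        for p in members[ru]:
            for q in members[rv]:
                conn[p][q] = conn[q][p] = a
        if len(members[ru]) < len(members[rv]):
            ru, rv = rv, ru  # merge the smaller component into the larger
        for q in members[rv]:
            root[q] = ru
        members[ru].extend(members.pop(rv))
    return conn
```

On a four-node graph with edges (0, 1, 0.9), (1, 2, 0.4), (0, 2, 0.2), and (2, 3, 0.7), nodes 0 and 2 obtain connectedness 0.4 through node 1 rather than 0.2 through their direct edge.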

7.
Large graphs are scale-free and ubiquitous, with irregular relationships. Clustering is used to find similar patterns in graphs and thus helps in gaining useful insights. In the real world, nodes may belong to more than one cluster, so it is essential to analyze the fuzzy cluster membership of nodes. Traditional centralized fuzzy clustering algorithms incur high communication cost and produce poor-quality clusters when used on large graphs. Scalable solutions are therefore needed to handle huge amounts of data in less computational time with minimal disk access. In this paper, we propose a parallel fuzzy clustering algorithm named 'PGFC' for handling scalable graph data. From the viewpoint of expert systems, it is advantageous to develop a clustering algorithm that can assure scalability along with better cluster quality for large graphs. The algorithm is parallelized using the bulk synchronous parallel (BSP) based Pregel model. The cluster centers are initialized using the degree centrality measure, resulting in fewer iterations. The performance of PGFC is compared with other state-of-the-art clustering algorithms using synthetic graphs and real-world networks. The experimental results reveal that the proposed PGFC scales up linearly to handle large graphs and produces better-quality clusters than other graph clustering counterparts.

8.
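To counter the interference of insufficient data sets and noise in cluster analysis, this paper builds on a clustering technique within the fuzzy C-means (FCM) framework, namely the generalized improved fuzzy partition clustering algorithm (GIFP-FCM), and explores a clustering method with transfer learning capability: the GIFP-FCM algorithm with an integrated transfer learning mechanism (T-GIFP-FCM). The algorithm makes effective use of knowledge summarized from historically related scenes (domains) to guide the clustering task in the current scene (domain) when information is insufficient, thereby improving clustering results. Simulation experiments on synthetic and real data sets show that the proposed algorithm performs better than traditional algorithms on tasks with insufficient information.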

9.
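Objective: To address the non-convergence of existing generalized balanced fuzzy C-means clustering, a new improved generalized balanced fuzzy clustering algorithm is proposed and extended to a reproducing kernel Hilbert space to broaden the applicability of this class of algorithms. Method: Starting from the existing objective function of generalized balanced fuzzy C-means clustering, a new optimization objective function is constructed using the properties of the limit expression of the Schweizer T-norm. The Lagrange multiplier method is then used to derive the membership and cluster-center expressions for its iterative solution. The iterative expression for the cluster centers is also modified, giving a class of modified clustering algorithms with markedly improved clustering performance. Finally, a nonlinear function maps the data samples into a high-dimensional feature space to obtain a kernel-space generalized balanced fuzzy clustering algorithm. Results: Tests on clustering the standard Iris text data and on gray-scale image segmentation show that the proposed improved generalized balanced fuzzy clustering algorithm and its modified version have good classification performance. Compared with the existing improved fuzzy C-means clustering algorithm that incorporates between-cluster distance (FCS) and the improved reproducing-kernel-space fuzzy local C-means clustering algorithm (KFLICM), the kernel-space generalized balanced fuzzy clustering algorithm reduces the misclassification rate of image segmentation by 10%-30%. Conclusion: The proposed algorithm overcomes the defects of the existing generalized balanced fuzzy C-means clustering algorithm while improving clustering performance, and is suited to the cluster analysis of complex data.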

10.
An axiomatic approach to soft learning vector quantization and clustering   Total citations: 11, self-citations: 0, citations by others: 11
This paper presents an axiomatic approach to soft learning vector quantization (LVQ) and clustering based on reformulation. The reformulation of the fuzzy c-means (FCM) algorithm provides the basis for reformulating entropy-constrained fuzzy clustering (ECFC) algorithms. According to the proposed approach, the development of specific algorithms reduces to the selection of a generator function. Linear generator functions lead to the FCM and fuzzy learning vector quantization algorithms, while exponential generator functions lead to ECFC and entropy-constrained learning vector quantization algorithms. The reformulation of LVQ and clustering algorithms also provides the basis for developing uncertainty measures that can identify feature vectors equidistant from all prototypes. These measures are employed by a procedure developed to make soft LVQ and clustering algorithms capable of identifying outliers in the data set. This procedure is evaluated by testing the algorithms generated by linear and exponential generator functions on speech data.

11.
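Clustering is a very effective method of information analysis. To address the poor clustering results of existing fuzzy C-means (FCM) clustering algorithms based on particle swarm optimization, an FCM clustering algorithm based on improved particle swarm optimization is proposed and applied to the clustering of mobile interface patterns. First, a reasonable intuitionistic fuzzy entropy is constructed using the geometric interpretation of, and constraints on, intuitionistic fuzzy entropy. Then, the intuitionistic fuzzy entropy is used within particle swarm optimization to judge the degree of population diversity, and a chaotic opposition-based learning strategy is introduced to improve global search ability. Finally, a Gaussian kernel function is added to the clustering algorithm to strengthen its ability to handle nonlinearity. Experiments on mobile interface pattern clustering show that the proposed algorithm achieves better clustering results than existing clustering algorithms.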

12.
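Some new advances in semi-supervised clustering   Total citations: 6, self-citations: 0, citations by others: 6
Semi-supervised clustering methods use a small amount of labeled data to improve the performance of clustering algorithms, and have gradually become a research focus in pattern recognition and related fields. This paper first surveys some new advances in semi-supervised clustering algorithms, including constraint-based methods, distance-based methods, and methods that fuse distance and constraints. It then proposes a constraint-based semi-supervised fuzzy C-means clustering algorithm. Experiments show that the algorithm achieves better clustering accuracy than the traditional fuzzy C-means and semi-supervised K-means methods.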

13.
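Clustering has been a focus of research in machine learning and data mining in recent years. Because acquiring large amounts of supervision information is time-consuming and laborious, current research in China and abroad concentrates on how to obtain a small amount of supervision information that significantly improves clustering performance. Taking into account, in addition, the dynamic fuzziness present in practical problems, this paper proposes DF-DBSCAN, a dynamic fuzzy clustering algorithm combined with active learning. It guides the DBSCAN clustering process with three kinds of constraint information: the dynamic fuzzy equivalence relation, the dynamic fuzzy belief measure, and the dynamic fuzzy plausibility measure, in order to improve clustering performance. Experimental results show that DF-DBSCAN not only solves the problem of describing and representing dynamically fuzzy data in practical problems, but also clusters data efficiently and significantly improves clustering performance.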

14.
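A fuzzy min-max clustering algorithm based on category fusion   Total citations: 1, self-citations: 1, citations by others: 1
A new fuzzy min-max clustering algorithm based on category fusion is proposed. The algorithm first preprocesses the normalized data set with an initial-category generation sub-algorithm, producing a series of initial pattern categories. It then applies a category fusion sub-algorithm, which converts the category fusion problem into the problem of finding the connected subgraphs of an undirected graph: the points in the same connected subgraph are fused into one category, and the number of connected subgraphs is the final number of clusters. Simulation results show that, on data sets with an unknown number of pattern categories and arbitrarily distributed samples, the algorithm clearly outperforms the traditional fuzzy C-means algorithm.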
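
The fusion step described above, which merges everything in one connected subgraph and counts the subgraphs, can be sketched with a union-find structure. The abstract does not state the criterion that decides which initial categories are linked, so `fusible_pairs` below is a hypothetical input supplied by the caller, and the function name is likewise an assumption.

```python
def fuse_categories(n, fusible_pairs):
    """Merge initial categories that fall in the same connected subgraph.

    n             -- number of initial categories, labelled 0..n-1
    fusible_pairs -- iterable of (a, b) pairs judged mergeable (hypothetical
                     criterion; the abstract does not specify it)
    Returns (labels, cluster_count): a final cluster id for each initial
    category and the number of connected subgraphs.
    """
    parent = list(range(n))

    def find(i):
        # Path halving keeps the trees shallow.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in fusible_pairs:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    roots = {}   # component root -> compact cluster id
    labels = []
    for i in range(n):
        labels.append(roots.setdefault(find(i), len(roots)))
    return labels, len(roots)
```

With five initial categories and links (0, 1), (1, 2), and (3, 4), the sketch returns the labels [0, 0, 0, 1, 1] and a final cluster count of 2, so the number of clusters falls out of the graph structure rather than being fixed in advance.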

15.
Fuzzy clustering is a widely applied method for extracting the underlying models within data. It has been applied successfully in many real-world applications. Fuzzy c-means is one of the most popular fuzzy clustering methods because it produces reasonable results and its implementation is straightforward. One problem with fuzzy clustering algorithms such as fuzzy c-means is that some data points assigned to a cluster have low membership values, so many samples may be assigned to a cluster with low confidence. In this paper, an efficient and noise-aware implementation of support vector machines, namely relaxed constraints support vector machines, is used to solve this problem and improve the performance of the fuzzy c-means algorithm. First, fuzzy c-means partitions the data into appropriate clusters. Then, the samples with high membership values in each cluster are selected for training a multi-class relaxed constraints support vector machine classifier. Finally, the class labels of the remaining data points are predicted by this classifier. The performance of the proposed clustering method is evaluated by quantitative measures such as cluster entropy and Minkowski scores. Experimental results on real-life data sets show the superiority of the proposed method.
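
The second step of the pipeline above, choosing high-membership samples for training and leaving the rest to be labeled later, might look like the sketch below. It assumes a membership matrix already produced by fuzzy c-means; the threshold of 0.8 and the function name are illustrative assumptions, and the relaxed constraints SVM itself is not implemented here.

```python
def split_by_confidence(memberships, threshold=0.8):
    """Separate confidently clustered samples from low-confidence ones.

    memberships -- one row per sample, one fuzzy membership per cluster
    threshold   -- assumed cut-off; the abstract gives no specific value
    Returns (confident, uncertain): confident holds (sample_index, cluster)
    pairs usable as training labels, uncertain holds the indices whose
    labels would be predicted afterwards by the trained classifier.
    """
    confident, uncertain = [], []
    for idx, row in enumerate(memberships):
        # Cluster with the highest membership for this sample.
        label = max(range(len(row)), key=row.__getitem__)
        if row[label] >= threshold:
            confident.append((idx, label))
        else:
            uncertain.append(idx)
    return confident, uncertain
```

For the membership rows [0.9, 0.1], [0.55, 0.45], and [0.2, 0.8], the first and third samples become training examples for clusters 0 and 1, while the second is left for the classifier to label.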

16.
This paper presents two new types of clustering algorithms that use a tolerance vector: tolerant fuzzy c-means clustering and tolerant possibilistic clustering. In the proposed algorithms, the new concept of the tolerance vector plays a very important role. The original concept was developed to handle data flexibly; here a tolerance vector is attributed not only to each data point but also to each cluster. Using the new concept, we can account for the influence of clusters on each data point through the tolerance. First, the new concept of tolerance is introduced into the optimization problems. Second, the optimization problems with tolerance are solved by using the Karush–Kuhn–Tucker conditions. Third, new clustering algorithms are constructed based on the optimal solutions. Finally, the effectiveness of the proposed algorithms is verified through numerical examples and their fuzzy classification functions.

17.
The picture fuzzy set (PFS), a generalization of the traditional fuzzy set and the intuitionistic fuzzy set, shows great promise of better adaptation to many practical problems in pattern recognition, artificial life, robotics, and expert and knowledge-based systems than existing types of fuzzy sets. An emerging research trend in PFS is the development of clustering algorithms that can uncover hidden knowledge in large collections of data sets. The distance measure is one of the most important tools in clustering, as it determines the degree of relationship between two objects. In this paper, we propose a generalized picture distance measure and integrate it into a novel hierarchical picture fuzzy clustering method called Hierarchical Picture Clustering (HPC). Experimental results show that the clustering quality of the proposed algorithm is better than that of related algorithms.

18.
Recent advances in microarray technology permit monitoring of the expression levels of a large set of genes across a number of time points simultaneously. Extracting knowledge from such huge volumes of microarray gene expression data requires computational analysis. Clustering is one of the important data mining tools for analyzing such microarray data by grouping similar genes into clusters. Researchers have proposed a number of clustering algorithms for this purpose. In this article, an attempt has been made to improve the performance of fuzzy clustering by combining it with a support vector machine (SVM) classifier. A recently proposed real-coded variable string length genetic algorithm based clustering technique and an iterated version of fuzzy C-means clustering have been utilized for this purpose. The performance of the proposed clustering scheme has been compared with that of some well-known existing clustering algorithms and their SVM-boosted versions on one simulated and six real-life gene expression data sets. A statistical significance test based on analysis of variance (ANOVA), followed by a posteriori Tukey-Kramer multiple comparison test, has been conducted to establish the statistical significance of the superior performance of the proposed clustering scheme. Moreover, the biological significance of the clustering solutions has been established.

19.
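Data fusion algorithms based on improved fuzzy clustering suffer from imprecise fusion and low fusion credibility. To solve the data fusion problem in which multiple homogeneous sensors measure one feature of the same target without prior knowledge, a data fusion algorithm based on adaptive fuzzy C-means clustering is proposed, which mainly applies adaptive fuzzy C-means clustering to data fusion. The algorithm first introduces an adaptive coefficient into the improved fuzzy clustering to discover cluster subsets of different shapes and sizes, making the fusion result more precise. It then applies the Kalman filtering principle and a neural-network prediction method based on the multilayer perceptron to error covariance estimation, which raises fusion credibility. Experimental results show that, compared with seven classical data fusion algorithms, the algorithm gives better fusion results on four synthetic and real data sets, with a particularly clear advantage in discriminant function and fusion error.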

20.
This paper presents a new recursive hybrid algorithm for training a radial basis function (RBF) network. The algorithm consists of a proposed clustering algorithm to position the RBF centres and the Givens least-squares algorithm to estimate the weights. The paper begins with a discussion of the problems of clustering when positioning RBF centres. A new clustering algorithm, called the adaptive fuzzy c-means clustering algorithm, is then proposed to reduce these problems. The capability of the proposed algorithm was tested by modelling three data sets: one simulated and two real. The algorithm was found to provide good performance. Its performance was then compared with the adaptive k-means, non-adaptive k-means, and non-adaptive fuzzy c-means clustering algorithms. The overall performance of the RBF network that used the proposed clustering algorithm was found to be much better than that of networks that used the other clustering algorithms. Simulation results also revealed that the algorithm was not sensitive to the initial centres.
