Similar literature
 20 similar documents found; search took 109 ms
1.
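The clustering result of the fuzzy C-means (FCM) algorithm depends on the choice of initial centers and tends to converge to local extrema. To address these problems, a hybrid clustering method (DPC-FCM) that combines the density peak clustering (DPC) algorithm with FCM is proposed. It exploits the ability of the fast density-peak search algorithm to characterize initial cluster centers fairly accurately, which remedies the shortcomings of FCM and yields an optimized clustering. Experimental results on UCI data sets and synthetic data sets show that, compared with traditional FCM, the combined algorithm achieves higher accuracy and faster convergence, which demonstrates its feasibility.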
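
The abstract gives no formulas, so the following is only a minimal sketch of how density peaks might supply initial centers, assuming the usual density-peak quantities: a local density `rho` and a distance `delta` to the nearest denser point. The function name `density_peak_centers`, the Gaussian density estimate, the cutoff `d_c`, and the `rho * delta` ranking are illustrative assumptions, not details taken from the paper.

```python
import math

def density_peak_centers(points, n_centers, d_c):
    """Return the n_centers points with the largest rho * delta score."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    # Local density rho_i: Gaussian-weighted neighbour count (self excluded).
    rho = [sum(math.exp(-(dist[i][j] / d_c) ** 2) for j in range(n) if j != i)
           for i in range(n)]
    delta = []
    for i in range(n):
        denser = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        # delta_i: distance to the nearest denser point; the densest point
        # gets its largest distance to any other point instead.
        delta.append(min(denser) if denser else max(dist[i]))
    # Points that are both dense and far from any denser point rank first.
    ranked = sorted(range(n), key=lambda i: rho[i] * delta[i], reverse=True)
    return [points[i] for i in ranked[:n_centers]]
```

The returned points could then be handed to FCM as its starting centers in place of a random initialization.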

2.
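Radar target range profile recognition based on a second-order fuzzy clustering algorithm   Total citations: 1, self-citations: 0, citations by others: 1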
彭翔  周代英 《计算机应用》2011,31(2):399-401
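To address the sensitivity of the fuzzy C-means (FCM) algorithm to the initial values of the cluster centers, a second-order fuzzy clustering method is proposed. Because the transitive closure (TC) algorithm needs no initialization, the method first uses it to partition the sample set at a given classification level, selects several of the resulting classes, and takes their sample means as the initial cluster centers of FCM. This provides good initial cluster centers; in addition, the classification level can be used to tune the number of cluster centers and the centers themselves, which avoids local optima and overcomes coincident clustering. Recognition experiments on measured one-dimensional range profile data of three types of aircraft targets show that the recognition rate of the second-order fuzzy clustering method is markedly better than that of FCM.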

3.
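The fuzzy C-means (FCM) algorithm is one of the most popular clustering techniques. However, traditional FCM uses the Euclidean distance as its similarity criterion, which biases it toward partitioning the data set into clusters of equal size, whereas the shape of the data set and the density of the clusters strongly affect clustering performance. To solve this problem, a distance regulating factor based on cluster density is proposed to correct the similarity measure. In addition, because FCM is sensitive to the choice of initial cluster centers and easily falls into local optima, a quantum-behaved particle swarm optimization algorithm is used to search for the global optimum. Simulation experiments show that the improved clustering algorithm (QPSO-FCM-CD) performs well.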

4.
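The fuzzy C-means (FCM) clustering algorithm is sensitive to initial values and noisy data and easily falls into local minima. To overcome these shortcomings, a shuffled frog leaping FCM algorithm based on selection and mutation mechanisms (SMSFLA-FCM) is proposed. The algorithm first introduces a linearly decreasing inertia weight into the update strategy of the shuffled frog leaping algorithm, replaces worse frogs with fitter frogs according to a certain probability, and mutates each individual frog with a different probability. The optimal solution found by the improved shuffled frog leaping algorithm is then used as the initial cluster centers of FCM, FCM refines these initial centers, and the global optimum is finally obtained, which effectively overcomes the shortcomings of FCM. Experiments on synthetic data and classic data sets show that, compared with SFLA-FCM and FCM, SMSFLA-FCM has better search ability, needs fewer iterations, and gives better clustering results.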

5.
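An evolutionary clustering algorithm based on membership optimization   Total citations: 1, self-citations: 0, citations by others: 1
Because computing the memberships of data points is the main factor affecting the execution efficiency of FCM, a new accelerated fuzzy C-means algorithm (AFCM) is proposed to speed up FCM and FCM-based evolutionary clustering algorithms. AFCM uses a sampling-based initialization to produce good initial cluster centers. For data points with large memberships it updates the fuzzy cluster centers with a single k-means step, and it updates only the small memberships, which accelerates FCM. To verify the effectiveness of the proposed method and improve clustering efficiency, AFCM is applied to fuzzy clustering algorithms based on evolutionary algorithms. Experiments show that the method reduces the computation time of clustering on large data sets while maintaining good clustering results.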

6.
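The fuzzy C-means clustering algorithm (FCM) is a popular clustering algorithm that is widely used in many engineering fields. The density-weighted fuzzy C-means algorithm (Density Weighted FCM, DWFCM) is an improvement on traditional FCM that handles FCM's sensitivity to noise well. However, neither DWFCM nor FCM solves the problem that the clustering result depends heavily on how well the initial cluster centers are chosen. An improved FCM algorithm based on nearest-neighbor node-pair density, Improved-DWFCM, is proposed. It estimates the density of each node from its nearest neighbors to remove the dependence of the clustering result on the initial cluster centers. Simulation results show that the initial cluster centers selected by this algorithm are very close to the final cluster centers, which greatly improves both the convergence speed and the clustering quality.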

7.
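An improved fast fuzzy C-means clustering algorithm   Total citations: 4, self-citations: 1, citations by others: 4
To solve the heavy computation and excessive running time of the fuzzy C-means (FCM) clustering algorithm on large data volumes, an improved method is proposed. First, the class centers obtained by clustering several random samples are used as the initial class centers of FCM, which reduces the number of iterations FCM needs to converge. Then data reduction compresses the data set that takes part in the iterations, which reduces the time of each iteration. The method greatly speeds up FCM without affecting its clustering quality.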

8.
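An FCM clustering algorithm based on a reduced data set   Total citations: 1, self-citations: 0, citations by others: 1
To reduce the heavy cost of computing Euclidean distances between samples and class centers in the fuzzy C-means (FCM) clustering algorithm, an FCM clustering algorithm based on attribute reduction is proposed. Following rough set theory, the algorithm performs attribute reduction on the initial data to eliminate redundant values in the data objects, and then performs fuzzy clustering on the reduced attribute set. Experimental results show that the algorithm effectively reduces the distance computations of FCM and improves its execution efficiency without lowering clustering accuracy.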

9.
王治和  王淑艳  杜辉 《计算机工程》2021,47(5):88-96,103
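The fuzzy C-means (FCM) clustering algorithm cannot recognize non-convex data, and its Euclidean-distance similarity measure considers only the local consistency of data points while ignoring their global consistency. An FCM algorithm that builds the similarity matrix from a density-sensitive distance measure is proposed. The affinity propagation algorithm is used to obtain a coarse number of classes, which serves as the upper bound of the search range for the optimal number of clusters. This addresses two problems of FCM: the number of clusters has to be set manually in advance, and randomly chosen initial cluster centers make the results unstable. On this basis, the max-min distance algorithm is improved to obtain representative sample points as initial cluster centers, and the silhouette coefficient is used to determine the optimal number of clusters automatically. Experimental results on UCI data sets and synthetic data sets show that, compared with classic FCM, K-means, and CFSFDP, the algorithm can recognize complex non-convex data and also converges faster while preserving clustering performance and stability.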
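
For orientation, the plain max-min distance (farthest-first) initialization that the abstract takes as its starting point can be sketched as below. This is the textbook variant, not the paper's improved version; the function name `max_min_centers` and the choice of the point nearest the data mean as the first center are assumptions made for illustration.

```python
import math

def max_min_centers(points, n_centers):
    """Farthest-first traversal: each new center maximizes its distance
    to the centers chosen so far."""
    dim = len(points[0])
    mean = [sum(p[d] for p in points) / len(points) for d in range(dim)]
    # Assumed deterministic start: the data point closest to the mean.
    centers = [min(points, key=lambda p: math.dist(p, mean))]
    # nearest[i] is the distance from points[i] to its closest chosen center.
    nearest = [math.dist(p, centers[0]) for p in points]
    while len(centers) < n_centers:
        i = max(range(len(points)), key=nearest.__getitem__)
        centers.append(points[i])
        nearest = [min(d, math.dist(p, points[i]))
                   for d, p in zip(nearest, points)]
    return centers
```

Because every new center is as far as possible from the existing ones, the selected points tend to fall in different clusters, which is why this family of methods is used to stabilize FCM initialization.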

10.
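Traditional fast clustering algorithms are mostly based on the fuzzy C-means algorithm (FCM), but FCM is sensitive to the initial cluster centers and to noisy data and easily converges to local minima, so its clustering accuracy is not high. Possibilistic C-means clustering handles FCM's noise sensitivity fairly well but tends to produce coincident clusters. Algorithms that combine FCM with possibilistic C-means clustering largely solve the coincident-cluster problem. To further improve convergence speed and robustness, a kernel-based fast possibilistic clustering algorithm is proposed. The method introduces the idea of kernel clustering and uses the sample variance to optimize the parameter η in the objective function. Experimental results on standard data sets and synthetic data sets show that the kernel-based fast possibilistic clustering algorithm improves clustering accuracy and speeds up convergence.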

11.
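Remote sensing image segmentation using possibilistic fuzzy C-means clustering based on spatial information   Total citations: 1, self-citations: 0, citations by others: 1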
张一行  王霞  方世明  李晓冬  凌峰 《计算机应用》2011,31(11):3004-3007
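The possibilistic fuzzy C-means (PFCM) clustering algorithm, an improvement on the fuzzy C-means (FCM) clustering algorithm, overcomes FCM's sensitivity to noise to some extent. However, because PFCM does not consider the spatial information between pixels, its segmentation of images with heavy noise is still unsatisfactory. A new PFCM algorithm based on spatial information (SPFCM) is therefore proposed to overcome this weakness. Analysis of synthetic images and IKONOS remote sensing images shows that SPFCM outperforms traditional FCM, PFCM, and two FCM variants that incorporate spatial information, both visually and in segmentation accuracy. For images containing Gaussian noise and salt-and-pepper noise, the average segmentation accuracy reaches 99.71%, which makes SPFCM an image segmentation algorithm with good noise suppression.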

12.
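Fuzzy C-means clustering (FCM) and possibilistic fuzzy C-means clustering (PFCM) take into account neither the sample features nor the contribution of each sample to the clustering, and they are fairly sensitive to noise. The feature-reduction fuzzy clustering algorithm FRFCM can eliminate irrelevant features from the data set and takes the weights of the remaining features into account, which gives it better clustering performance. Building on PFCM and combining it with FRFCM, a new feature-reduction possibilistic fuzzy C-means clustering algorithm (FRPFCM) is proposed. The algorithm solves the parameter dependence of PFCM, and during iteration it automatically discards irrelevant features and updates the contribution of each feature to the clustering. Tests on synthetic data sets and UCI data sets show that FRPFCM achieves higher clustering accuracy, needs fewer iterations, and converges faster.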

13.
A generalized form of the Possibilistic Fuzzy C-Means (PFCM) algorithm (GPFCM) is presented for clustering noisy data. A function of distance is used instead of the distance itself to damp noise contributions. It is shown that when the data are highly noisy, GPFCM finds accurate cluster centers but the FCM (Fuzzy C-Means), PCM (Possibilistic C-Means), and PFCM algorithms fail. FCM, PCM, and PFCM yield inaccurate cluster centers when clusters are not of the same size or a covariance norm is used, whereas GPFCM performs well in both cases even when the data are noisy. It is shown that generalized forms of FCM and PCM (GFCM and GPCM) are also more accurate than FCM and PCM. A measure is defined to evaluate the performance of the clustering algorithms. It shows that the average errors of GPFCM and its simplified forms are about 80% smaller than those of FCM, PCM, and PFCM. However, GPFCM demands higher computational costs due to its nonlinear updating equations. Three cluster validity indices are introduced to determine the number of clusters in clean and noisy datasets. One of them considers the compactness of the clusters, another considers the separation of the clusters, and the third considers both separation and compactness. The performance of these indices is confirmed to be satisfactory using various examples of noisy datasets.

14.
This article presents PFCM, a parallel algorithm for fuzzy clustering of large data sets. Being a generalization of FCM, the algorithm enables arbitrary numbers of data points, features and clusters to be handled cost-optimally by hypercube SIMD computers of arbitrary cube dimension, the only limitation being the size of the local memories of the processors. Speedup responds optimally to enlarging the hypercube. PFCM owes its flexibility to the technique employed in its derivation from the sequential fuzzy C-means algorithm FCM: the association of each of the three dimensions of the problem (numbers of data points, features and clusters) with a distinct subset of hypercube dimensions.

15.
A Possibilistic Fuzzy c-Means Clustering Algorithm   Total citations: 20, self-citations: 0, citations by others: 20
In 1997, we proposed the fuzzy-possibilistic c-means (FPCM) model and algorithm that generated both membership and typicality values when clustering unlabeled data. FPCM constrains the typicality values so that the sum over all data points of typicalities to a cluster is one. The row sum constraint produces unrealistic typicality values for large data sets. In this paper, we propose a new model called possibilistic-fuzzy c-means (PFCM) model. PFCM produces memberships and possibilities simultaneously, along with the usual point prototypes or cluster centers for each cluster. PFCM is a hybridization of possibilistic c-means (PCM) and fuzzy c-means (FCM) that often avoids various problems of PCM, FCM and FPCM. PFCM solves the noise sensitivity defect of FCM, overcomes the coincident clusters problem of PCM and eliminates the row sum constraints of FPCM. We derive the first-order necessary conditions for extrema of the PFCM objective function, and use them as the basis for a standard alternating optimization approach to finding local minima of the PFCM objective functional. Several numerical examples are given that compare FCM and PCM to PFCM. Our examples show that PFCM compares favorably to both of the previous models. Since PFCM prototypes are less sensitive to outliers and can avoid coincident clusters, PFCM is a strong candidate for fuzzy rule-based system identification.
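
The abstract mentions the PFCM objective and its first-order necessary conditions without stating them. As commonly written in the literature, they take roughly the following form; the symbols (memberships u_{ik}, typicalities t_{ik}, prototypes v_i, weights a and b, exponents m and η, and penalty constants γ_i) are supplied here as assumptions and do not appear in the abstract itself.

```latex
% Objective, with a, b > 0, m > 1, \eta > 1, \gamma_i > 0,
% subject to \sum_{i=1}^{c} u_{ik} = 1 for every data point k.
\begin{align*}
J_{m,\eta}(U, T, V; X) &= \sum_{k=1}^{n} \sum_{i=1}^{c}
    \left( a\,u_{ik}^{m} + b\,t_{ik}^{\eta} \right) \lVert x_k - v_i \rVert^{2}
    + \sum_{i=1}^{c} \gamma_i \sum_{k=1}^{n} \left( 1 - t_{ik} \right)^{\eta} \\
% First-order necessary conditions, with D_{ik} = \lVert x_k - v_i \rVert > 0.
u_{ik} &= \left( \sum_{j=1}^{c}
    \left( \frac{D_{ik}}{D_{jk}} \right)^{2/(m-1)} \right)^{-1} \\
t_{ik} &= \frac{1}{1 + \left( \dfrac{b}{\gamma_i}\, D_{ik}^{2} \right)^{1/(\eta-1)}} \\
v_i &= \frac{\sum_{k=1}^{n} \left( a\,u_{ik}^{m} + b\,t_{ik}^{\eta} \right) x_k}
           {\sum_{k=1}^{n} \left( a\,u_{ik}^{m} + b\,t_{ik}^{\eta} \right)}
\end{align*}
```

Alternating optimization cycles through these three updates until the prototypes stop moving. The membership update is the same as in FCM, while the typicality update carries no sum-to-one constraint, which is what removes the row sum constraint of FPCM.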

16.
Fuzzy Clustering Using A Compensated Fuzzy Hopfield Network   Total citations: 1, self-citations: 0, citations by others: 1
Hopfield neural networks are well known for cluster analysis with an unsupervised learning scheme. This class of networks is a set of heuristic procedures that suffers from several problems, such as no guarantee of convergence and output that depends on the sequence of input data. In this paper, a Compensated Fuzzy Hopfield Neural Network (CFHNN) is proposed which integrates a Compensated Fuzzy C-Means (CFCM) model into the learning scheme and updating strategies of the Hopfield neural network. The CFCM, modified from the Penalized Fuzzy C-Means algorithm (PFCM), is embedded into a Hopfield net to avoid the NP-hard problem and to speed up the convergence rate for the clustering procedure. The proposed network also avoids determining values for the weighting factors in the energy function. In addition, its training scheme enables the network to learn more rapidly and more effectively than FCM and PFCM. In experimental results, the CFHNN method shows promising results in comparison with FCM and PFCM methods.

17.
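An improved possibilistic fuzzy clustering algorithm   Total citations: 2, self-citations: 0, citations by others: 2
By analyzing popular clustering algorithms such as FCM, PCM, IPCM, and PFCM and the problems they face in noisy environments, a new possibilistic fuzzy clustering algorithm (SWPFCM) is proposed. The algorithm combines sample weighting with a cluster-center initialization method suited to noisy environments, and it can effectively eliminate the influence of noise on the clustering result. Experiments show that SWPFCM can handle large amounts of noisy data. When there is no noise or very little noise its advantage is not evident, but when noise appears in the target sample set, clustering with SWPFCM gives satisfactory results.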

18.
A possibilistic approach was initially proposed for c-means clustering. Although the possibilistic approach is sound, this algorithm tends to find identical clusters. To overcome this shortcoming, a possibilistic Fuzzy c-means algorithm (PFCM) was proposed which produced memberships and possibilities simultaneously, along with the cluster centers. PFCM addresses the noise sensitivity defect of Fuzzy c-means (FCM) and overcomes the coincident cluster problem of possibilistic c-means (PCM). Here we propose a new model called Kernel-based hybrid c-means clustering (KPFCM) where PFCM is extended by adopting a Kernel induced metric in the data space to replace the original Euclidean norm metric. Use of a Kernel function makes it possible to cluster data that is linearly non-separable in the original space into homogeneous groups in the transformed high dimensional space. From our experiments, we found that different Kernels with different Kernel widths lead to different clustering results. Thus a key point is to choose an appropriate Kernel width. We have also proposed a simple approach to determine the appropriate values for the Kernel width. The performance of the proposed method has been extensively compared with a few state-of-the-art clustering techniques over a test suite of several artificial and real-life data sets. Based on computer simulations, we have shown that our model gives better results than the previous models.
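
To make the idea of a kernel-induced metric concrete, here is a minimal sketch that assumes a Gaussian kernel of width `sigma`. The abstract does not name the kernel, so both the kernel form and the function names are illustrative assumptions.

```python
import math

def gaussian_kernel(x, v, sigma):
    """K(x, v) = exp(-||x - v||^2 / sigma^2); sigma is the kernel width."""
    return math.exp(-math.dist(x, v) ** 2 / sigma ** 2)

def kernel_distance_sq(x, v, sigma):
    """Squared distance between x and v in the kernel-induced feature space."""
    # ||phi(x) - phi(v)||^2 = K(x, x) - 2 K(x, v) + K(v, v) = 2 (1 - K(x, v)),
    # because K(z, z) = 1 for the Gaussian kernel.
    return 2.0 * (1.0 - gaussian_kernel(x, v, sigma))
```

Under this assumption the induced distance is bounded above by 2, so a far-away outlier cannot dominate a center update the way it can with the squared Euclidean norm. The width `sigma` sets how quickly the distance saturates, which fits the abstract's remark that different kernel widths lead to different clustering results.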

19.

This paper presents a new method based on fuzzy cognitive map (FCM) and possibilistic fuzzy c-means (PFCM) clustering algorithm for categorizing celiac disease (CD). CD is a complex disorder whose development is affected by genetics (HLA alleles) and gluten ingestion. The celiac patients who are not treated are at a high risk of cancer, malignant lymphoma, and small bowel neoplasia. Therefore, CD diagnosis and grading are of paramount importance. The proposed FCM models human thinking for the purpose of classifying patients suffering from CD. We used the latest grading method where three grades A, B1, and B2 are used. To improve FCM efficiency and classification capability, a nonlinear Hebbian learning algorithm is applied for adjusting the FCM weights. To this end, 89 cases are studied. Three experts extracted seven main determinant characteristics of CD which were considered as FCM concepts. The mutual effects of these concepts on one another and on the final concept were expressed in the form of fuzzy rules and linguistic variables. Using the center of gravity defuzzifier, we obtained the numerical values of these weights and obtained the total weight matrix. Ultimately, combining the FCM model with PFCM algorithm, we obtained the grades A, B1, and B2 accuracies as 88, 90, and 91%, respectively. The main advantage of the proposed FCM is the good transparency and interpretability in the decision-making procedure, which make it a suitable tool for daily usage in the clinical practice.


20.
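The clustering problem is examined from the perspective of running the K-Means clustering algorithm and the FCM algorithm in combination. To address the randomness in initializing the FCM membership matrix, a hybrid means clustering algorithm is proposed. While the hybrid algorithm runs, the clustering result of the former is used to set the initial centers of the latter, and the initial FCM membership matrix is computed from these centers. Running FCM then completes the clustering of the data set. Experimental results show that the hybrid means algorithm performs better than FCM alone.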
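
The hand-off from K-Means to FCM described here can be sketched as follows, using the standard FCM membership and center updates. The function names, the fuzzifier default `m=2.0`, and the stopping rule are assumptions for illustration; the K-Means step itself is not shown and is represented only by the `init_centers` argument.

```python
import math

def memberships(points, centers, m=2.0):
    """FCM membership matrix u[k][i] of point k in cluster i for fixed centers."""
    u = []
    for p in points:
        d = [math.dist(p, c) for c in centers]
        if 0.0 in d:
            # The point coincides with a center: full membership there.
            hit = d.index(0.0)
            u.append([1.0 if i == hit else 0.0 for i in range(len(centers))])
        else:
            w = [di ** (-2.0 / (m - 1.0)) for di in d]
            total = sum(w)
            u.append([wi / total for wi in w])
    return u

def update_centers(points, u, m=2.0):
    """Membership-weighted means: v_i = sum_k u_ki^m x_k / sum_k u_ki^m."""
    dim, c = len(points[0]), len(u[0])
    centers = []
    for i in range(c):
        w = [row[i] ** m for row in u]
        total = sum(w)
        centers.append(tuple(sum(wk * p[d] for wk, p in zip(w, points)) / total
                             for d in range(dim)))
    return centers

def fcm_from_centers(points, init_centers, m=2.0, tol=1e-6, max_iter=100):
    """Run FCM starting from given centers, e.g. the output of K-Means."""
    centers = [tuple(c) for c in init_centers]
    for _ in range(max_iter):
        u = memberships(points, centers, m)
        new_centers = update_centers(points, u, m)
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:  # stop once no center moves noticeably
            break
    return centers, memberships(points, centers, m)
```

Computing the first membership matrix from the supplied centers, rather than drawing it at random, makes repeated runs on the same data start from the same state.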
