首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.
一种快速山峰聚类算法*   总被引:1,自引:1,他引:0  
山峰聚类既可以对数据集进行近似聚类,又可以为其他聚类方法提供聚类所需的初始聚类中心。减法聚类是山峰聚类的改进,它避免了山峰聚类中出现的计算量随样本维数增加呈指数增长的情况。但减法聚类对处理大样本集也力不从心。引入了P-tree数据结构,对高维大样本集进行分解,然后用减法聚类对子样本集进行聚类。此算法既避免了山峰聚类的维数灾难问题,也解决了减法聚类中样本数太大的问题。实验结果证明,该算法有效地减少了运算量,提高了聚类的速度。  相似文献   

2.
数据挖掘中聚类算法研究进展   总被引:6,自引:0,他引:6  
聚类分析是数据挖掘中重要的研究内容之一,对聚类准则进行了总结,对五类传统的聚类算法的研究现状和进展进行了较为全面的总结,就一些新的聚类算法进行了梳理,根据样本归属关系、样本数据预处理、样本的相似性度量、样本的更新策略、样本的高维性和与其他学科的融合等六个方面对聚类中近20多个新算法,如粒度聚类、不确定聚类、量子聚类、核聚类、谱聚类、聚类集成、概念聚类、球壳聚类、仿射聚类、数据流聚类等,分别进行了详细的概括。这对聚类是一个很好的总结,对聚类的发展具有积极意义。  相似文献   

3.
文本聚类中基于密度聚类算法的研究与改进   总被引:1,自引:0,他引:1  
文本聚类在很多领域都有广泛应用,而聚类算法作为文本聚类的核心直接决定了聚类的效果和效率。结合基于划分的聚类算法和基于密度的聚类算法的优点,提出了基于密度的聚类算法DBCKNN。算法利用了k近邻和离群度等概念,能够迅速确定数据集中每类的中心及其类半径,在保证聚类效果的基础上提高了聚类效率。  相似文献   

4.
在减法聚类中一般只采用固定聚类半径训练聚类的做法,这样就可能得到过多或过少的聚类中心。为了提高聚类结果的最优性,从数据样本空间密度疏密程度考虑出发,改进了减法聚类算法,不仅改善了原来聚类半径只能固定的做法,而且适应于聚类大小不同时的数据聚类,使得调节参数更少,并结合模糊C-均值(FCM)聚类可以获得更为合理和准确的聚类个数和聚类中心,在一定程度上拓展了聚类算法的适用性。最后通过仿真证明了改进算法的有效性。  相似文献   

5.
半监督聚类是对原有聚类算法进行改进来提高聚类质量。先验知识以约束的形式对聚类进行指导。本文主要对半监督学习中的聚类算法展开研究,分析约束的主要作用及其影响方式,并在此基础上提出新的半监督聚类算法。在对半监督聚类的现状进行了分析后,本文提出了一种新的半监督聚类--诱导半监督聚类。该半监督聚类算法通过对约束的选择,改变其监督聚类的方向,获得质量可信的具有最大相异性的聚类结果。  相似文献   

6.
文本聚类技术的有效性验证   总被引:3,自引:0,他引:3       下载免费PDF全文
讨论了利用分类测试集进行聚类量化评价的标准。在此基础上选择k-Means聚类算法、STC(后缀树聚类)算法和基于Ant的聚类算法进行了实验对比。实验表明,STC聚类算法在处理文本时充分考虑了文本的特性,其聚类效果较好;基于Ant的聚类算法在聚类的划分时效果受参数输入的影响较大,其聚类结果与STC相比并不具有优势;在Ant聚类算法中引入文本特性后,可以提高文本聚类的效果。  相似文献   

7.
文本聚类在文本挖掘和信息检索系统中发挥着重要的作用,而词聚类是文本聚类的基础。提出了一种基于混合聚类的中文词聚类方法,它将层次聚类和概念聚类结合起来,以缩短整个聚类时间。首先对预处理后的词集进行初始聚类,然后从每个类中各取一个出现次数最多的词组成新的词集,最后对该词集进行再聚类。实验表明,这种方法有效降低了中文词聚类的时间复杂度。  相似文献   

8.
杨辉  彭晗  朱建勇  聂飞平 《计算机仿真》2021,38(8):328-332,343
谱聚类可以任意形状的数据进行聚类,在聚类集成中能够有效的提高基聚类的质量.以往的聚类集成算法中,聚类集成得到的结果并不是最终聚类结果,还需要利用聚类算法来获得最终聚类结果,在整个过程中会使得解由离散-连续-离散的转变.提出了一种基于谱聚类的双边聚类集成算法.算法首先在生成阶段使用谱聚类算法来获得基聚类,通过标准互信息来选取基聚类.将选出来基聚类和样本作为图的顶点,并对构建的图利用双边聚类算法对基聚类和样本同时聚类直接得到最终聚类结果.在实验中,将所提方法与一些聚类集成算法进行了比较,取得了较好的结果.  相似文献   

9.
传统的谱聚类算法对初始化敏感,针对这个缺陷,引入Canopy算法对样本进行“粗”聚类得到初始聚类中心点,将结果作为K-Means算法的输入,提出了一种基于Canopy和谱聚类融合的聚类算法(Canopy-SC),减少了传统谱聚类算法选择初始中心点的盲目性,并将其用于人脸图像聚类。与传统的谱聚类算法相比,Canopy-SC算法能够得到较好的聚类中心和聚类结果,同时具有更高的聚类精确度。实验结果表明了该算法的有效性和可行性。  相似文献   

10.
子空间聚类改进算法研究综述   总被引:1,自引:0,他引:1  
李霞  徐树维 《计算机仿真》2010,27(5):174-177
高维数据聚类是聚类技术的难点和重点,子空间聚类是实现高维数据集聚类的有效途径。CLIQUE算法是最早提出的基于密度和网格的子空间聚类算法,自动子空间聚类算法的实用性和高效性,带来了子空间聚类算法的空前发展。深入分析CLIQUE算法的优点和局限性;介绍了一些近几年提出的子空间聚类算法,并针对CLIQUE算法的局限性作了改进,聚类的效率和精确性得到了提高;最后对子空间聚类算法的发展趋势进行了讨论。  相似文献   

11.
Part I of this paper proposes a definition of the adaptive resonance theory (ART) class of constructive unsupervised on-line learning clustering networks. Class ART generalizes several well-known clustering models, e.g., ART 1, improved ART 1, adaptive Hamming net (AHN), and Fuzzy ART, which are optimized in terms of memory storage and/or computation time. Next, the symmetric Fuzzy ART (S-Fuzzy ART) network is presented as a possible improvement over Fuzzy ART. As a generalization of S-Fuzzy ART, the simplified adaptive resonance theory (SART) group of ART algorithms is defined. Gaussian ART (GART), which is found in the literature, is presented as one more instance of class SART. In Part II of this work, a novel SART network, called fully self-organizing SART (FOSART), is proposed and compared with Fuzzy ART, S-Fuzzy ART, GART and other well-known clustering algorithms. Results of our comparison may easily extend to the ARTMAP supervised learning framework.  相似文献   

12.
喻敏  吴江 《计算机科学》2011,38(9):190-192
客户信用评佑对于银行的经营管理有着重要的意义,为此提出了一种基于多进化神经网络的信用评估模型(MNN-CREDIT)。该模型基于客户信货数据,利用基于聚类的小生境遗传算法并行地训练出多个精度高、差异性大的三层前馈神经网络,然后将待识别的客户数据分别输入,最后根据动态投票法集成最终信用预测结果。利用德国信用数据库真实数据集进行了实证分析,结果表明,基于多进化神经网络的信用评估模型具有较高的预测精度。  相似文献   

13.
In this study, a credit risk concentration allocation model is developed for the banks to determine the credit risk concentration limits of their regional head’s. The proposed model is based on the Fuzzy TOPSIS (FTOPSIS) and Linear Programming (LP) approaches. FTOPSIS is easy to use and capable to keep tract of decision made in reaching the overall score by combining different types of criteria. LP combines the results of FTOPSIS and other constraints and objectives determined by the bank. Using FTOPSIS and LP together in the same model brings uniformity and a structure in credit risk concentration decisions of the banks. The developed model is tested with a real case banking application and satisfactory results are obtained. An application is also provided in the paper for illustrative purposes.  相似文献   

14.
The modified fuzzy art and a two-stage clustering approach to cell design   总被引:1,自引:0,他引:1  
This study presents a new pattern recognition neural network for clustering problems, and illustrates its use for machine cell design in group technology. The proposed algorithm involves modifications of the learning procedure and resonance test of the Fuzzy ART neural network. These modifications enable the neural network to process integer values rather than binary valued inputs or the values in the interval [0, 1], and improve the clustering performance of the neural network. A two-stage clustering approach is also developed in order to obtain an informative and intelligent decision for the problem of designing a machine cell. At the first stage, we identify the part families with very similar parts (i.e., high similarity exists in their processing requirements), and the resultant part families are input to the second stage, which forms the groups of machines. Experimental studies show that the proposed approach leads to better results in comparison with those produced by the Fuzzy ART and other similar neural network classifiers.  相似文献   

15.
为科学测评科技企业孵化器的人才孵化效率,提出科技企业人才孵化效率S-FNN评价模型。在研究科技企业人才孵化基本过程的基础上,构建了科技企业人才孵化效率评价指标体系;运用模糊减聚类法对网络规则进行处理,减少神经网络规则数目,以免参数膨胀导致网络难以训练,采用粒子群与遗传算法确定模糊网络参数值;运用模糊神经网络对科技企业人才孵化效率进行综合评价;通过应用案例验证了该评价模型的有效性和可行性。  相似文献   

16.
For pt.I see ibid., p.645-61 (2002). Part I of this paper defines the class of constructive unsupervised on-line learning simplified adaptive resonance theory (SART) clustering networks. Proposed instances of class SART are the symmetric fuzzy ART (S-Fuzzy ART) and the Gaussian ART (GART) network. In Part II of our work, a third network belonging to class SART, termed fully self-organizing SART (FOSART), is presented and discussed. FOSART is a constructive, soft-to-hard competitive, topology-preserving, minimum-distance-to-means clustering algorithm capable of: 1) generating processing units and lateral connections on an example-driven basis and 2) removing processing units and lateral connections on a minibatch basis. FOSART is compared with Fuzzy ART, S-Fuzzy ART, GART and other well-known clustering techniques (e.g., neural gas and self-organizing map) in several unsupervised learning tasks, such as vector quantization, perceptual grouping and 3-D surface reconstruction. These experiments prove that when compared with other unsupervised learning networks, FOSART provides an interesting balance between easy user interaction, performance accuracy, efficiency, robustness, and flexibility  相似文献   

17.
研究企业信用风险评估准确性问题,企业存在产品质量、不良贷款等信用风险问题,企业信用风险是多种因素的综合结果,存在着不确定、非线性、随机性等特点,无法建立确定数学评估模型。只能根据专家评估指标为依据。为了提高企业信用风险评估准确率,提出一种BP神经网络的企业信用风险评估方法。先采用层次分析法构建风险评估指标体系,再用专家系统对评估指标进行量化打分,最后采用BP神经网络对企业信用风险指标进行非线性学习,并对企业信用风险等级进行评估。实验结果表明,BP神经网络的企业信用风险评估模模型能显著提高评估准确率,并能够反映企业信用风险的随机性变化特点,使评估结果更加符合实际情况,为企业信用风险评估提供了参考。  相似文献   

18.
The main goal of all commercial banks is to collect the savings of legal and real persons and allocate them as credit to industrial, services and production companies. Non repayment of such credits cause many problems to the banks such as incapability to repay the central bank’s loans, increasing the amount of credit allocations comparing to credit repayment and incapability to allocate more credits to customers. The importance of credit allocation in banking industry and it’s important role in economic growth and employment creation leads the development of many models to evaluate the credit risk of applicants. But many of these models are classic and are incapable to do credit evaluation completely and efficiently. Therefore the demand to use artificial intelligence in this field has grown up. In this paper after providing appropriate credit ranking model and collecting expert’s knowledge, we design a hybrid intelligent system for credit ranking using reasoning-transformational models. Expert system as symbolic module and artificial neural network as non-symbolic module are components of this hybrid system. Such models provide the unique features of each components, the reasoning and explanation of expert system and the generalization and adaptability of artificial neural networks. The results of this system demonstrate hybrid intelligence system is more accurate and powerful in credit ranking comparing to expert systems and traditional banking models.  相似文献   

19.
基于改进ART2算法的数据聚类方法   总被引:1,自引:0,他引:1  
通过分析经典自适应谐振(adaptive resonance theory,ART)神经网络聚类过程中向量规格化、警戒参数全局化等特点,指出其无法应用于向量的模是与类别特征相关的情况、不能根据空间密度变化以不同粒度划分空间、输出无层次结构等不足,提出一种改进ART2神经网络算法.该算法提出警戒参数局部化和以模为标准的神经元预选,通过循环形成与模相关的多层次动态聚类结构(粗粒度的聚类不需重新训练神经网络),除此还降低对警戒参数主观设置的要求;这些优点能够适应模是重要特征的空间中子空间分别聚类的情况.  相似文献   

20.
对模糊ART神经网络模型中的类别选择方法进行改进,并在权值向量的修改规则中引入隶属度,得到一种改进的Fuzzy ART学习算法。IRIS数据分类结果证明了新方法的可行性。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号