首页 | 本学科首页   官方微博 | 高级检索  
     

聚类算法研究综述
引用本文:陈新泉,周灵晶,刘耀中.聚类算法研究综述[J].集成技术,2017,6(3):41-49.
作者姓名:陈新泉  周灵晶  刘耀中
作者单位:1. 重庆三峡学院智能信息处理与控制重点实验室 重庆404100;电子科技大学大数据研究中心 成都611731;2. 重庆三峡学院智能信息处理与控制重点实验室 重庆404100;3. 中国石油塔里木油田分公司 库尔勒841000
基金项目:重庆市基础与前沿研究计划项目,重庆三峡学院科学研究项目计划资助,重庆市高校市级重点实验室资助项目
摘    要:聚类是数据挖掘研究领域的一种重要数据预处理方法,其目的是从无标签数据集中获得有价值数据集的内在分布结构,进而简化数据集的描述.历经几十年的研究,针对不同应用和数据特性己出现了千余种不同的聚类算法,但不同的聚类算法都有其特定的适用范围和不足.传统的聚类算法大致可分为划分聚类方法、层次聚类方法、密度聚类方法、网格聚类方法、模型聚类方法等.通过对传统聚类方法的回顾和总结,文章重点介绍了近年来出现的同步聚类算法、信念传播聚类算法和密度峰值聚类算法,并针对以上聚类算法的应用及发展方向进行了论述.

关 键 词:数据挖掘  聚类  信念传播  同步聚类  密度峰值

Review on Clustering Algorithms
Authors:CHEN Xinquan  ZHOU Lingjing and LIU Yaozhong
Abstract:Clustering is an important research topic in data mining domain for data preprocessing. Clustering is an unsupervised learning method that tries to find out some obvious clusters in the unlabeled data. It is usually performed by maximizing the similarity of inner-clusters and minimizing the similarity of inter-clusters. A lot of clustering algorithms have been proposed to solve various tasks and data properties in the past decades. However, all existing clustering methods have their own pros and cons, and there still lack of a clustering method with universality. Traditional clustering methods are usually classified into partitioning methods, hierarchical methods, density-based methods, grid-based methods and model-based methods. With a brief review to classical clustering methods, we put emphasis on introducing some recent emerging clustering methods like synchronization clustering algorithm, affinity propagation algorithm and density peaks algorithm. Based on the analysis and comparison of these algorithms, their potential applications and research directions are also discussed.
Keywords:data mining  clustering  affinity propagation  synchronization clustering  density peak
本文献已被 万方数据 等数据库收录!
点击此处可从《集成技术》浏览原始摘要信息
点击此处可从《集成技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号