首页 | 本学科首页   官方微博 | 高级检索  
     

一种去除聚类数量和邻域参数设置的自适应聚类算法
引用本文:张柏恺,杨德刚,冯骥.一种去除聚类数量和邻域参数设置的自适应聚类算法[J].计算机工程与科学,2021,43(10):1838-1847.
作者姓名:张柏恺  杨德刚  冯骥
作者单位:(1.重庆师范大学计算机与信息科学学院,重庆 401331; 2.教育大数据智能感知与应用重庆市工程研究中心,重庆 401331)
基金项目:教育部人文社会科学研究项目(18XJC880002,20YJAZH084);重庆市教委科学技术研究项目(KJQN201800539);重庆市基础科学与前沿技术项目(cstc2016jcyjA0419)
摘    要:传统聚类方法往往无法避免邻域参数和聚类数量的选择问题,而这些参数在不同形状的数据中的最优选择也不尽相同,需要根据大量先验知识确定合适的参数选择范围.针对上述参数选择问题,提出了一种基于自然邻居思想的边界剥离聚类算法NaN-BP,能够在无需设置邻域参数和聚类数量的情况下得到令人满意的聚类结果.算法核心思想是首先根据数据集的分布特征,自适应迭代至对数稳定状态并获取邻域信息,并根据该邻域信息进行边界点的标记与剥离,最终以核心点为数据簇中心进行聚类.在不同规模不同分布的数据集上进行了广泛的对比实验,实验结果表明了NaN-BP的自适应性和有效性,取得了令人满意的实验结果.

关 键 词:聚类分析  自适应  自然邻居  对数稳定状态  核心点  
收稿时间:2020-08-06
修稿时间:2020-11-23

A self-adaptive clustering algorithm without neighborhood parameter k and cluster number c
ZHANG Bo-kai,YANG De-gang,FENG Ji.A self-adaptive clustering algorithm without neighborhood parameter k and cluster number c[J].Computer Engineering & Science,2021,43(10):1838-1847.
Authors:ZHANG Bo-kai  YANG De-gang  FENG Ji
Affiliation:(1.College of Computer and Information Science,Chongqing Normal University,Chongqing 401331; 2.Chongqing Engineering Research Center of  Educational Big Data Intelligent Perception and Application,Chongqing 401331,China)
Abstract:Traditional clustering methods often cannot avoid the selection of neighborhood parameters and the number of clusters. The optimal selection of these parameters in different shapes of data is hard to choose, and this choice is depending on prior knowledge. Aiming at the above parameter selection problem, this paper proposes a natural neighbors based border peeling clustering algorithm (NaN-BP), which can obtain satisfactory clustering results without setting the neighborhood parameters and the number of clusters. The core idea of the algorithm is to adaptively iterate to a logarithmic stable state and obtain neighborhood information according to the distribution characteristics of the data set, then mark and strip the boundary points according to the neighborhood information, and finally gather the core points as the center of the data cluster. Extensive comparative experiments is conducted on data sets of different scales and distributions, and satisfactory experimental results verify the adaptability and effectiveness of the algorithm.
Keywords:clustering analysis  self-adaptive  natural neighbor  logarithmic steady state  core point  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号