首页 | 本学科首页   官方微博 | 高级检索  
     

基于密度的划分式聚类过程参数选择算法
引用本文:吴杨,王韬,李进东.基于密度的划分式聚类过程参数选择算法[J].控制与决策,2016,31(1):21-29.
作者姓名:吴杨  王韬  李进东
作者单位:中国人民解放军军械工程学院信息工程系,石家庄050003.
基金项目:

国家自然科学基金项目(61173191);军内科研项目(YJJXM12033).

摘    要:

为确定??-means 等聚类算法的初始聚类中心, 首先由样本总量及其取值区间长度确定对应维上的样本密度统计区间数, 并将满足筛选条件的密度峰值所在区间内的样本均值作为候选初始聚类中心; 然后, 根据密度峰值区间在各维上的映射关系建立候选初始聚类中心关系树, 进一步采用最大最小距离算法获得初始聚类中心; 最后为确定最佳聚类数, 基于类内样本密度及类密度建立聚类有效性评估函数. 针对人工数据集及UCI 数据集的实验结果表明了所提出算法的有效性.



关 键 词:

聚类算法|聚类中心|样本密度|关系树|最大最小距离

收稿时间:2014/10/19 0:00:00
修稿时间:2015/1/26 0:00:00

Clustering parameters selection algorithm based on density for divisional clustering process
WU Yang WANG Tao LI Jin-dong.Clustering parameters selection algorithm based on density for divisional clustering process[J].Control and Decision,2016,31(1):21-29.
Authors:WU Yang WANG Tao LI Jin-dong
Abstract:

In order to select the initial clustering centers for the divisional clustering algorithm such as the ??-means algorithm, the sample density calculating regions number of each dimension is confirmed according to the samples number and their values, firstly. Then, the average value of the samples of the region with peak value satisfying the filtering conditions is taken as the candidate for the initial clustering center, and a relationship tree of the candidates is established on the mapping relations of the regions. Furthermore, the initial clustering centers are selected by using the maximal-minimal distance algorithm. To confirm the best number of the clusters, a clustering quality evaluation function is established according to the sample density and cluster density. Experiment results of the manual and UCI data sets show the effectiveness of the proposed algorithms.

Keywords:

clustering algorithm|clustering center|sample density|relationship tree|maximal-minimal distance

点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号