首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的粗k 均值聚类算法
引用本文:王 莉,周献中,沈 捷.一种改进的粗k 均值聚类算法[J].控制与决策,2012,27(11):1711-1714.
作者姓名:王 莉  周献中  沈 捷
作者单位:南京大学工程管理学院;南京工业大学自动化与电气工程学院
基金项目:国家自然科学基金项目(70971062);东南大学复杂工程系统测量与控制教育部重点实验室开放课题(2010A004)
摘    要:Lingras提出的粗K均值聚类算法易受随机初始聚类中心和离群点的影响,可能出现一致性和无法收敛的聚类结果.对此,提出一种改进的粗K均值算法,选择潜能最大的K个对象作为初始的聚类中心,根据数据对象与聚类中心的相对距离来确定其上下近似归属,使边界区域的划分更合理.定义了广义分类正确率,该指标同时考虑了下近似集和边界区域中的对象,评价算法性能更准确.仿真实验结果表明,该算法分类正确率高,收敛速度快,能够克服离群点的不利影响.

关 键 词:聚类  粗糙集  粗K均值  广义分类正确率
收稿时间:2011/5/19 0:00:00
修稿时间:2011/11/4 0:00:00

An improved rough k-means clustering algorithm
WANG Li,ZHOU Xian-zhong,SHEN Jie.An improved rough k-means clustering algorithm[J].Control and Decision,2012,27(11):1711-1714.
Authors:WANG Li  ZHOU Xian-zhong  SHEN Jie
Affiliation:1.School of Engineering and Management,Nanjing University,Nanjing 210093,China;2.School of Automation and Electrical Engineering,Nanjing University of Technology,Nanjing 210009,China.)
Abstract:

Rough k-means clustering algorithm proposed by Lingras is sensitive to the initial centers of the k cluster and
outliers and may result in identical clustering and non-convergence. In this paper, an improved rough k-means clustering
algorithm is proposed. The k objects with maximum potentials are chosen as initial centers. The absolute distance between
object and center of clusters is considered to decide whether a data object belongs to the lower or upper approximation set
of a cluster, so the division of boundary area is more reasonable. General classification accuracy considering the objects in
lower approximation set and boundary area is defined for rough k-means clustering algorithm, and it is more appropriate for
evaluating rough k means clustering. The simulation results show that, the proposed algorithm has the advantages of high
classification accuracy and fast convergence, and can also avoid the bad influence of outlier.

Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号