首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的区间型不确定数据模糊聚类方法
引用本文:肖满生,张龙信,张晓丽,胡永祥.一种改进的区间型不确定数据模糊聚类方法[J].电子与信息学报,2020,42(8):1968-1974.
作者姓名:肖满生  张龙信  张晓丽  胡永祥
作者单位:湖南工业大学 计算机学院 株洲 412007
基金项目:国家自然科学基金(61702178),湖南省自然科学基金(2018JJ4068),湖南省教育厅科研项目(18C0499)
摘    要:针对区间型不确定数据的特点,该文提出一种改进的模糊C均值聚类算法(IU-IFCM)。首先对区间型数据进行特征变换,由p维特征映射成由2p维特征组成的实数据,然后考虑区间中值与区间大小关系,设计一种样本距离计算方法,通过模糊C均值实现对区间型样本聚类。理论分析与对比实验表明,该算法的划分系数(PC)及正确等级(CR)值比其它方法平均提高10%以上,表明有更好的聚类精度,对当前大数据环境下不确定数据的分类提供了一种新的解决方案。

关 键 词:区间型数据    模糊C均值    影响因子    特征变换
收稿时间:2019-08-06

An Improved Fuzzy Clustering Method for Interval Uncertain Data
Mansheng XIAO,Longxin ZHANG,Xiaoli ZHANG,Yongxiang HU.An Improved Fuzzy Clustering Method for Interval Uncertain Data[J].Journal of Electronics & Information Technology,2020,42(8):1968-1974.
Authors:Mansheng XIAO  Longxin ZHANG  Xiaoli ZHANG  Yongxiang HU
Affiliation:School of Computer Science, Hunan University of Technology, Zhuzhou 412007, China
Abstract:An Improved Fuzzy C-Means clustering algorithm (IU-IFCM) is proposed in this study in accordance with the characteristics of Interval Uncertain data. First, the interval data is transformed into real data composed of 2p dimension feature, which is mapped from that of p dimension feature. Second, a method for calculating sample distance, which realizes the interval sample clustering by fuzzy c-mean algorithm, is designed while considering the relationship between interval median value and interval size. Theoretical analysis and comparison experiments show that the presented algorithm surpaes the compared algorithms by more than 10% on average in terms of the Partition Coefficient (PC) and Correct Rank(CR) value. These results indicate that the algorithm presents in this study has better clustering accuracy and provides a new solution for the classification of uncertain data in current big data environments.
Keywords:
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号