首页 | 本学科首页   官方微博 | 高级检索  
     

基于网格耦合的数据流聚类
引用本文:张东月,周丽华,吴湘云,赵丽红.基于网格耦合的数据流聚类[J].软件学报,2019,30(3):667-683.
作者姓名:张东月  周丽华  吴湘云  赵丽红
作者单位:云南大学 信息学院, 云南 昆明 650000,云南大学 信息学院, 云南 昆明 650000,丽江师范高等专科学校, 云南 丽江 674199,云南大学 信息学院, 云南 昆明 650000
基金项目:国家自然科学基金(61762090,61262069,61472346,61662086);云南省自然科学基金(2016FA026,2015FB114);云南省创新研究团队项目(2018HC019);云南省高等学校科技创新团队项目(IRTSTYN)
摘    要:随着越来越多的应用程序产生数据流,数据流聚类分析的研究受到了广泛关注.基于网格的聚类通过将数据流映射到网格结构中形成数据概要,进而对概要进行聚类.这种方法通常具有较高的效率,但是每个网格独立处理,没有考虑网格之间的相互影响,因此聚类质量有待提高.在聚类过程中不再独立处理网格,而是考虑了网格之间的耦合关系,提出了一种基于网格耦合的数据流聚类算法.网格的耦合更加准确地表达了数据之间的相关性,从而提高了聚类的质量.在合成和真实数据流上的实验结果表明,所提算法具有较高的聚类质量和效率.

关 键 词:数据流  聚类分析  网格耦合  网格结构  聚类质量
收稿时间:2018/7/20 0:00:00
修稿时间:2018/9/20 0:00:00

Data Stream Clustering Based on Grid Coupling
ZHANG Dong-Yue,ZHOU Li-Hu,WU Xiang-Yun and ZHAO Li-Hong.Data Stream Clustering Based on Grid Coupling[J].Journal of Software,2019,30(3):667-683.
Authors:ZHANG Dong-Yue  ZHOU Li-Hu  WU Xiang-Yun and ZHAO Li-Hong
Affiliation:School of Information Science and Engineering, Yunnan University, Kunming 650000, China,School of Information Science and Engineering, Yunnan University, Kunming 650000, China,Lijiang Teachers College, Lijiang 674199, China and School of Information Science and Engineering, Yunnan University, Kunming 650000, China
Abstract:As more and more applications generate data streams, the research on data stream clustering analysis has received extensive attention. Grid-based clustering maps data streams into grid structures to form data summaries, and then clusters data summaries. This method usually has high efficiency, but each grid is processed independently, and the interaction between the grids is not considered, so the clustering quality needs to be improved. In this study, the coupling relationship between grids is considered rather than processed independently in the clustering process, and an algorithm for clustering data stream based on grid coupling is proposed. The proposed approach improves the quality of clusters as the coupling of the grid more accurately captures the correlation amongst the data. Experimental evaluations on synthetic and real data streams illustrate the superiority of the proposed approach compared with the state-of-the-arts approaches.
Keywords:data stream  clustering analysis  grid coupling  grid structure  the quality of cluster
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号