首页 | 本学科首页   官方微博 | 高级检索  
     


A time-efficient pattern reduction algorithm for k-means clustering
Authors:Ming-Chao Chiang [Author Vitae]  Chun-Wei Tsai [Author Vitae]  Chu-Sing Yang [Author Vitae]
Affiliation:a Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung 80424, Taiwan, ROC
b Department of Electrical Engineering, National Cheng Kung University, Tainan 70101, Taiwan, ROC
c Department of Applied Geoinformatics, Chia Nan University of Pharmacy & Science, Tainan 71710, Taiwan, ROC
Abstract:This paper presents an efficient algorithm, called pattern reduction (PR), for reducing the computation time of k-means and k-means-based clustering algorithms. The proposed algorithm works by compressing and removing at each iteration patterns that are unlikely to change their membership thereafter. Not only is the proposed algorithm simple and easy to implement, but it can also be applied to many other iterative clustering algorithms such as kernel-based and population-based clustering algorithms. Our experiments—from 2 to 1000 dimensions and 150 to 10,000,000 patterns—indicate that with a small loss of quality, the proposed algorithm can significantly reduce the computation time of all state-of-the-art clustering algorithms evaluated in this paper, especially for large and high-dimensional data sets.
Keywords:Data clustering  k-means  Pattern reduction
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号