首页 | 本学科首页   官方微博 | 高级检索  
     


A new hybrid method based on partitioning-based DBSCAN and ant clustering
Authors:Hua Jiang  Jing Li  Shenghe Yi  Xiangyang Wang  Xin Hu
Affiliation:1. Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran;2. Future Technology Research Center, National Yunlin University of Science and Technology, 123 University Road, Section 3, Douliou, Yunlin 64002, Taiwan, Republic of China;3. Department of Computer Engineering, Tabriz Branch, Islamic Azad University, Tabriz, Iran;4. Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam;5. Mental Health Research Center, Psychosocial Health Research Institue, Iran University of Medical Sciences, Tehran, Iran;6. Department of Information Technology, University of Human Development, Sulaymaniyah, Iraq;1. Computer Science Department, Federal University of the State of Espirito Santo, Vitória, Brazil;2. Statistical Department, Federal University of the State of Espirito Santo, Vitória, Brazil
Abstract:Clustering problem is an unsupervised learning problem. It is a procedure that partition data objects into matching clusters. The data objects in the same cluster are quite similar to each other and dissimilar in the other clusters. Density-based clustering algorithms find clusters based on density of data points in a region. DBSCAN algorithm is one of the density-based clustering algorithms. It can discover clusters with arbitrary shapes and only requires two input parameters. DBSCAN has been proved to be very effective for analyzing large and complex spatial databases. However, DBSCAN needs large volume of memory support and often has difficulties with high-dimensional data and clusters of very different densities. So, partitioning-based DBSCAN algorithm (PDBSCAN) was proposed to solve these problems. But PDBSCAN will get poor result when the density of data is non-uniform. Meanwhile, to some extent, DBSCAN and PDBSCAN are both sensitive to the initial parameters. In this paper, we propose a new hybrid algorithm based on PDBSCAN. We use modified ant clustering algorithm (ACA) and design a new partitioning algorithm based on ‘point density’ (PD) in data preprocessing phase. We name the new hybrid algorithm PACA-DBSCAN. The performance of PACA-DBSCAN is compared with DBSCAN and PDBSCAN on five data sets. Experimental results indicate the superiority of PACA-DBSCAN algorithm.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号