首页 | 本学科首页   官方微博 | 高级检索  
     


Distance and density based clustering algorithm using Gaussian kernel
Affiliation:1. Faculty of Information Technology, Ho Chi Minh City University of Technology, Ho Chi Minh City, Vietnam;2. College of Electronics and Information Engineering, Sejong University, Seoul, Republic of Korea;3. Division of Data Science, Ton Duc Thang University, Ho Chi Minh City, Vietnam;4. Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam;5. Department of Electrical and Computer Engineering, University of Alberta, Edmonton, T6R 2V4 AB, Canada;6. Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah, 21589, Saudi Arabia;7. Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland;1. Department of Economics and Statistics, University of Naples Federico II, Italy;2. Department of Industrial Engineering, University of Naples Federico II, Italy;1. University of Hildesheim, Universitätsplatz 1, 31141 Hildesheim, Germany;2. University of Eichstätt, Ingolstadt, Germany;1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;2. Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Education, Changchun 130012, China
Abstract:Clustering is an important field for making data meaningful at various applications such as processing satellite images, extracting information from financial data or even processing data in social sciences. This paper presents a new clustering approach called Gaussian Density Distance (GDD) clustering algorithm based on distance and density properties of sample space. The novel part of the method is to find best possible clusters without any prior information and parameters. Another novel part of the algorithm is that it forms clusters very close to human clustering perception when executed on two dimensional data. GDD has some similarities with today’s most popular clustering algorithms; however, it uses both Gaussian kernel and distances to form clusters according to data density and shape. Since GDD does not require any special parameters prior to run, resulting clusters do not change at different runs. During the study, an experimental framework is designed for analysis of the proposed clustering algorithm and its evaluation, based on clustering performance for some characteristic data sets. The algorithm is extensively tested using several synthetic data sets and some of the selected results are presented in the paper. Comparative study outcomes produced by other well-known clustering algorithms are also discussed in the paper.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号