首页 | 本学科首页   官方微博 | 高级检索  
     

基于CURE聚类算法改进的原型选择算法
引用本文:孙元元,张德生,张晓.基于CURE聚类算法改进的原型选择算法[J].计算机系统应用,2019,28(8):162-169.
作者姓名:孙元元  张德生  张晓
作者单位:西安理工大学 理学院,西安,710054;西安理工大学 理学院,西安,710054;西安理工大学 理学院,西安,710054
摘    要:针对传统K近邻分类器在大规模数据集中存在时间和空间复杂度过高的问题,可采取原型选择的方法进行处理,即从原始数据集中挑选出代表原型(样例)进行K近邻分类而不降低其分类准确率.本文在CURE聚类算法的基础上,针对CURE的噪声点不易确定及代表点分散性差的特点,利用共享邻居密度度量给出了一种去噪方法和使用最大最小距离选取代表点进行改进,从而提出了一种新的原型选择算法PSCURE (improved prototype selection algorithm based on CURE algorithm).基于UCI数据集进行实验,结果表明:提出的PSCURE原型选择算法与相关原型算法相比,不仅能筛选出较少的原型,而且可获得较高的分类准确率.

关 键 词:K近邻分类器  原型选择  共享邻居密度  CURE层次聚类  代表点
收稿时间:2019/1/23 0:00:00
修稿时间:2019/2/26 0:00:00

Improved Prototype Selection Algorithm Based on CURE Algorithm
SUN Yuan-Yuan,ZHANG De-Sheng and ZHANG Xiao.Improved Prototype Selection Algorithm Based on CURE Algorithm[J].Computer Systems& Applications,2019,28(8):162-169.
Authors:SUN Yuan-Yuan  ZHANG De-Sheng and ZHANG Xiao
Affiliation:Faculty of Science, Xi''an University of Technology, Xi''an 710054, China,Faculty of Science, Xi''an University of Technology, Xi''an 710054, China and Faculty of Science, Xi''an University of Technology, Xi''an 710054, China
Abstract:Since the traditional K-nearest neighbor classifier possesses large time and space complexity for larger-scale data sets, prototype selection is an effective processed method which selects representative prototypes (instances) from the original data set for K-nearest neighbor classifier without reducing the classification accuracy. At present, there exist many prototype selection methods. In this paper, based on the existing CURE algorithm, which is difficult to determine the noise points and has bad dispersed of representative points, the shared neighbor density metric is presented to delete noise points and the maximum and minimum distances are employed to obtain scattered representative points, which generates a novel prototype selection methods PSCURE (improved Prototype Selection algorithm based on CURE algorithm). Some numerical experiments are further conducted to show the performance of the proposed prototype selection algorithm compared with other related prototype selection algorithms. The experimental results show that the proposed algorithm not only can select fewer prototypes but also can achieve higher classifier accuracy for almost all the data sets.
Keywords:K nearest neighbor classifier  prototype selection  shared neighbor density  CURE  representative point
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号