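A Fast Neighbor Prototype Selection Algorithm Based on Local Mean and Class Global Information
Cite this article: LI Juan, WANG Yu-Ping. A Fast Neighbor Prototype Selection Algorithm Based on Local Mean and Class Global Information[J]. Acta Automatica Sinica, 2014, 40(6): 1116-1125.
Authors: LI Juan  WANG Yu-Ping
Affiliation: 1. School of Computer Science and Technology, Xidian University, Xi'an 710071;
Funding: Supported by the National Natural Science Foundation of China (61272119)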
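Abstract: The condensed nearest neighbor method is a simple non-parametric prototype selection algorithm, but its prototype selection is easily disturbed by the order in which samples are read and by abnormal samples. To overcome these problems, a nearest-neighbor prototype selection method based on local means and class global information is proposed. During prototype selection, the method makes full use of the local means of the k same-class and different-class nearest neighbors, within the prototype set, of each sample to be learned, together with class global information; it also defines an update strategy that updates the prototype set dynamically. The method mitigates the influence of read order and abnormal samples on prototype selection, reduces the size of the prototype set, and compresses the data set strongly while maintaining high classification accuracy. Experiments on image recognition data sets and UCI (University of California Irvine) benchmark data sets show that the proposed algorithm achieves more effective classification performance than the compared algorithms.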

Keywords: data classification; prototype selection; local mean; class global information; adaptive learning
Received: 2013-06-19

A Fast Neighbor Prototype Selection Algorithm Based on Local Mean and Class Global Information
LI Juan, WANG Yu-Ping. A Fast Neighbor Prototype Selection Algorithm Based on Local Mean and Class Global Information[J]. Acta Automatica Sinica, 2014, 40(6): 1116-1125.
Authors:LI Juan  WANG Yu-Ping
Affiliation:1.School of Computer Science and Technology, Xidian University, Xi'an 710071;2.School of Distance Education, Shaanxi Normal University, Xi'an 710062
Abstract: The condensed nearest neighbor (CNN) algorithm is a simple non-parametric prototype selection method, but its prototype selection is sensitive to the order in which patterns are read and to abnormal patterns. To address these problems, a new prototype selection method based on local means and class global information is proposed. First, for each pattern to be learned, the method makes full use of the local means of its k homogeneous (same-class) and heterogeneous (different-class) nearest neighbors, together with the class global information. Second, an updating process is introduced, with updating strategies that dynamically update the prototype set. The proposed method lessens the influence of pattern read order and abnormal patterns on prototype selection and reduces the size of the prototype set. It compresses the original data set strongly while maintaining high classification accuracy. Experiments on two image recognition data sets and University of California Irvine (UCI) benchmark data sets show that the proposed method classifies more effectively than the compared algorithms.
Keywords: data classification; prototype selection; local mean; class global information; adaptive learning
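
For context, the order sensitivity of the classical CNN baseline mentioned in the abstract can be illustrated with a small sketch. This is a minimal version of the standard condensed nearest neighbor rule with a 1-NN check, not the method proposed in the paper, whose details the abstract does not give. The function names `nearest_label` and `condensed_nn` and the one-dimensional toy data are illustrative assumptions.

```python
import math

def nearest_label(store, x):
    """Return the label of the stored prototype closest to x (1-NN)."""
    best = min(store, key=lambda p: math.dist(p[0], x))
    return best[1]

def condensed_nn(samples):
    """Classical condensed nearest neighbor (illustrative sketch).

    samples: list of (point, label) pairs, scanned in the given order.
    Returns the selected prototypes as a list of (point, label) pairs.
    """
    if not samples:
        return []
    store = [samples[0]]      # seed the prototype set with the first sample
    changed = True
    while changed:            # repeat full passes until nothing is added
        changed = False
        for point, label in samples:
            if nearest_label(store, point) != label:
                store.append((point, label))  # misclassified: keep it
                changed = True
    return store

# Hypothetical toy data: two classes on a line.
data = [((0.0,), "a"), ((1.0,), "a"), ((2.0,), "a"),
        ((3.0,), "b"), ((4.0,), "b"), ((5.0,), "b")]

forward = condensed_nn(data)
backward = condensed_nn(list(reversed(data)))

print(sorted(p[0][0] for p in forward))
print(sorted(p[0][0] for p in backward))
```

Both runs return a prototype set that classifies every training sample correctly, but the selected prototypes differ because the seed sample and the scan order differ. That dependence on read order is the weakness the abstract sets out to reduce.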