首页 | 本学科首页   官方微博 | 高级检索  
     

基于多目标微粒群优化的异质数据特征选择
引用本文:巩敦卫,胡滢,张勇.基于多目标微粒群优化的异质数据特征选择[J].电子学报,2014,42(7):1320-1326.
作者姓名:巩敦卫  胡滢  张勇
作者单位:中国矿业大学信息与电气工程学院, 江苏徐州 221116
基金项目:国家自然科学基金(No .61005089);江苏省自然科学基金(No .BK2011215);高等学校博士学科点专项科研基金(No .20100095120016);中国博士后科学基金
摘    要:环境和测量仪器精度的影响,使得采样数据的不同特征具有不同的质量.对这类异质数据进行特征选择,需要同时考虑特征子集确定分类器的准确度和可靠性,从而增加了特征选择的难度.本文研究异质数据的特征选择问题,提出一种基于多目标微粒群优化的特征选择方法.该方法首先以特征选择的概率为决策变量,将具有离散变量的特征选择问题,转化为连续变量多目标优化问题;然后,采用微粒群优化求解时,基于高斯采样,产生微粒的全局引导者,以提高Pareto解集的分布性;最后,依据储备集中元素更新的速度,确定需要扰动的微粒,以帮助微粒群跳出局部最优.将所提方法应用于多个典型数据集分类问题,实验结果表明了所提方法的有效性.

关 键 词:特征选择  异质数据  多目标优化  微粒群优化  高斯采样  
收稿时间:2013-04-15

Feature Selection of Heterogeneous Data Based on Multi-Objective Particle Swarm Optimization
GONG Dun-wei,HU Ying,ZHANG Yong.Feature Selection of Heterogeneous Data Based on Multi-Objective Particle Swarm Optimization[J].Acta Electronica Sinica,2014,42(7):1320-1326.
Authors:GONG Dun-wei  HU Ying  ZHANG Yong
Affiliation:School of Information and Electrical Engineering, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
Abstract:Different features of a sampling datum have different quality as a result the influence of the environment and the equipment precision.For the feature selection of this kind of heterogeneous data,both the accuracy and the reliability of the classifier determined by a feature subset are required to simultaneously consider,which enhances the difficulty of selecting features.The problem of the feature selection of heterogeneous data is focused on in this paper,and a method of selecting features is presented based on multi-objective particle swarm optimization.In this method,the above problem is first converted to a multi-objective optimization problem by regarding the probability of selecting a feature as the decision variable.When particle swarm optimization (PSO) is employed to solve the converted problem,the global guider of particles is generated by Gaussian sampling so as to improve the performance of Pareto solutions in distribution.In addition,the particle to be disturbed is determined according to the speed of updating a particle in the archive to help the swarm jump out of local optima.The proposed method is applied to classify several benchmark data sets,and the experimental results demonstrate its effectiveness.
Keywords:feature selection  heterogeneous data  multi-objective optimization  particle swarm optimization  Gaussian sam-pling
本文献已被 CNKI 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号