首页 | 本学科首页   官方微博 | 高级检索  
     

带约束小生境二进制粒子群优化的生物组学数据集成特征选择
引用本文:杨峻山,周家锐,朱泽轩,纪震. 带约束小生境二进制粒子群优化的生物组学数据集成特征选择[J]. 信号处理, 2016, 32(7): 757-763. DOI: 10.16798/j.issn.1003-0530.2016.07.001
作者姓名:杨峻山  周家锐  朱泽轩  纪震
作者单位:1.深圳大学信息工程学院,广东深圳 518060
基金项目:国家自然科学基金资助项目(61171125,61001185,61211130120);广东省自然科学基金资助项目(S2012010009545);深圳知识创新计划基础研究(JCYJ20130329115450637 );深圳市海外高层次人才创新创业资助研发项目(KQC201108300045A)
摘    要:针对生物组学数据高维小样本的特点而引起的分类误差较大的问题,提出了一种带约束小生境二进制粒子群优化的集成特征选择方法。该方法利用二进制粒子群优化算法搜索分类准确率最高的特征子集,通过约束粒子编码的置位个数以限制选择特征个数,并加入多模优化中的小生境技术使算法能够一次获得多个差异度较大的特征子集,最后采用集成学习技术将基于多特征子集建立的基分类器集成为强分类器并对数据进行分类学习。实验结果表明,该特征选择方法在生物组学数据上能够稳定选择较少特征并获得较好分类性能。 

关 键 词:生物组学   特征选择   粒子群优化   小生境
收稿时间:2015-11-09

Ensemble Feature Selection Based on Constrained Niching Binary Particle Swarm Optimization for Omics Data Classification
Affiliation:1.College of Engineering and Information, Shenzhen University, Shenzhen, Guangdong 518060, China2.College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guangdong 518060, China3.School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China4.Shenzhen Key Laboratory of Embedded System Design,Shenzhen, Guangdong 518060, China
Abstract:Classification of omics data suffers from the high error rate due to their high dimensional and small sample size characteristics. To overcome the problem, this paper proposes an ensemble feature selection for omics data classification based on constrained niching binary particle swarm optimization (PSO). Particularly, optimal feature subsets in terms of best classification accuracy are identified by the binary PSO. The proposed method introduces constraint on the particle encoding to constrain the number of selected features, and niching technique from multimodal optimization is imposed to enable the algorithm to obtain multiple diverse feature subsets in a single run. Afterward, multiple base classifiers built on the obtained feature subsets are combined into a stronger classifier which is applied to classify the omics data. Experimental results on real world omics datasets demonstrate that the proposed feature selection method can stably select compact feature subsets and obtain promising classification performance. 
Keywords:
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号