首页 | 本学科首页   官方微博 | 高级检索  
     


Simulated annealing for supervised gene selection
Authors:Maurizio Filippone  Francesco Masulli  Stefano Rovetta
Affiliation:(1) Department of Computing Science, University of Glasgow, Sir Alwyn Williams Building, G12 8QQ Glasgow, UK;(2) Department of Computer and Information Sciences, University of Genova, Genoa, Italy;(3) CNISM Genova Research Unit, Genoa, Italy;(4) Sbarro Institute for Cancer Research and Molecular Medicine, Center for Biotechnology, Temple University, Philadelphia, PA, USA
Abstract:Genomic data, and more generally biomedical data, are often characterized by high dimensionality. An input selection procedure can attain the two objectives of highlighting the relevant variables (genes) and possibly improving classification results. In this paper, we propose a wrapper approach to gene selection in classification of gene expression data using simulated annealing along with supervised classification. The proposed approach can perform global combinatorial searches through the space of all possible input subsets, can handle cases with numerical, categorical or mixed inputs, and is able to find (sub-)optimal subsets of inputs giving low classification errors. The method has been tested on publicly available bioinformatics data sets using support vector machines and on a mixed type data set using classification trees. We also propose some heuristics able to speed up the convergence. The experimental results highlight the ability of the method to select minimal sets of relevant features.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号