首页 | 本学科首页   官方微博 | 高级检索  
     


Feature selection based on artificial bee colony and gradient boosting decision tree
Affiliation:1. Facultad de Economía y Negocios, Universidad Finis Terrae, Santiago, Chile;2. Facultad de Ingeniería y Ciencias, Universidad Adolfo Ibáñez, Santiago, Chile;3. Facultad de Ingeniería, Ciencia y Tecnología, Universidad Bernardo OHiggins, Santiago, Chile;1. Department of Computer Science, Birzeit University, POBox 14, West Bank, Palestine;2. Institute for Integrated and Intelligent Systems, Griffith University, Nathan Campus, Brisbane, QLD 4111, Australia;1. School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China;2. School of Computing, Telecommunications and Networks, Birmingham City University, Birmingham B42 2SU, UK
Abstract:Data from many real-world applications can be high dimensional and features of such data are usually highly redundant. Identifying informative features has become an important step for data mining to not only circumvent the curse of dimensionality but to reduce the amount of data for processing. In this paper, we propose a novel feature selection method based on bee colony and gradient boosting decision tree aiming at addressing problems such as efficiency and informative quality of the selected features. Our method achieves global optimization of the inputs of the decision tree using the bee colony algorithm to identify the informative features. The method initializes the feature space spanned by the dataset. Less relevant features are suppressed according to the information they contribute to the decision making using an artificial bee colony algorithm. Experiments are conducted with two breast cancer datasets and six datasets from the public data repository. Experimental results demonstrate that the proposed method effectively reduces the dimensions of the dataset and achieves superior classification accuracy using the selected features.
Keywords:Bee colony algorithm  Decision tree  Feature selection  Dimensionality reduction
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号