首页 | 本学科首页   官方微博 | 高级检索  
     

基于新的森林优化算法的特征选择算法
引用本文:谢琪,徐旭,程耕国,陈和平.基于新的森林优化算法的特征选择算法[J].计算机应用,2020,40(5):1266-1271.
作者姓名:谢琪  徐旭  程耕国  陈和平
作者单位:1.武汉科技大学 信息科学与工程学院,武汉 430081 2.格勒诺布尔高等商学院 高等商业学院,格勒诺布尔 38000, 法国
基金项目:国家自然科学基金资助项目(61702381, 61602351)。
摘    要:针对传统的基于森林优化算法的特征选择算法在初始化阶段、候选森林生成阶段和更新阶段存在的问题,提出了一种新的基于森林优化算法的特征选择算法。该算法在初始化阶段采用皮尔森相关系数和L1正则化方法代替随机初始化策略;在候选森林生成阶段,采用优劣树分开和差额补足的方法解决优劣树不完备问题;在更新阶段,将与最优树精度相同但维度不同的树木添加到森林中。在实验中,所提算法采用与传统的基于森林优化算法的特征选择算法相同的实验数据和实验参数,分别测试了小维度、中维度和大维度数据。实验结果表明,在2个大维度数据和2个中维度数据上,所提算法的分类精度和维度缩减能力均高于传统的基于森林优化算法的特征选择算法。实验结果验证了所提算法在处理特征选择问题的有效性。

关 键 词:特征选择  L1正则化  候选森林  更新机制  森林优化算法
收稿时间:2019-09-23
修稿时间:2019-10-16

Feature selection algorithm based on new forest optimization algorithm
XIE Qi,XU Xu,CHENG Gengguo,CHEN Heping.Feature selection algorithm based on new forest optimization algorithm[J].journal of Computer Applications,2020,40(5):1266-1271.
Authors:XIE Qi  XU Xu  CHENG Gengguo  CHEN Heping
Affiliation:1.School of Information Science and Engineering, Wuhan University of Science and Technology, WuhanHubei 430081, China
2.Graduate School of Business, Grenoble Graduate School of Business, Grenoble 38000, France
Abstract:A new feature selection algorithm using forest optimization algorithm was proposed, which aimed at solving the problems of the traditional feature selection using forest optimization algorithm in the stages of initialization, candidate forest generation and updating. In the algorithm, Pearson correlation coefficient and L1 regularization method were used to replace the random initialization strategy in the initialization stage, the methods of separating good and bad trees and fulfilling the difference were used to solve the problems of incompletion of good and bad trees in the candidate forest generation stage, and trees having the same precision but different dimension with the optimal tree were added to the forest in the updating stage. In the experiments, with the same experimental data and experimental parameters, the proposed algorithm and the traditional feature selection using forest optimization algorithm were used to test the small, medium and large dimension data respectively. The experimental results show that the proposed algorithm is better than the traditional feature selection using forest optimization algorithm in the classification performance and dimension reduction ability on two medium and two large dimension data. The experimental results prove the effectiveness of the proposed algorithm in solving feature selection problems.
Keywords:feature selection  L1 regularization  candidate forest  updating mechanism  forest optimization algorithm  
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号