Random Forest Feature Selection Algorithm Based on Categorization Information and Application
Original title: 融合分类信息的随机森林特征选择算法及应用
Citation: WU Weijie, ZHANG Jingxiang. Random Forest Feature Selection Algorithm Based on Categorization Information and Application[J]. Computer Engineering and Applications, 2021, 57(17): 147-156. DOI: 10.3778/j.issn.1002-8331.2008-0171
Authors: WU Weijie (武炜杰)  ZHANG Jingxiang (张景祥)
Affiliation: School of Science, Jiangnan University, Wuxi, Jiangsu 214122, China
Abstract: To address the high computational cost of the traditional random forest as the number of features increases, a random forest multi-feature permutation algorithm is proposed. All features are first clustered. Then, one cluster at a time, the features in that cluster are randomly permuted simultaneously while the other clusters remain unchanged, which yields an importance score for every feature cluster and a ranking among clusters. Features within a cluster are ranked by their correlation with the classification information. A correlation threshold is introduced to select the important features, and the remaining features are ranked first between clusters, then within clusters. To further illustrate the effectiveness of the method, three corresponding random forest multi-feature permutation algorithms are designed based on K-means, hierarchical, and fuzzy C-means clustering. The experimental results show that, compared with the traditional random forest method, the proposed algorithm achieves higher classification accuracy with fewer features and higher time efficiency.
Keywords: feature selection  clustering  random forest  multi-feature permutation
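
The procedure described in the abstract can be sketched in Python as follows. This is a minimal, standard-library-only illustration under stated assumptions, not the authors' implementation. Assumptions: `score` stands in for the accuracy of an already fitted random forest (replaced here by a fixed toy rule so the sketch needs no third-party library); all columns of a cluster share one row permutation; "correlation with the classification information" is taken to be the absolute Pearson correlation with the label; the clusters are given by hand rather than computed by K-means, hierarchical, or fuzzy C-means clustering; and the features that pass the threshold are ordered by the same between-then-within rule, which the abstract does not specify. All function names are hypothetical.

```python
import math
import random


def grouped_permutation_importance(score, X, y, clusters, n_repeats=5, seed=0):
    """Mean drop in score when all columns of one cluster are permuted together.

    score:    callable (X, y) -> float; assumed to wrap a fitted model's accuracy
    X:        list of rows, each a list of feature values
    clusters: dict mapping a cluster label to a list of column indices
    """
    rng = random.Random(seed)
    baseline = score(X, y)
    importance = {}
    for label, cols in clusters.items():
        total_drop = 0.0
        for _ in range(n_repeats):
            # Assumption: one shared row permutation for the whole cluster.
            perm = list(range(len(X)))
            rng.shuffle(perm)
            X_perm = [row[:] for row in X]
            for i, src in enumerate(perm):
                for c in cols:
                    X_perm[i][c] = X[src][c]
            total_drop += baseline - score(X_perm, y)
        importance[label] = total_drop / n_repeats
    return importance


def abs_pearson(xs, ys):
    """Absolute Pearson correlation; returns 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    vx = sum((a - mx) ** 2 for a in xs)
    vy = sum((b - my) ** 2 for b in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / math.sqrt(vx * vy)


def rank_features(cluster_importance, clusters, relevance, threshold):
    """Order features between clusters first, then within clusters, and split
    them into those at or above the relevance threshold and the rest."""
    ordered = []
    by_importance = sorted(clusters, key=lambda k: cluster_importance[k], reverse=True)
    for label in by_importance:
        ordered += sorted(clusters[label], key=lambda c: relevance[c], reverse=True)
    selected = [c for c in ordered if relevance[c] >= threshold]
    remaining = [c for c in ordered if relevance[c] < threshold]
    return selected, remaining


def toy_accuracy(X, y):
    """Stand-in for a fitted forest: predicts 1 when column 0 + column 1 > 0."""
    preds = [1 if row[0] + row[1] > 0 else 0 for row in X]
    return sum(p == t for p, t in zip(preds, y)) / len(y)


# Toy data: columns 0 and 1 carry the label signal, columns 2 and 3 do not.
X = [[i - 19.5, i - 19.5, float(i % 2), float((i // 2) % 2)] for i in range(40)]
y = [1 if i >= 20 else 0 for i in range(40)]
clusters = {"signal": [0, 1], "noise": [2, 3]}

imp = grouped_permutation_importance(toy_accuracy, X, y, clusters)
relevance = {c: abs_pearson([row[c] for row in X], y) for c in range(4)}
selected, remaining = rank_features(imp, clusters, relevance, threshold=0.5)
print(imp, selected, remaining)
```

In this toy setup the "noise" cluster receives an importance of exactly zero because the stand-in rule never reads those columns, so columns 0 and 1 are selected and columns 2 and 3 are left in the remaining ranking. Each cluster costs one set of permutations regardless of how many features it contains, which is the source of the time savings the abstract claims over permuting features one by one.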
This article is indexed in Wanfang Data (万方数据) and other databases.