首页 | 本学科首页   官方微博 | 高级检索  
     

随机森林理论浅析
引用本文:董师师,黄哲学. 随机森林理论浅析[J]. 集成技术, 2013, 2(1): 1-7
作者姓名:董师师  黄哲学
作者单位:深圳市高性能数据挖掘重点实验室;中国科学院深圳先进技术研究院
摘    要:随机森林是一种著名的集成学习方法,被广泛应用于数据分类和非参数回归。本文对随机森林算法的主要理论进行阐述,包括随机森林收敛定理、泛化误差界以和袋外估计三个部分。最后介绍一种属性加权子空间抽样的随机森林改进算法,用于解决超高维数据的分类问题。

关 键 词:随机森林  数据挖掘  机器学习

A Brief Theoretical Overview of Random Forests
Dong Shishi,Huang Zhexue. A Brief Theoretical Overview of Random Forests[J]. , 2013, 2(1): 1-7
Authors:Dong Shishi  Huang Zhexue
Affiliation:1,2 1( Shenzhen Key Laboratory of High Performance Data Mining, Shenzhen518055,China ) 2( Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen518055,China )
Abstract:Random Forests is an important ensemble learning method and it is widely used in data classification and nonparametric regression. In this paper, we review three main theoretical issues of random forests, i.e., the convergence theorem, the generalization error bound and the out-of-bag estimation. In the end, we present an improved Random Forests algorithm, which uses a feature weighting sampling method to sample a subset of features at each node in growing trees. The new method is suitable to solve classification problems of very high dimensional data.
Keywords:random forests   data mining   machine learning
本文献已被 CNKI 等数据库收录!
点击此处可从《集成技术》浏览原始摘要信息
点击此处可从《集成技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号