随机森林理论浅析 A Brief Theoretical Overview of Random Forests期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

随机森林理论浅析

引用本文：	董师师,黄哲学. 随机森林理论浅析[J]. 集成技术, 2013, 2(1): 1-7

作者姓名：	董师师黄哲学

作者单位：	深圳市高性能数据挖掘重点实验室;中国科学院深圳先进技术研究院

摘要：	随机森林是一种著名的集成学习方法,被广泛应用于数据分类和非参数回归。本文对随机森林算法的主要理论进行阐述,包括随机森林收敛定理、泛化误差界以和袋外估计三个部分。最后介绍一种属性加权子空间抽样的随机森林改进算法,用于解决超高维数据的分类问题。
关键词：	随机森林数据挖掘机器学习
A Brief Theoretical Overview of Random Forests

Dong Shishi,Huang Zhexue. A Brief Theoretical Overview of Random Forests[J]. , 2013, 2(1): 1-7

Authors:	Dong Shishi Huang Zhexue

Affiliation:	1,2 1( Shenzhen Key Laboratory of High Performance Data Mining, Shenzhen518055,China ) 2( Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen518055,China )

Abstract:	Random Forests is an important ensemble learning method and it is widely used in data classification and nonparametric regression. In this paper, we review three main theoretical issues of random forests, i.e., the convergence theorem, the generalization error bound and the out-of-bag estimation. In the end, we present an improved Random Forests algorithm, which uses a feature weighting sampling method to sample a subset of features at each node in growing trees. The new method is suitable to solve classification problems of very high dimensional data.

Keywords:	random forests data mining machine learning
本文献已被 CNKI 等数据库收录！
	点击此处可从《集成技术》浏览原始摘要信息
	点击此处可从《集成技术》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏