An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization
Authors: Thomas G. Dietterich
Affiliation: Department of Computer Science, Oregon State University, Corvallis, OR 97331, USA
Abstract: Bagging and boosting are methods that generate a diverse ensemble of classifiers by manipulating the training data given to a "base" learning algorithm. Breiman has pointed out that they rely for their effectiveness on the instability of the base learning algorithm. An alternative approach to generating an ensemble is to randomize the internal decisions made by the base algorithm. This general approach has been studied previously by Ali and Pazzani and by Dietterich and Kong. This paper compares the effectiveness of randomization, bagging, and boosting for improving the performance of the decision-tree algorithm C4.5. The experiments show that in situations with little or no classification noise, randomization is competitive with (and perhaps slightly superior to) bagging but not as accurate as boosting. In situations with substantial classification noise, bagging is much better than boosting, and sometimes better than randomization.
Keywords: decision trees; ensemble learning; bagging; boosting; C4.5; Monte Carlo methods
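
The contrast the abstract draws between manipulating the training data (bagging) and randomizing the learner's internal decisions can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: one-level decision stumps stand in for C4.5, the function names are hypothetical, and the `top_k` parameter (pick uniformly among the `top_k` lowest-error splits) is just one possible way to randomize an internal decision. Boosting is not sketched.

```python
import random
from collections import Counter

def stump_candidates(X, y):
    """Return candidate splits (errors, feature, threshold, left, right), best first."""
    candidates = []
    for f in range(len(X[0])):
        for t in sorted({row[f] for row in X}):
            left = [lab for row, lab in zip(X, y) if row[f] <= t]
            right = [lab for row, lab in zip(X, y) if row[f] > t]
            if not left or not right:
                continue  # a split must send examples to both sides
            left_label = Counter(left).most_common(1)[0][0]
            right_label = Counter(right).most_common(1)[0][0]
            errors = (sum(lab != left_label for lab in left)
                      + sum(lab != right_label for lab in right))
            candidates.append((errors, f, t, left_label, right_label))
    if not candidates:
        # Degenerate sample (no usable split): fall back to a constant predictor.
        majority = Counter(y).most_common(1)[0][0]
        errors = sum(lab != majority for lab in y)
        candidates.append((errors, 0, float("inf"), majority, majority))
    candidates.sort(key=lambda c: c[0])
    return candidates

def predict_stump(stump, row):
    _, f, t, left_label, right_label = stump
    return left_label if row[f] <= t else right_label

def bagged_ensemble(X, y, n_members, rng):
    """Bagging: each member sees a different bootstrap replicate of the data."""
    n = len(X)
    members = []
    for _ in range(n_members):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        Xb = [X[i] for i in idx]
        yb = [y[i] for i in idx]
        members.append(stump_candidates(Xb, yb)[0])  # best split on the replicate
    return members

def randomized_ensemble(X, y, n_members, rng, top_k=3):
    """Randomization: same data for every member, random choice among good splits."""
    best = stump_candidates(X, y)[:top_k]
    return [rng.choice(best) for _ in range(n_members)]

def vote(members, row):
    """Unweighted majority vote over the ensemble members."""
    return Counter(predict_stump(m, row) for m in members).most_common(1)[0][0]
```

In this sketch, diversity in `bagged_ensemble` comes only from resampling the data, while diversity in `randomized_ensemble` comes only from the random choice among near-best splits; both are combined with the same unweighted vote.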
This article is indexed in SpringerLink and other databases.