首页 | 本学科首页   官方微博 | 高级检索  
     

基于多类指数损失函数逐步添加模型的改进多分类AdaBoost算法
引用本文:翟夕阳,王晓丹,雷蕾,魏晓辉.基于多类指数损失函数逐步添加模型的改进多分类AdaBoost算法[J].计算机应用,2017,37(6):1692-1696.
作者姓名:翟夕阳  王晓丹  雷蕾  魏晓辉
作者单位:1. 空军工程大学 防空反导学院, 西安 710051;2. 解放军第463医院, 沈阳 110042
基金项目:国家自然科学基金资助项目(61273275,61503407)。
摘    要:多类指数损失函数逐步添加模型(SAMME)是一种多分类的AdaBoost算法,为进一步提升SAMME算法的性能,针对使用加权概率和伪损失对算法的影响进行研究,在此基础上提出了一种基于基分类器对样本有效邻域分类的动态加权AdaBoost算法SAMME.RD。首先,确定是否使用加权概率和伪损失;然后,求出待测样本在训练集中的有效邻域;最后,根据基分类器针对有效邻域的分类结果确定基分类器的加权系数。使用UCI数据集进行验证,实验结果表明:使用真实的错误率计算基分类器加权系数效果更好;在数据类别较少且分布平衡时,使用真实概率进行基分类器筛选效果较好;在数据类别较多且分布不平衡时,使用加权概率进行基分类器筛选效果较好。所提的SAMME.RD算法可以有效提高多分类AdaBoost算法的分类正确率。

关 键 词:集成学习  多分类  AdaBoost算法  多类指数损失函数逐步添加模型(SAMME)  动态加权融合  
收稿时间:2016-11-21
修稿时间:2017-01-10

Improved multi-class AdaBoost algorithm based on stagewise additive modeling using a multi-class exponential loss function
ZHAI Xiyang,WANG Xiaodan,LEI Lei,WEI Xiaohui.Improved multi-class AdaBoost algorithm based on stagewise additive modeling using a multi-class exponential loss function[J].journal of Computer Applications,2017,37(6):1692-1696.
Authors:ZHAI Xiyang  WANG Xiaodan  LEI Lei  WEI Xiaohui
Affiliation:1. Institute of Air Defense and Anti-Missile, Air Force Engineering University, Xi'an Shaanxi 710051, China;2. Hospital 463 of PLA, Shenyang Liaoning 110042, China
Abstract:Stagewise Additive Modeling using a Multi-class Exponential loss function (SAMME) is a multi-class AdaBoost algorithm. To further improve the performance of SAMME, the influence of using weighed error rate and pseudo loss on SAMME algorithm was studied, and a dynamic weighted Adaptive Boosting (AdaBoost) algorithm named SAMME with Resampling and Dynamic weighting (SAMME.RD) algorithm was proposed based on the classification of sample's effective neighborhood area by using the base classifier. Firstly, it was determined that whether to use weighted probability and pseudo loss or not. Then, the effective neighborhood area of sample to be tested in the training set was found out. Finally, the weighted coefficient of the base classifier was determined according to the classification result of the effective neighborhood area based on the base classifier. The experimental results show that, the effect of calculating the weighted coefficient of the base classifier by using real error rate is better. The performance of selecting base classifier by using real probability is better when the dataset has less classes and its distribution is balanced. The performance of selecting base classifier by using weighed probability is better when the dataset has more classes and its distribution is imbalanced. The proposed SAMME.RD algorithm can improve the multi-class classification accuracy of AdaBoost algorithm effectively.
Keywords:ensemble learning  multi-class  Adaptive Boosting(AdaBoost) algorithm  Stagewise Additive Modeling using a Multi-class Exponential loss function (SAMME)  dynamic weighted fusion  
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号