首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于HDDT集成的多类不平衡学习方法
引用本文:钱祺,姜远.一种基于HDDT集成的多类不平衡学习方法[J].微电子学与计算机,2011,28(10).
作者姓名:钱祺  姜远
作者单位:南京大学计算机软件新技术国家重点实验室,江苏南京,210093
基金项目:国家自然科学基金(60975043); 中央高校基本科研业务费专项资金(1115020204,1116020206)
摘    要:在很多真实世界问题中,不同类别的数据样本往往有显著的不平衡性,即大类的样本远多于小类.对类别不平衡样本进行学习,是目前国内外数据挖掘和机器学习领域的研究热点之一.以往对不平衡样本学习的研究主要针对二分类问题进行,由此针对多分类问题,提出一种基于HDDT决策树集成的多类不平衡学习方法.实验表明,该方法可以有效地对多类不平衡问题进行学习.

关 键 词:机器学习  数据挖掘  类别不平衡学习  多分类

A Multi-Class Imbalance Learning Method Based on HDDT Ensemble
QIAN Qi,JIANG Yuan.A Multi-Class Imbalance Learning Method Based on HDDT Ensemble[J].Microelectronics & Computer,2011,28(10).
Authors:QIAN Qi  JIANG Yuan
Affiliation:QIAN Qi,JIANG Yuan(National Key Lab for Novel Software Technology,Nanjing University,Nanjing 210093,China)
Abstract:In many real world applications,the number of examples from different class is significantly different,which means the number of examples in major class is much larger than that of minor class.Therefore,learning from imbalanced data set has received much attention of machine learning and data mining community.Considering that most of previous research focus on binary class problem,this paper proposes a multi-class imbalance method based on HDDT ensemble.Empirical study shows that the method is effective for...
Keywords:machine learning  data mining  class-imbalance learning  multi-class classification  
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号