首页 | 本学科首页   官方微博 | 高级检索  
     

基于浮动阈值分类器组合的多标签分类算法
引用本文:张丹普,付忠良,王莉莉,李昕.基于浮动阈值分类器组合的多标签分类算法[J].计算机应用,2015,35(1):147-151.
作者姓名:张丹普  付忠良  王莉莉  李昕
作者单位:1. 中国科学院 成都计算机应用研究所, 成都610041; 2. 中国科学院大学, 北京100049
基金项目:四川省科技支撑计划项目(2011GZ0171;2012GZ0106)
摘    要:针对目标可以同时属于多个类别的多标签分类问题,提出了一种基于浮动阈值分类器组合的多标签分类算法.首先,分析探讨了基于浮动阈值分类器的AdaBoost算法(AdaBoost.FT)的原理及错误率估计,证明了该算法能克服固定分段阈值分类器对分类边界附近点分类不稳定的缺点从而提高分类准确率;然后,采用二分类(BR)方法将该单标签学习算法应用于多标签分类问题,得到基于浮动阈值分类器组合的多标签分类方法,即多标签AdaBoost.FT.实验结果表明,所提算法的平均分类精度在Emotions数据集上比AdaBoost.MH、ML-kNN、RankSVM这3种算法分别提高约4%、8%、11%;在Scene、Yeast数据集上仅比RankSVM低约3%、1%.由实验分析可知,在不同类别标记之间基本没有关联关系或标签数目较少的数据集上,该算法均能得到较好的分类效果.

关 键 词:连续AdaBoost  浮动阈值  极大似然原理  多标签分类  集成学习  二分类方法  
收稿时间:2014-08-01
修稿时间:2014-09-19

Multi-label classification algorithm based on floating threshold classifiers combination
ZHANG Danpu , FU Zhongliang , WANG Lili , LI Xin.Multi-label classification algorithm based on floating threshold classifiers combination[J].journal of Computer Applications,2015,35(1):147-151.
Authors:ZHANG Danpu  FU Zhongliang  WANG Lili  LI Xin
Affiliation:1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu Sichuan 610041, China;
2. University of Chinese Academy of Sciences, Beijing 100049, China
Abstract:To solve the multi-label classification problem that a target belongs to multiple classes, a new multi-label classification algorithm based on floating threshold classifiers combination was proposed. Firstly, the theory and error estimation of the AdaBoost algorithm with floating threshold (AdaBoost.FT) were analyzed and discussed, and it was proved that AdaBoost.FT algorithm could overcome the defect of unstabitily when the fixed segmentation threshold classifier was used to classify the points near classifying boundary, the classification accuracy of single-label classification algorithm was improved. And then, the Binary Relevance (BR) method was introduced to apply AdaBoost.FT algorithm into multi-label classification problem, and the multi-label classification algorithm based on floating threshold classifiers combination was presented, namely multi-label AdaBoost.FT. The experimental results show that the average precision of multi-label AdaBoost. FT outperforms the other three multi-label algorithms, AdaBoost.MH (multiclass, multi-label version of AdaBoost based on Hamming loss), ML-kNN (Multi-Label k-Nearest Neighbor), RankSVM (Ranking Support Vector Machine) about 4%, 8%, 11% respectively in Emotions dataset, and is just little worse than RankSVM about 3%, 1% respectively in Scene and Yeast datasets. The experimental analyses show that multi-label AdaBoost. FT can obtain the better classification results in the datasets which have small number of labels or whose different labels are irrelevant.
Keywords:real AdaBoost  floating threshold  maximum likelihood principle  multi-label classification  ensemble learning  Binary Relevance (BR) method
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号