首页 | 本学科首页   官方微博 | 高级检索  
     

结合标签相关性和不均衡性的多标签学习模型
引用本文:姜思羽,钟晓玲,邱少健,宋恒杰.结合标签相关性和不均衡性的多标签学习模型[J].哈尔滨工业大学学报,2019,51(1):142-149.
作者姓名:姜思羽  钟晓玲  邱少健  宋恒杰
作者单位:华南理工大学 软件学院,广州,510006;华南理工大学 计算机科学与工程学院,广州,510006
摘    要:针对现有多标签学习算法较少兼顾标签间关联性和不平衡性的问题,提出一种同时考虑多标签间相关性与多标签不平衡问题的学习模型(A Multi-label Learning Model based on Label Correlation and Imbalance,MLCI).该学习模型针对每个标签类别,通过耦合其他标签类别以考量标签间的关联性,并降低缓解标签间不均衡比率,MLCI是一个将当前标签的二类不平衡学习器和多个与其他标签耦合的多类不平衡学习器结合的集成分类器.采用7种常用的多标签算法作为对比算法,针对yeast、scene、emotions和CAL500这4个开放数据集进行分类处理.实验结果表明,MLCI相比其他对比算法,在精度均值(Average-Precision)、排序损失(Ranking-Loss)、宏观平均AUC(Macro-Averaging AUC)和微观平均AUC(Micro-Averaging AUC) 4个性能评估指标上总体占明显优势.

关 键 词:多标签学习  标签相关性  类不平衡  不平衡分类器  集成分类器
收稿时间:2018/11/1 0:00:00

Amulti-label learning model based on label correlation and imbalance
JIANG Siyu,ZHONG Xiaoling,QIU Shaojian and SONG Hengjie.Amulti-label learning model based on label correlation and imbalance[J].Journal of Harbin Institute of Technology,2019,51(1):142-149.
Authors:JIANG Siyu  ZHONG Xiaoling  QIU Shaojian and SONG Hengjie
Affiliation:School of Software Engineering, South China University of Technology, Guangzhou 510006, China,School of Software Engineering, South China University of Technology, Guangzhou 510006, China,School of Computer Science & Engineering, South China University of Technology, Guangzhou 510006, China and School of Software Engineering, South China University of Technology, Guangzhou 510006, China
Abstract:Since the existing multi-label learning algorithms pay less attention to the problem of correlation and imbalance between label sets, this paper proposes a multi-label learning model based on label correlation and imbalance (MLCI). By coupling other label categories to consider the correlation among labels and reducing the imbalance ratio between labels of instances, the learning model is for each label category, and it is an ensemble classifier that combines the current class of binary imbalance classifier with multiple imbalanced classifiers coupled to other labels. In this paper, seven commonly used multi-label algorithms are used as comparison algorithms to classify the four open datasets of yeast, scene, emotions and CAL500. The experimental results show that the MLCI has obvious advantages in accuracy precision, ranking-Loss, macro-averaging AUC and micro-averaging AUC.
Keywords:Multi-label learning  label correlation  label imbalance  imbalance classifier  ensemble classifier
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《哈尔滨工业大学学报》浏览原始摘要信息
点击此处可从《哈尔滨工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号