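Multi-label feature selection based on label-specific feature with missing labels (缺失标记下基于类属属性的多标记特征选择)
Cite this article: ZHANG Zhihao, LIN Yaojin, LU Shun, GUO Chen, WANG Chenxi. Multi-label feature selection based on label-specific feature with missing labels[J]. Journal of Computer Applications (计算机应用), 2021, 41(10): 2849-2857.
Authors: ZHANG Zhihao (张志浩)  LIN Yaojin (林耀进)  LU Shun (卢舜)  GUO Chen (郭晨)  WANG Chenxi (王晨曦)
Affiliations: 1. School of Computer Science, Minnan Normal University (闽南师范大学), Zhangzhou, Fujian 363000, China; 2. Key Laboratory of Data Science and Intelligence Application of Fujian Provincial Universities (Minnan Normal University), Zhangzhou, Fujian 363000, China
Funding: General Program of the National Natural Science Foundation of China (62076116); General Program of the Natural Science Foundation of Fujian Province (2020J01811).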
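Abstract: Multi-label feature selection has been widely applied in fields such as image classification and disease diagnosis. In practice, however, the label space of the data often has some labels missing, which damages the structure of and the correlations among labels and makes it difficult for learning algorithms to select important features accurately. To address this problem, a Multi-label Feature Selection based on Label-specific feature with Missing Labels (MFSLML) algorithm is proposed. First, the label-specific features of each class label are obtained via sparse learning. At the same time, a mapping between label-specific features and labels is built on a linear regression model and used to recover the missing labels. Finally, experiments are conducted on 7 datasets with 4 evaluation metrics. The results show that, compared with state-of-the-art multi-label feature selection algorithms such as the multi-label feature selection algorithm based on Max-Dependency and Min-Redundancy (MDMR) and the multi-label feature selection algorithm based on feature interaction (MFML), MFSLML improves average precision by 4.61 to 5.5 percentage points, which indicates that MFSLML achieves better classification performance.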

Keywords: feature selection  label-specific feature  missing label  linear regression  multi-label learning
Received: 2020-12-03
Revised: 2021-03-01

Multi-label feature selection based on label-specific feature with missing labels
ZHANG Zhihao, LIN Yaojin, LU Shun, GUO Chen, WANG Chenxi. Multi-label feature selection based on label-specific feature with missing labels[J]. Journal of Computer Applications, 2021, 41(10): 2849-2857.
Authors:ZHANG Zhihao  LIN Yaojin  LU Shun  GUO Chen  WANG Chenxi
Affiliation: 1. School of Computer Science, Minnan Normal University, Zhangzhou, Fujian 363000, China; 2. Key Laboratory of Data Science and Intelligence Application of Fujian Provincial Universities (Minnan Normal University), Zhangzhou, Fujian 363000, China
Abstract: Multi-label feature selection has been widely used in many domains, such as image classification and disease diagnosis. In practice, however, the label space of the data often contains missing labels, which destroys the structure of and the correlations between labels and makes it difficult for learning algorithms to select important features accurately. To address this problem, a Multi-label Feature Selection based on Label-specific feature with Missing Labels (MFSLML) algorithm was proposed. Firstly, the label-specific features for each class label were obtained via a sparse learning method. At the same time, the mapping between label-specific features and labels was constructed based on a linear regression model and used to recover the missing labels. Finally, experiments were performed on 7 datasets using 4 evaluation metrics. Experimental results show that, compared with state-of-the-art multi-label feature selection algorithms such as the multi-label feature selection algorithm based on Max-Dependency and Min-Redundancy (MDMR) and Multi-label Feature selection with Missing Labels via considering feature interaction (MFML), MFSLML increases the average precision by 4.61 to 5.5 percentage points, which indicates that MFSLML achieves better classification performance.
Keywords:feature selection  label-specific feature  missing label  linear regression  multi-label learning  
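The abstract describes the method only at a high level and does not give the MFSLML objective function. As a rough illustration of the general idea (sparse learning to find label-specific features, plus a linear regression from features to labels that is used to fill in missing labels), the sketch below fits an L1-regularized linear model on the observed labels only, using proximal gradient descent (ISTA). This is a generic stand-in and not the authors' algorithm: the function names `fit_sparse_label_weights`, `recover_labels`, and `rank_features`, the mask `observed`, and the parameters `lam` and `threshold` are all assumptions introduced for this example.

```python
import numpy as np

def soft_threshold(A, t):
    """Element-wise soft-thresholding, the proximal operator of t * ||A||_1."""
    return np.sign(A) * np.maximum(np.abs(A) - t, 0.0)

def fit_sparse_label_weights(X, Y, observed, lam=0.1, n_iter=500):
    """Minimise 0.5 * ||observed * (X @ W - Y)||_F^2 + lam * ||W||_1 with ISTA.

    X: (n, d) feature matrix.
    Y: (n, q) label matrix with entries in {0, 1}.
    observed: (n, q) mask, 1 where a label is known and 0 where it is missing.
    Column j of the returned (d, q) matrix W is sparse; its non-zero rows play
    the role of label-specific features for label j (assumed interpretation).
    """
    d, q = X.shape[1], Y.shape[1]
    W = np.zeros((d, q))
    # Step size 1/L, where L = ||X||_2^2 bounds the gradient's Lipschitz constant.
    lipschitz = np.linalg.norm(X, 2) ** 2
    step = 1.0 / max(lipschitz, 1e-12)
    for _ in range(n_iter):
        residual = observed * (X @ W - Y)  # missing entries contribute no loss
        grad = X.T @ residual
        W = soft_threshold(W - step * grad, step * lam)
    return W

def recover_labels(X, Y, observed, W, threshold=0.5):
    """Keep observed labels; fill missing ones with thresholded predictions."""
    pred = (X @ W >= threshold).astype(Y.dtype)
    return np.where(observed == 1, Y, pred)

def rank_features(W):
    """Rank features by the L2 norm of their weight row, largest first."""
    return np.argsort(-np.linalg.norm(W, axis=1))
```

A typical call sequence under these assumptions would be `W = fit_sparse_label_weights(X, Y, observed)`, then `recover_labels(X, Y, observed, W)` to complete the label matrix, and `rank_features(W)[:k]` to keep the `k` highest-ranked features. Larger values of `lam` drive more entries of `W` to zero and therefore select fewer features per label.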
This article is indexed in Wanfang Data (万方数据) and other databases.