首页 | 本学科首页   官方微博 | 高级检索  
     

基于标签关联性的多标签Scratch分类算法
引用本文:彭聪,孙岩,戚鹏. 基于标签关联性的多标签Scratch分类算法[J]. 北京邮电大学学报, 2019, 42(6): 134-141. DOI: 10.13190/j.jbupt.2019-126
作者姓名:彭聪  孙岩  戚鹏
作者单位:北京邮电大学计算机学院,北京100876;北京邮电大学计算机学院,北京100876;北京邮电大学计算机学院,北京100876
基金项目:国家自然科学基金项目(61672109,61772085,61877005)
摘    要:为了实现Scratch可视化编程领域的作品分类,提出了一种基于标签关联性的多标签分类算法(MLLR),构建了一个有效的多标签Scratch分类模型.首先提取作品的Block使用特征、计算思维技能特征和复杂度特征3类特征作为分类特征;然后针对RAKEL算法随机选择标签子集,忽略了标签间的关联性,提出了改进的MLLR算法,该方法根据多标签之间的关联性来划分标签子集,再训练相应的标签幂集子分类器.实验结果表明,MLLR算法在分类性能和时间性能上优于RAKEL等多标签分类算法,构建的分类模型对于Scratch作品具有较强的适用性,分类的准确率达到81.3%.

关 键 词:Scratch  标签关联性  多标签分类  分类模型
收稿时间:2019-11-22

Label Relevance Based Multi-Label Scratch Classification Algorithm
PENG Cong,SUN Yan,QI Peng. Label Relevance Based Multi-Label Scratch Classification Algorithm[J]. Journal of Beijing University of Posts and Telecommunications, 2019, 42(6): 134-141. DOI: 10.13190/j.jbupt.2019-126
Authors:PENG Cong  SUN Yan  QI Peng
Affiliation:School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
Abstract:In order to implement the classification of projects in visual programming field of Scratch, a multi-label classification algorithm (MLLR) appears based on label relevance. An effective multi-label classification model for Scratch projects was constructed. Firstly, the block usage features, the computational thinking skill features and the Halstead features of projects are extracted as classification features. Then, the RAKEL algorithm randomly chooses label subsets, ignoring the relevance between labels, thereafter an improved MLLR algorithm was proposed. This method divides label subsets according to the relevance between multiple labels, and then trains the corresponding label power set sub-classifiers. Experiments show that MLLR algorithm is superior to RAKEL and other multi-label classification algorithms in classification performance and time performance, The classification model constructed has a strong applicability for Scratch projects, and the accuracy of classification reaches 81.3%.
Keywords:Scratch  label relevance  multi-label classification  classification model  
本文献已被 万方数据 等数据库收录!
点击此处可从《北京邮电大学学报》浏览原始摘要信息
点击此处可从《北京邮电大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号