基于贝叶斯模型的多标签分类算法 Multi-label classification algorithm based on Bayesian model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于贝叶斯模型的多标签分类算法

引用本文：	张洛阳,毛嘉莉,刘斌,吴涛. 基于贝叶斯模型的多标签分类算法[J]. 计算机应用, 2016, 36(1): 52-56. DOI: 10.11772/j.issn.1001-9081.2016.01.0052

作者姓名：	张洛阳毛嘉莉刘斌吴涛

作者单位：	1. 西华师范大学计算机学院, 四川南充 637002;2. 华东师范大学软件学院, 上海 200241

基金项目：	四川省自然科学基金资助项目(14ZB0140)。

摘要：	针对二元关联法(BR)未考虑标签之间相关性,容易造成分类器输出在训练集中不存在或次数较少标签的不足,提出了基于贝叶斯模型的多标签分类算法(MLBM)和马尔可夫型多标签分类算法(MMLBM)。首先,建立仿真模型分析BR算法的不足,考虑到标签的取值应由属性置信度和标签置信度共同决定,提出MLBM。其中,通过传统的分类算法计算获得属性置信度,以及通过训练集得到标签置信度。然后,考虑到MLBM在计算属性置信度时必须考虑所有已分类的标签,分类器的性能容易受无关或弱关系的标签影响,所以使用马尔可夫模型简化置信度的计算提出了MMLBM。理论分析和仿真实验表明,与BR算法相比,MMLBM的平均分类精度在emotions数据集上提高约4.8%,在yeast数据集上提高约9.8%,在flags数据集上提高约7.3%。实验结果表明,当数据集中实例的标签基数较大时,相对于BR算法,MMLBM的准确性有较大的提升。
关键词：	多标签贝叶斯模型马尔可夫模型 K近邻置信度
收稿时间：	2015-09-05
修稿时间：	2015-09-25
Multi-label classification algorithm based on Bayesian model

ZHANG Luoyang,MAO Jiali,LIU Bin,WU Tao. Multi-label classification algorithm based on Bayesian model[J]. Journal of Computer Applications, 2016, 36(1): 52-56. DOI: 10.11772/j.issn.1001-9081.2016.01.0052

Authors:	ZHANG Luoyang MAO Jiali LIU Bin WU Tao

Affiliation:	1. College of Computer, China West Normal University, Nanchong Sichuan 637002, China;2. College of Software, East China Normal University, Shanghai 200241, China

Abstract:	Since the relation of labels in Binary Relevance (BR) is ignored, it is easy to cause the multi-label classifier to output not exist or less emergent labels in training data. The Multi-Label classification algorithm based on Bayesian Model (MLBM) and Markov Multi-Label classification algorithm based on Bayesian Model (MMLBM) were proposed. Firstly, to analyze the shortcomings of BR algorithm, the simulation model was established; considering the value of label should be decided by the attribute confidence and label confidence, MLBM was proposed. Particularly, the attribute confidence was calculated by traditional classification and the label confidence was obtained directly from the training data. Secondly, when MLBM calculated label confidence, it had to consider all the classified labels, thus some of no-relation or weak-relation labels would affect performance of the classifier. To overcome the weakness of MLBM, MMLBM was proposed, which used Markov model to simplify the calculation of label confidence. The theoretical analyses and simulation experiment results demonstrate that, in comparison with BR algorithm, the average classification accuracy of MMLBM increased by 4.8% on emotions dataset, 9.8% on yeast dataset and 7.3% on flags dataset. The experimental results show that MMLBM can effectively improve the classification accuracy when the label cardinality is larger in the training data.

Keywords:	multi-label Bayesian model Markov model K Nearest Neighbor (KNN) confidence
本文献已被万方数据等数据库收录！
	点击此处可从《计算机应用》浏览原始摘要信息
	点击此处可从《计算机应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏