ML-KNN: A lazy learning approach to multi-label learning
Authors: Min-Ling Zhang
Affiliation: National Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China
Abstract: Multi-label learning originated from the investigation of the text categorization problem, where each document may belong to several predefined topics simultaneously. In multi-label learning, the training set is composed of instances, each associated with a set of labels, and the task is to predict the label sets of unseen instances by analyzing training instances with known label sets. In this paper, a multi-label lazy learning approach named ML-KNN is presented, which is derived from the traditional K-nearest neighbor (KNN) algorithm. In detail, for each unseen instance, its K nearest neighbors in the training set are first identified. After that, based on statistical information gained from the label sets of these neighboring instances, i.e. the number of neighboring instances belonging to each possible class, the maximum a posteriori (MAP) principle is used to determine the label set for the unseen instance. Experiments on three different real-world multi-label learning problems, i.e. Yeast gene functional analysis, natural scene classification and automatic web page categorization, show that ML-KNN achieves superior performance to some well-established multi-label learning algorithms.
Keywords: Machine learning; Multi-label learning; Lazy learning; K-nearest neighbor; Functional genomics; Natural scene classification; Text categorization
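
The procedure outlined in the abstract (find the K nearest neighbors, count how many of them carry each label, then apply the MAP rule per label) might be sketched roughly as follows. This is an illustrative reading only, not the authors' implementation: the abstract does not give the estimation formulas, so the Laplace-style smoothing parameter `s`, the Euclidean distance, and the function names `mlknn_fit` and `mlknn_predict` are all assumptions introduced here.

```python
import numpy as np


def _neighbor_label_counts(X_query, X_train, Y_train, k, exclude_self=False):
    """For each query row, count how many of its k nearest training rows carry each label."""
    # Squared Euclidean distances, shape (n_query, n_train). Distance choice is an assumption.
    d = ((X_query[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    if exclude_self:
        # Query set is the training set itself: a point must not be its own neighbor.
        np.fill_diagonal(d, np.inf)
    nn = np.argsort(d, axis=1, kind="stable")[:, :k]
    return Y_train[nn].sum(axis=1)  # shape (n_query, n_labels)


def mlknn_fit(X, Y, k=3, s=1.0):
    """Estimate label priors and neighbor-count likelihoods (s is an assumed smoothing term)."""
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=int)
    m, q = Y.shape
    # Smoothed prior probability that an instance carries each label.
    prior1 = (s + Y.sum(axis=0)) / (2 * s + m)
    counts = _neighbor_label_counts(X, X, Y, k, exclude_self=True)
    # c1[l, j]: training points WITH label l whose neighborhood holds exactly j points with label l.
    # c0[l, j]: the same tally for training points WITHOUT label l.
    c1 = np.zeros((q, k + 1))
    c0 = np.zeros((q, k + 1))
    for l in range(q):
        for j in range(k + 1):
            hit = counts[:, l] == j
            c1[l, j] = np.sum(hit & (Y[:, l] == 1))
            c0[l, j] = np.sum(hit & (Y[:, l] == 0))
    like1 = (s + c1) / (s * (k + 1) + c1.sum(axis=1, keepdims=True))
    like0 = (s + c0) / (s * (k + 1) + c0.sum(axis=1, keepdims=True))
    return {"X": X, "Y": Y, "k": k, "prior1": prior1, "like1": like1, "like0": like0}


def mlknn_predict(model, X_new):
    """Assign each label whose MAP score for 'present' exceeds its score for 'absent'."""
    X_new = np.asarray(X_new, dtype=float)
    counts = _neighbor_label_counts(X_new, model["X"], model["Y"], model["k"])
    labels = np.arange(model["Y"].shape[1])
    post1 = model["prior1"] * model["like1"][labels, counts]
    post0 = (1 - model["prior1"]) * model["like0"][labels, counts]
    return (post1 > post0).astype(int)


# Toy data (made up for illustration): two clusters, three labels, label 2 shared by all points.
X_toy = [[0, 0], [0, 1], [1, 0], [1, 1], [10, 10], [10, 11], [11, 10], [11, 11]]
Y_toy = [[1, 0, 1]] * 4 + [[0, 1, 1]] * 4
toy_model = mlknn_fit(X_toy, Y_toy, k=3, s=1.0)
print(mlknn_predict(toy_model, [[0.5, 0.5], [10.5, 10.5]]))
```

Because the method is lazy, the sketch keeps the whole training set in the model and defers the neighbor search to prediction time; only the per-label priors and count likelihoods are precomputed. Each label is decided independently, which is why an instance can receive several labels at once.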
This article is indexed in ScienceDirect and other databases.