首页 | 本学科首页   官方微博 | 高级检索  
     


Learning model order from labeled and unlabeled data for partially supervised classification, with application to word sense disambiguation
Authors:Zheng-Yu Niu  Dong-Hong Ji  Chew Lim Tan  
Affiliation:aInstitute for Infocomm Research, Mail Box B023, 21 Heng Mui Keng Terrace, Singapore 119613, Singapore;bDepartment of Computer Science, National University of Singapore, 3 Science Drive 2, Singapore 117543, Singapore
Abstract:Previous partially supervised classification methods can partition unlabeled data into positive examples and negative examples for a given class by learning from positive labeled examples and unlabeled examples, but they cannot further group the negative examples into meaningful clusters even if there are many different classes in the negative examples. Here we proposed an automatic method to obtain a natural partitioning of mixed data (labeled data + unlabeled data) by maximizing a stability criterion defined on classification results from an extended label propagation algorithm over all the possible values of model order (or the number of classes) in mixed data. Our experimental results on benchmark corpora for word sense disambiguation task indicate that this model order identification algorithm with the extended label propagation algorithm as the base classifier outperforms SVM, a one-class partially supervised classification algorithm, and the model order identification algorithm with semi-supervised k-means clustering as the base classifier when labeled data is incomplete.
Keywords:Word sense disambiguation  Partially supervised classification  Semi-supervised clustering
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号