首页 | 本学科首页   官方微博 | 高级检索  
     

半监督多示例核
引用本文:张钢,印鉴,程良伦,钟钦灵.半监督多示例核[J].计算机科学,2011,38(9):220-223.
作者姓名:张钢  印鉴  程良伦  钟钦灵
作者单位:(广东工业大学自动化学院广州510006)'(中山大学计算机科学系 广州 510275}; (黄埔职业技术学校数学系 广州 510731)
基金项目:本文受国家白然科学基金项目(U0935002) ,广东省白然科学基金项目(07117421,835100900 L000002} ,广东工业大学高教研究基金项目(2009D06}资助。
摘    要:在多示例学习中引入利用未标记示例的机制,能降低训练的成本并提高学习器的泛化能力。当前半监督多示例学习算法大部分是基于对包中的每一个示例进行标记,把多示例学习转化为一个单示例半监督学习问题。考虑到包的类标记由包中示例及包的结构决定,提出一种直接在包层次上进行半监督学习的多示例学习算法。通过定义多示例核,利用所有包(有标记和未标记)计算包层次的图拉普拉斯矩阵,作为优化目标中的光滑性惩罚项。在多示例核所张成的RKHS空间中寻找最优解被归结为确定一个经过未标记数据修改的多示例核函数,它能直接用在经典的核学习方法上。在实验数据集上对算法进行了测试,并和已有的算法进行了比较。实验结果表明,基于半监督多示例核的算法能够使用更少量的训练数据而达到与监督学习算法同样的精度,在有标记数据集相同的情况下利用未标记数据能有效地提高学习器的泛化能力。

关 键 词:多示例学习,半监督学习,多示例核,支持向量机

Semi-supervised Multi-instance Kernel
ZHANG Gang,YIN Jian,CHENG Liang-lun,ZHONG Qin-ling.Semi-supervised Multi-instance Kernel[J].Computer Science,2011,38(9):220-223.
Authors:ZHANG Gang  YIN Jian  CHENG Liang-lun  ZHONG Qin-ling
Affiliation:ZHANG Gang1,2 YIN Jian2 CHENG Liang-lun1 ZHONG Qin-ling3(Faculty of Automation,Guangdong University of Technology,Guangzhou 510006,China)1(Department of Computer Science,SUN YAT-SEN University,Guangzhou 510275,China)2(Department of Mathematics,Huangpu Vocational Technical School,Guangzhou 510731,China)3
Abstract:In multi-instance learning, mechanism of making use of unlabeled instance would cut down training cost and increase generalization ability of the learner. Current algorithms perform semi-supervised multi-instance learning mainly by labeling each instance in bags and transferring multi-instance learning problem to singlcinstance semi supervised ones. In this paper we introduced a bag-level semi supervised learning framework with the idea that bag's label is determined by its instances and structure. With definition of multi-instance kernel, all bags(labeled and unlabeled) were used to calculate bag-level graph laplacian, which is a penalization term added to the optimization goal. We turned this problem into an optimization problem in RKHS and got a modified multi instance kernel function by unlabeled data as result that can be directly used in traditional kernel learning framework. We performed experiment in ALOI and Internet image datasets and compareed it with related algorithms. Experiment result shows that the proposed method can get the same accuracy as supervised counterpart with less labeled bags, and with the same labeled training data set the proposed method is of higher generalization ability.
Keywords:Multi-instance learning  Semi-supervised learning  Multi-instance kernel  Support vector machine
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号