首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于稀疏编码的多核学习图像分类方法
引用本文:亓晓振,王庆.一种基于稀疏编码的多核学习图像分类方法[J].电子学报,2012,40(4):773-779.
作者姓名:亓晓振  王庆
作者单位:西北工业大学计算机学院,陕西西安,710072
基金项目:国家自然科学基金,博士后基金
摘    要: 本文提出一种基于稀疏编码的多核学习图像分类方法.传统稀疏编码方法对图像进行分类时,损失了空间信息,本文采用对图像进行空间金字塔多划分方式为特征加入空间信息限制.在利用非线性SVM方法进行图像分类时,空间金字塔的各层分别形成一个核矩阵,本文使用多核学习方法求解各个核矩阵的权重,通过核矩阵的线性组合来获取能够对整个分类集区分能力最强的核矩阵.实验结果表明了本文所提出图像分类方法的有效性和鲁棒性.对Scene Categories场景数据集可以达到83.10%的分类准确率,这是当前该数据集上能达到的最高分类准确率.

关 键 词:图像分类  多核学习  稀疏编码  空间金字塔
收稿时间:2010-10-19

An Image Classification Approach Based on Sparse Coding and Multiple Kernel Learning
QI Xiao-zhen , WANG Qing.An Image Classification Approach Based on Sparse Coding and Multiple Kernel Learning[J].Acta Electronica Sinica,2012,40(4):773-779.
Authors:QI Xiao-zhen  WANG Qing
Affiliation:(School of Computer Science and Engineering,Northwestern Polytechnical University,Xi’an,Shaanxi 710072,China)
Abstract:A novel image classification method based on sparse coding and multiple kernel learning is proposed in the paper.Traditional methods of image classification used common sparse coding but lose the spatial information.We add this spatial information by dividing the image with the spatial pyramid.With the nonlinear SVM for image classification,each level of spatial pyramid has its own kernel,and we adopt machine learning for the optimal trade-off between different kernels.A much more discriminative kernel can be seen as the linear combination of base kernels corresponding to different pyramid levels.The experiments on the benchmark dataset show the effectiveness and robustness of our method.The precision on scene categories dataset can reach 83.10%,and it is the best result comparing to the state-of-the-art work.
Keywords:image classification  multiple kernel learning(MKL)  sparse coding  spatial pyramid
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号