
Principal Component Linear Coding for Visual Words

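Cite this article:	AI Hao-Jun, ZHANG Min, FANG Yu, ZHAO Meng-Lei, LI Tai-Zhou, WANG Hong-Xia. Principal component linear coding for visual words [J]. Journal of Software, 2013, 24(S2): 42-49.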

Authors:	AI Hao-Jun, ZHANG Min, FANG Yu, ZHAO Meng-Lei, LI Tai-Zhou, WANG Hong-Xia

Affiliations:	School of Computer Science, Wuhan University, Wuhan 430072, Hubei, China; National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan 430072, Hubei, China; School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430063, Hubei, China

Funding:	National Key Technology R&D Program of China (2012BAH35B03)

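Abstract:	To address the lack of significance testing and collinearity analysis in local linear coding of visual words for visual object classification, this paper proposes a principal component linear coding method. The method selects the K nearest-neighbor visual words that have the strongest linear correlation with a feature point, then applies principal component multiple linear regression to resolve collinearity among those visual words. This reduces the bias and instability of the coding coefficients and improves the accuracy of visual object classification. Because the sparsity of the image quantization result is an important factor in classification accuracy, the quantization results produced by principal component linear coding are further analyzed for sparsity and processed with energy regularization to improve classification efficiency. Experimental results show that, compared with existing methods, the average classification accuracy improves by more than 1%.
Keywords:	bag of visual words; collinearity; principal component regression; feature point merging; energy regularization
Received:	2012/6/15
Revised:	2013/7/22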
Principal Component Linear Coding for Visual Words

AI Hao-Jun, ZHANG Min, FANG Yu, ZHAO Meng-Lei, LI Tai-Zhou and WANG Hong-Xia. Principal Component Linear Coding for Visual Words [J]. Journal of Software, 2013, 24(S2): 42-49.

Authors:	AI Hao-Jun, ZHANG Min, FANG Yu, ZHAO Meng-Lei, LI Tai-Zhou and WANG Hong-Xia

Affiliation:	School of Computer Science, Wuhan University, Wuhan 430072, China; National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan 430072, China; School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430063, China

Abstract:	Local linear coding of visual words for visual object classification lacks significance testing and collinearity analysis. This paper proposes principal component linear coding, which selects the K nearest-neighbor visual words with the strongest linear correlation to each feature point. Multiple linear regression on principal components is used to resolve collinearity among the visual words, which reduces the bias and instability of the coding coefficients and improves the accuracy of visual object classification. Because the sparsity of the image quantization result strongly affects classification accuracy, the study also analyzes the sparsity of the quantization results obtained by principal component linear coding and processes them with energy regularization to further improve classification efficiency. Experimental results show that the method raises the average recognition rate by more than 1% over existing algorithms.

Keywords:	bag of visual words; collinearity; principal component regression; feature point merging; energy regularization
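
The coding step described in the abstract (pick the K visual words most linearly correlated with a feature point, then regress on principal components to sidestep collinearity) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `pcr_code`, the use of absolute Pearson correlation for neighbor selection, and the `var_keep` variance threshold are all assumptions made for the example, and the energy regularization step is omitted because the abstract does not define it.

```python
import numpy as np

def pcr_code(x, codebook, k=5, var_keep=0.95):
    """Code descriptor x on k codewords via principal component regression.

    x        : (d,) feature descriptor
    codebook : (m, d) visual words, one per row
    k, var_keep are illustrative parameters, not values from the paper.
    """
    # Assumed selection rule: largest absolute Pearson correlation with x.
    xc = x - x.mean()
    cc = codebook - codebook.mean(axis=1, keepdims=True)
    denom = np.linalg.norm(cc, axis=1) * np.linalg.norm(xc) + 1e-12
    corr = (cc @ xc) / denom
    idx = np.argsort(-np.abs(corr))[:k]

    # Design matrix: the d descriptor dimensions are the observations,
    # the k selected codewords are the (possibly collinear) predictors.
    B = codebook[idx].T                      # shape (d, k)
    Bc = B - B.mean(axis=0)                  # center each predictor

    # Principal components of the predictors via SVD.
    U, s, Vt = np.linalg.svd(Bc, full_matrices=False)
    energy = np.cumsum(s ** 2) / np.sum(s ** 2)
    r = min(int(np.searchsorted(energy, var_keep)) + 1, len(s))

    # Least squares on the leading r component scores, then map the
    # coefficients back to codeword space. Near-null directions caused
    # by collinear codewords are dropped, which keeps the weights stable.
    Z = U[:, :r] * s[:r]                     # component scores, (d, r)
    gamma = (Z.T @ xc) / (s[:r] ** 2)
    beta = Vt[:r].T @ gamma

    code = np.zeros(codebook.shape[0])
    code[idx] = beta
    return code

# Toy data: codeword 1 is almost an exact copy of codeword 0.
rng = np.random.default_rng(0)
codebook = rng.normal(size=(20, 16))
codebook[1] = codebook[0] + 1e-6 * rng.normal(size=16)
x = 0.5 * codebook[0] + 0.5 * codebook[1] + 0.01 * rng.normal(size=16)

code = pcr_code(x, codebook, k=3)
nonzero = np.nonzero(code)[0]
print(nonzero, code[nonzero])
```

In the toy data two codewords are nearly identical, which is the situation where ordinary least squares tends to produce large, unstable coefficients of opposite sign. Dropping the near-zero principal component instead spreads the weight almost evenly across the two collinear codewords.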
