首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 35 毫秒
1.
2.
3.
4.
In this paper, we present a simple, yet very efficient global image representation for scene recognition. A scene image is represented by a histogram of local transforms, which is an extended version of census transform histogram. The local transforms include local difference sign and magnitude information. Due to strong constraints between neighboring transformed values, global structure information can be captured through the histogram and spatial pyramid representation. Principal component analysis is used to reduce the dimensionality and get a compact feature vector. Experimental results on three widely used datasets demonstrate that the proposed method could achieve competitive performance in terms of speed and accuracy.  相似文献   

5.
6.
传统潜在语义分析(Latent Semantic Analysis, LSA)方法无法获得场景目标空间分布信息和潜在主题的判别信息。针对这一问题提出了一种基于多尺度空间判别性概率潜在语义分析(Probabilistic Latent Semantic Analysis, PLSA)的场景分类方法。首先通过空间金字塔方法对图像进行空间多尺度划分获得图像空间信息,结合PLSA模型获得每个局部块的潜在语义信息;然后串接每个特定局部块中的语义信息得到图像多尺度空间潜在语义信息;最后结合提出的权值学习方法来学习不同图像主题间的判别信息,从而得到图像的多尺度空间判别性潜在语义信息,并将学习到的权值信息嵌入支持向量基(Support Vector Machine, SVM)分类器中完成图像的场景分类。在常用的三个场景图像库(Scene-13、Scene-15和Caltech-101)上的实验表明,该方法平均分类精度比现有许多state-of-art方法均优。验证了其有效性和鲁棒性。  相似文献   

7.
The sparse representation based classification methods has achieved significant performance in recent years. To fully exploit both the holistic and locality information of face samples, a series of sparse representation based methods in spatial pyramid structure have been proposed. However, there are still some limitations for these sparse representation methods in spatial pyramid structure. Firstly, all the spatial patches in these methods are directly aggregated with same weights, ignoring the differences of patches’ reliability. Secondly, all these methods are not quite robust to poses, expression and misalignment variations, especially in under-sampled cases. In this paper, a novel method named robust sparse representation based classification in an adaptive weighted spatial pyramid structure (RSRC-ASP) is proposed. RSRC-ASP builds a spatial pyramid structure for sparse representation based classification with a self-adaptive weighting strategy for residuals’ aggregation. In addition, three strategies, local-neighbourhood representation, local intra-class Bayesian residual criterion, and local auxiliary dictionary, are exploited to enhance the robustness of RSRC-ASP. Experiments on various data sets show that RSRC-ASP outperforms the classical sparse representation based classification methods especially for under-sampled face recognition problems.  相似文献   

8.
殷慧  曹永锋  孙洪 《自动化学报》2010,36(8):1099-1106
提出了多维金字塔表达算法, 并使用基于多维金字塔表达的AdaBoost实现了高分辨率合成孔径雷达(Synthetic aperture radar, SAR)图像的城区场景分类. 多维金字塔表达算法首先在局部特征的各维计算金字塔表达矢量, 再将所有的金字塔表达矢量连接起来构成多维金字塔表达矢量. 多维金字塔表达算法克服了金字塔表达算法在处理高维局部特征时, 遇到的输出金字塔表达矢量的区分力受计算效率制约的问题. 本文分别在一个TerraSAR-X图像库和一张大幅TerraSAR-X图像上比较基于金字塔表达的AdaBoost和基于多维金字塔表达的AdaBoost的分类性能. 实验结果表明, 与前者相比, 后者显著提高了计算效率同时保证了分类精度.  相似文献   

9.
Spatial pyramids have been successfully applied to incorporating spatial information into bag-of-words based image representation. However, a major drawback is that it leads to high dimensional image representations. In this paper, we present a novel framework for obtaining compact pyramid representation. First, we investigate the usage of the divisive information theoretic feature clustering (DITC) algorithm in creating a compact pyramid representation. In many cases this method allows us to reduce the size of a high dimensional pyramid representation up to an order of magnitude with little or no loss in accuracy. Furthermore, comparison to clustering based on agglomerative information bottleneck (AIB) shows that our method obtains superior results at significantly lower computational costs. Moreover, we investigate the optimal combination of multiple features in the context of our compact pyramid representation. Finally, experiments show that the method can obtain state-of-the-art results on several challenging data sets.  相似文献   

10.
11.
12.
Image classification usually requires complicated segmentation to separate foreground objects from the background scene. However, the statistical content of a background scene can actually provide very useful information for classification. In this paper, we propose a new hybrid pyramid kernel which incorporates local features extracted from both dense regular grids and interest points for image classification, without requiring segmentation. Features extracted from dense regular grids can better capture information about the background scene, while interest points detected at corners and edges can better capture information about the salient objects. In our algorithm, these two local features are combined in both the spatial and the feature-space domains, and are organized into pyramid representations. In order to obtain better classification accuracy, we fine-tune the parameters involved in the similarity measure, and we determine discriminative regions by means of relevance feedback. From the experimental results, we observe that our algorithm can achieve a 6.37 % increase in performance as compared to other pyramid-representation-based methods. To evaluate the applicability of the proposed hybrid kernel to large-scale databases, we have performed a cross-dataset experiment and investigated the effect of foreground/background features on each of the kernels. In particular, the proposed hybrid kernel has been proven to satisfy Mercer’s condition and is efficient in measuring the similarity between image features. For instance, the computational complexity of the proposed hybrid kernel is proportional to the number of features.  相似文献   

13.
14.
冯文刚 《自动化学报》2014,40(4):763-770
针对层次场景图像序列,本文提出了一种数据驱动的基于快速序列视觉表述任务(rapid serial visual presentation task,RSVP)的场景识别模型. 首先基于金字塔模型提取三层尺度图像块,然后构建包括全局和局部特征的词汇字典,接着分别利用生成模型和判决模型训练视觉词汇,最后通过神经网络从图像块标记中获得场景类别. 实验表明算法能够获得更为精确的分类结果.  相似文献   

15.
Scene classification is a complicated task, because it includes much content and it is difficult to capture its distribution.A novel hierarchical serial scene classification framework is presented in this paper. At first, we use hierarchical feature to present both the global scene and local patches containing specific objects. Hierarchy is presented by space pyramid match, and our own codebook is built by two different types of words. Secondly, we train the visual words by generative and discriminative methods respectively based on space pyramid match, which could obtain the local patch labels efficiently. Then, we use a neural network to simulate the human decision process, which leads to the final scene category from local labels. Experiments show that the hierarchical serial scene image representation and classification model obtains superior results with respect to accuracy.  相似文献   

16.
针对现有词包模型对目标识别性能的不足,对特征提取、图像表示等方面进行改进以提高目标识别的准确率。首先,以密集提取关键点的方式取代SIFT关键点提取,减少了计算时间并最大程度地描述了图像底层信息。然后采用尺度不变特征变换(Scale-invariant feature transform, SIFT)描述符和统一模式的局部二值模式(Local binary pattern,LBP)描述符描述关键点周围的形状特征和纹理特征,引入K-Means聚类算法分别生成视觉词典,然后将局部描述符进行近似局部约束线性编码,并进行最大值特征汇聚。分别采用空间金字塔匹配生成具有空间信息的直方图,最后将金字塔直方图相串联,形成特征的图像级融合,并送入SVM进行分类识别。在公共数据库中进行实验,实验结果表明,本文所提方法能取得较高的目标识别准确率。  相似文献   

17.
舒坚  胡茂林 《微机发展》2006,16(3):37-39
提出了一种智能地融合同一场景的多幅图像为一幅图像的方法,与原图像相比,产生的图像包含较少的噪声和更多的信息。首先对原始图像除去斑点噪声;然后,运用直方图均衡化表示图像的细节和最大化图像信息内容;第三步,运用金字塔将图像分解为子图像,利用图像的密度、空间频率等特征,寻找需要融合的图像的部分;第四步,对这些子图像进行配准,为融合做准备;第五步,对需要融合的子图像,计算每一幅的空间频率,对空间频率不同的配准子图像,将结合它们周围子图像的空间频率信息,给出这个子图像的频率(一般情况下,频率值选择一个具有最高值的像素);第六步,为了得到更多细节,用模糊插值方法扩大这幅子图像。  相似文献   

18.
19.
20.
空间金字塔颜色直方图在图像分类中的应用   总被引:1,自引:0,他引:1       下载免费PDF全文
颜色直方图在图像分类系统中有着重要的应用。针对像颜色直方图特征的空间关系,提出空间金字塔颜色直方图作为图像的特征表示。它结合了图像的全局特征以及分块特征的优点。使用支持向量机(SVM)以及常用的4种核函数进行了测试。在corel图像库上的实验结果表明,该特征可以有效地结合全局与空间特征,提高了图像的分类准确率。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号