共查询到20条相似文献,搜索用时 15 毫秒
1.
基于显著点特征多示例学习的图像检索方法 总被引:2,自引:0,他引:2
提出了一种基于图像显著点特征进行多示例学习(Multiple-instance learning)的图像检索方法.该方法对图像进行小波分解并跟踪不同尺度小波系数提取图像显著点;然后利用显著点特征进行检索,并在相关反馈中将图像看作多示例包,通过期望最大多样性密度(EM-DD,expectation maximization diverse density)方法进行多示例学习,获得体现图像语义的日标特征.在Corel和SIVAL两个图像库进行实验,结果表明该方法明显提高了检索的准确性. 相似文献
2.
《Journal of Visual Communication and Image Representation》2014,25(5):1130-1135
Content-based pornographic image detection, in which region-of-interest (ROI) plays an important role, is effective to filter pornography. Traditionally, skin-color regions are extracted as ROI. However, skin-color regions are always larger than the subareas containing pornographic parts, and the approach is difficult to differentiate between human skins and other objects with the skin-colors. In this paper, a novel approach of extracting salient region is presented for pornographic image detection. At first, a novel saliency map model is constructed. Then it is integrated with a skin-color model and a face detection model to capture ROI in pornographic images. Next, a ROI-based codebook algorithm is proposed to enhance the representative power of visual-words. Taking into account both the speed and the accuracy, we fuse speed up robust features (SURF) with color moments (CM). Experimental results show that the precision of our ROI extraction method averagely achieves 91.33%, more precisely than that of using the skin-color model alone. Besides, the comparison with the state-of-the-art methods of pornographic image detection shows that our approach is able to remarkably improve the performance. 相似文献
3.
Learning effective relevance measures plays a crucial role in improving the performance of content-based image retrieval (CBIR) systems. Despite extensive research efforts for decades, how to discover and incorporate semantic information of images still poses a formidable challenge to real-world CBIR systems. In this paper, we propose a novel hybrid textual-visual relevance learning method, which mines textual relevance from image tags and combines textual relevance and visual relevance for CBIR. To alleviate the sparsity and unreliability of tags, we first perform tag completion to fill the missing tags as well as correct noisy tags of images. Then, we capture users’ semantic cognition to images by representing each image as a probability distribution over the permutations of tags. Finally, instead of early fusion, a ranking aggregation strategy is adopted to sew up textual relevance and visual relevance seamlessly. Extensive experiments on two benchmark datasets well verified the promise of our approach. 相似文献
4.
Naif Alajlan Mohamed S. Kamel George Freeman 《Signal Processing: Image Communication》2006,21(10):904-918
We aim at developing a geometry-based retrieval system for multi-object images. We model both shape and topology of image objects including holes using a structured representation called curvature tree (CT); the hierarchy of the CT reflects the inclusion relationships between the objects and holes. To facilitate shape-based matching, triangle-area representation (TAR) of each object and hole is stored at the corresponding node in the CT. The similarity between two multi-object images is measured based on the maximum similarity subtree isomorphism (MSSI) between their CTs. For this purpose, we adapt a continuous optimization approach to solve the MSSI problem and a very effective dynamic programming algorithm to measure the similarity between the attributed nodes. Our matching scheme agrees with many recent findings in psychology about the human perception of multi-object images. Experiments on a database of 1500 logos and the MPEG-7 CE-1 database of 1400 shape images have shown the significance of the proposed method. 相似文献
5.
基于MPEG-7的图像检索模型研究 总被引:5,自引:0,他引:5
研究了基于内容的图像检索系统中的目标描述模型的建立方法。首先指出了目标描述模型是图像检索的关键技术,然后分析了MPEG-7草案中有关多媒体描述的基本术语、描述机制和MPEG-7的应用框架,最后基于MPEG-7提出了一种适合于图像检索的目标描述模型。该模型对提取出的多种视觉特征和相应的表示方法采用了分层结构。模型满足用户对所需特征进行不同级别检索的要求。 相似文献
6.
如何为内容丰富多变的大量图像数据编制索引并利用该索引进行高效地相似检索是研究的核心问题。相似图像检索系统通过图像特征提取器提取图像的特征,提供访问图像内容的方法;距离函数是用来计算这些特征之间相似程度的主要工具。实验证明,该系统可以高效地为用户检索出指定特征的图像,对实际应用具有重要的价值。 相似文献
7.
It is significant to detect and track soccer players in broadcast sports video, which is helpful to analysis player activity and team tactics. However, it is challenging to efficiently detect and track soccer players with shots switched and noise caused by auditorium and billboards. And for multi-player tracking how to treat the increase or decrease of player are also difficult. In this paper, a robust player detection algorithm based on salient region detection and tracking based on enhanced particle filtering are proposed. Salient region detection is used to segment sports fields, and then soccer players are detected by edge detection combined with Otsu algorithm. For soccer players tracking, we use an enhanced particle filter which we improve the algorithm in sample and the likelihood function combing the color feature and edge feature. Experimental results show the proposed algorithm can quickly and accurately detect and track soccer players in broadcast video. 相似文献
8.
Embedded colour image coding for content-based retrieval 总被引:1,自引:0,他引:1
9.
This paper proposes a novel image retrieval model based on monogenic signal representation. An original image is decomposed into three complementary components: amplitude, orientation and phase by monogenic signal representation. The monogenic variation in each local region and monogenic feature in each pixel are encoded, and then the statistical features of the local features encoded are calculated. In order to overcome the problem of high feature dimensionality, the local statistical features extracted from the complementary monogenic components are projected by block-based fisher discriminant analysis, which not only reduces the dimensionality of the features extracted, but also enhances its discriminative power. Finally, these features reduced are fused for effective image retrieval. Experimental results show that our scheme can effectively describe an image, and obviously improve the average retrieval precision. 相似文献
10.
针对单一特征不能很好地表述图像的问题,提出了一种融合多特征的图像检索算法.首先,提取查询图像和图像库中样本图像的GIST(Generalized Search Tree)特征,用欧氏距离衡量图像间的GIST相似度值,根据查询图像的GIST特征在图像库中进行检索,将结果按相似度进行排序;然后,提取查询图像和返回结果中前k幅图像的尺度不变特征变换(SIFT)特征,使用BBF(Best Bin First)算法进行特征匹配;最后,通过特征点匹配点对数排序并返回检索结果.实验在改进的Corel1000数据集上进行,与传统的单特征图像检索算法对比,提出的图像检索算法不仅提高了检索准确率,而且获得了较好的检索效率. 相似文献
11.
多示例学习对处理各类歧义问题有较好的效果,将它应用于周像检索问题,提出了一种新的基于多示例学习的图像检索方法。首先提取每幅图像的局部区域特征,通过对这些特征聚类求得一组基向量,并利用它们对每个局部特征向量进行编码,接着使用均值漂移聚类算法对图像进行分割,根据局部特征点位置所对应的分割块划分特征编码到相应的子集,最后将每组编码子集聚合成一个向量,这样每幅图像对应一个多示例包。根据用户选择的图像生成正包和反包,采用多示例学习算法进行学习,取得了较为满意的结果。 相似文献
12.
Combining positive and negative examples in relevance feedback for content-based image retrieval 总被引:2,自引:0,他引:2
M. L. Kherfi D. Ziou A. Bernardi 《Journal of Visual Communication and Image Representation》2003,14(4):428-457
In this paper, we address some issues related to the combination of positive and negative examples to improve the efficiency of image retrieval. We start by analyzing the relevance of the negative example and how it can be interpreted and utilized to mitigate certain problems in image retrieval, such as noise, miss, the page zero problem and feature selection. Then we propose a new relevance feedback approach that uses the positive example (PE) to perform generalization and the negative example (NE) to perform specialization. In this approach, a query containing both PE and NE is processed in two steps. The first step considers the PE alone, in order to reduce the set of images participating in retrieval to a more homogeneous subset. Then, the second step considers both PE and NE and acts on the images retained in the first step. Mathematically, relevance feedback is formulated as an optimization of the intra and inter variances of the PE and NE. The proposed relevance feedback algorithm was implemented in our image retrieval system, which we tested on a collection of more than 10,000 images. The experimental results show how the NE as considered in our model can contribute in improving the relevance of the images retrieved. 相似文献
13.
14.
15.
16.
17.
《Journal of Visual Communication and Image Representation》2014,25(6):1308-1323
Content-based image retrieval (CBIR) has been an active research topic in the last decade. As one of the promising approaches, salient point based image retrieval has attracted many researchers. However, the related work is usually very time consuming, and some salient points always may not represent the most interesting subset of points for image indexing. Based on fast and performant salient point detector, and the salient point expansion, a novel content-based image retrieval using local visual attention feature is proposed in this paper. Firstly, the salient image points are extracted by using the fast and performant SURF (Speeded-Up Robust Features) detector. Then, the visually significant image points around salient points can be obtained according to the salient point expansion. Finally, the local visual attention feature of visually significant image points, including the weighted color histogram and spatial distribution entropy, are extracted, and the similarity between color images is computed by using the local visual attention feature. Experimental results, including comparisons with the state-of-the-art retrieval systems, demonstrate the effectiveness of our proposal. 相似文献
18.
Relevance feedback (RF) is an effective approach to bridge the gap between low-level visual features and high-level semantic meanings in content-based image retrieval (CBIR). The support vector machine (SVM) based RF mechanisms have been used in different fields of image retrieval, but they often treat all positive and negative feedback samples equally, which will inevitably degrade the effectiveness of SVM-based RF approaches for CBIR. In fact, positive and negative feedback samples, different positive feedback samples, and different negative feedback samples all always have distinct properties. Moreover, each feedback interaction process is usually tedious and time-consuming because of complex visual features, so if too many times of iteration of feedback are asked, users may be impatient to interact with the CBIR system. To overcome the above limitations, we propose a new SVM-based RF approach using probabilistic feature and weighted kernel function in this paper. Firstly, the probabilistic features of each image are extracted by using principal components analysis (PCA) and the adapted Gaussian mixture models (AGMM) based dimension reduction, and the similarity is computed by employing Kullback–Leibler divergence. Secondly, the positive feedback samples and negative feedback samples are marked, and all feedback samples’ weight values are computed by utilizing the samples-based Relief feature weighting. Finally, the SVM kernel function is modified dynamically according to the feedback samples’ weight values. Extensive simulations on large databases show that the proposed algorithm is significantly more effective than the state-of-the-art approaches. 相似文献
19.
Md. Mahmudur Rahman Prabir Bhattacharya Bipin C. Desai 《Journal of Visual Communication and Image Representation》2009,20(7):450-462
This paper presents a learning-based unified image retrieval framework to represent images in local visual and semantic concept-based feature spaces. In this framework, a visual concept vocabulary (codebook) is automatically constructed by utilizing self-organizing map (SOM) and statistical models are built for local semantic concepts using probabilistic multi-class support vector machine (SVM). Based on these constructions, the images are represented in correlation and spatial relationship-enhanced concept feature spaces by exploiting the topology preserving local neighborhood structure of the codebook, local concept correlation statistics, and spatial relationships in individual encoded images. Finally, the features are unified by a dynamically weighted linear combination of similarity matching scheme based on the relevance feedback information. The feature weights are calculated by considering both the precision and the rank order information of the top retrieved relevant images of each representation, which adapts itself to individual searches to produce effective results. The experimental results on a photographic database of natural scenes and a bio-medical database of different imaging modalities and body parts demonstrate the effectiveness of the proposed framework. 相似文献