期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Segmenting images using localized histograms and region merging 总被引：1，自引：0，他引：1

J. Ross Beveridge Joey Griffith Ralf R. Kohler Allen R. Hanson Edward M. Riseman 《International Journal of Computer Vision》1989,2(3):311-347

A working system for segmenting images of complex scenes is presented. The system integrates techniques that have evolved out of many years of research in low-level image segmentation at the University of Massachusetts and elsewhere. This paper documents the result of this historical evolution. Segmentations produced by the system are used extensively in related image interpretation research.The system first produces segmentations based upon an analysis of spatially localized feature histograms. These initial segmentations are then simplified using a region merging algorithm. Parameter selection for the local histogram segmentation algorithm is facilitated by mapping the multidimensional parameter space to a one-dimensional parameter which regulates region fragmentation. An extension of this algorithm to multiple features is also presented. Experience with roughly 100 images from different domains has shown the system to be robust and effective. Samples of these results are included. 相似文献

2.

Automatic segmentation of granular objects in images: Combining local density clustering and gradient-barrier watershed

Huiguang Yang Narendra AhujaAuthor Vitae 《Pattern recognition》2014

Blob or granular object recognition is an image processing task with a rich application background, ranging from cell/nuclei segmentation in biology to nanoparticle recognition in physics. In this study, we establish a new and comprehensive framework for granular object recognition. Local density clustering and connected component analysis constitute the first stage. To separate overlapping objects, we further propose a modified watershed approach called the gradient-barrier watershed, which better incorporates intensity gradient information into the geometrical watershed framework. We also revise the marker-finding procedure to incorporate a clustering step on all the markers initially found, potentially grouping multiple markers within the same object. The gradient-barrier watershed is then conducted based on those markers, and the intensity gradient in the image directly guides the water flow during the flooding process. We also propose an important scheme for edge detection and fore/background separation called the intensity moment approach. Experimental results for a wide variety of objects in different disciplines – including cell/nuclei images, biological colony images, and nanoparticle images – demonstrate the effectiveness of the proposed framework. 相似文献

3.

Extraction of sparse features of color images in recognizing objects

Quyen Bui T. T. Vu Thang T. Hong Keum-Shik 《International Journal of Control, Automation and Systems》2016,14(2):616-627

International Journal of Control, Automation and Systems - In this paper, we propose a new object recognition framework that combines Gabor energy filters, a visual cortex model in which single... 相似文献

4.

Extracting objects from range and radiance images 总被引：6，自引：0，他引：6

Yizhou Yu Ferencz A. Malik J. 《IEEE transactions on visualization and computer graphics》2001,7(4):351-364

In this paper, we present a pipeline and several key techniques necessary for editing a real scene captured with both cameras and laser range scanners. We develop automatic algorithms to segment the geometry from range images into distinct surfaces, register texture from radiance images with the geometry, and synthesize compact high-quality texture maps. The result is an object-level representation of the scene which can be rendered with modifications to structure via traditional rendering methods. The segmentation algorithm for geometry operates directly on the point cloud from multiple registered 3D range images instead of a reconstructed mesh. It is a top-down algorithm which recursively partitions a point set into two subsets using a pairwise similarity measure. The result is a binary tree with individual surfaces as leaves. Our image registration technique performs a very efficient search to automatically find the camera poses for arbitrary position and orientation relative to the geometry. Thus, we can take photographs from any location without precalibration between the scanner and the camera. The algorithms have been applied to large-scale real data. We demonstrate our ability to edit a captured scene by moving, inserting, and deleting objects 相似文献

5.

MOVIES: indexing moving objects by shooting index images

Jens Dittrich Lukas Blunschi Marcos Antonio Vaz Salles 《GeoInformatica》2011,15(4):727-767

With the exponential growth of moving objects data to the Gigabyte range, it has become critical to develop effective techniques for indexing, updating, and querying these massive data sets. To meet the high update rate as well as low query response time requirements of moving object applications, this paper takes a novel approach in moving object indexing. In our approach, we do not require a sophisticated index structure that needs to be adjusted for each incoming update. Rather, we construct conceptually simple short-lived index images that we only keep for a very short period of time (sub-seconds) in main memory. As a consequence, the resulting technique MOVIES supports at the same time high query rates and high update rates, trading this property for query result staleness. Moreover, MOVIES is the first main memory method supporting time-parameterized predictive queries. To support this feature, we present two algorithms: non-predictive MOVIES and predictive MOVIES. We obtain the surprising result that a predictive indexing approach—considered state-of-the-art in an external-memory scenario—does not scale well in a main memory environment. In fact, our results show that MOVIES outperforms state-of-the-art moving object indexes such as a main-memory adapted B^x-tree by orders of magnitude w.r.t. update rates and query rates. In our experimental evaluation, we index the complete road network of Germany consisting of 40,000,000 road segments and 38,000,000 nodes. We scale our workload up to 100,000,000 moving objects, 58,000,000 updates per second and 10,000 queries per second, a scenario at a scale unmatched by any previous work. 相似文献

6.

Motion analysis of articulated objects from monocular images 总被引：2，自引：0，他引：2

Zhang X Liu Y Huang TS 《IEEE transactions on pattern analysis and machine intelligence》2006,28(4):625-636

This paper presents a new method of motion analysis of articulated objects from feature point correspondences over monocular perspective images without imposing any constraints on motion. An articulated object is modeled as a kinematic chain consisting of joints and links, and its 3D joint positions are estimated within a scale factor using the connection relationship of two links over two or three images. Then, twists and exponential maps are employed to represent the motion of each link, including the general motion of the base link and the rotation of other links around their joints. Finally, constraints from image point correspondences, which are similar to that of the essential matrix in rigid motion, are developed to estimate the motion. In the algorithm, the characteristic of articulated motion, i.e., motion correlation among links, is applied to decrease the complexity of the problem and improve the robustness. A point pattern matching algorithm for articulated objects is also discussed in this paper. Simulations and experiments on real images show the correctness and efficiency of the algorithms. 相似文献

7.

SAR imagery segmentation by statistical region growing and hierarchical merging

E.A. Carvalho D.M. Ushizima F.N.S. Medeiros C.I.O. Martins R.C.P. Marques I.N.S. Oliveira 《Digital Signal Processing》2010,20(5):1365-1378

This paper presents an algorithm to segment synthetic aperture radar (SAR) images, corrupted by speckle noise. Most standard segmentation techniques may require speckle filtering previously. Our approach performs radar image segmentation using the original noisy pixels as input data, i.e. without any preprocessing step. The algorithm includes a statistical region growing procedure combined with hierarchical region merging. The region growing step oversegments the input radar image, thus enabling region aggregation by employing a combination of the Kolmogorov–Smirnov (KS) test with a hierarchical stepwise optimization (HSWO) algorithm for performance improvement. We have tested and assessed the proposed technique on artificially speckled image and real SAR data. 相似文献

8.

Model-based recognition of 3D objects from single images 总被引：1，自引：0，他引：1

Weiss I. Ray M. 《IEEE transactions on pattern analysis and machine intelligence》2001,23(2):116-128

In this work, we treat major problems of object recognition which have received relatively little attention lately. Among them are the loss of depth information in the projection from a 3D object to a single 2D image, and the complexity of finding feature correspondences between images. We use geometric invariants to reduce the complexity of these problems. There are no geometric invariants of a projection from 3D to 2D. However, given certain modeling assumptions about the 3D object, such invariants can be found. The modeling assumptions can be either a particular model or a generic assumption about a class of models. Here, we use such assumptions for single-view recognition. We find algebraic relations between the invariants of a 3D model and those of its 2D image under general projective projection. These relations can be described geometrically as invariant models in a 3D invariant space, illuminated by invariant “light rays,” and projected onto an invariant version of the given image. We apply the method to real images 相似文献

9.

Recognition of objects given by collections of multichannel images

M. M. Lange D. Yu. Stepanov 《Pattern Recognition and Image Analysis》2014,24(3):431-442

This paper proposes a multiclass metric classifier of composite objects given by collections of multichannel images that are generated by various sources. The classifier is constructed in a space of multilayer tree-structured representations of the objects and based on making a decision by weighted voting of template objects. In the set of the object representations, we define a family of embedded measures, which provides a scheme of fusing the sources and channels by the general weighted measure. A computational gain of a proposed guided search algorithm as compared with an exhaustive search decision algorithm is estimated analytically. An efficiency of the classifier is demonstrated by experimental estimates of recognition error rates for biometric composite objects that are produced by a couple of sources which generate grayscale images of signatures and color images of faces. The recognition error rates over the individual channels of the sources, as well as over the couple of the sources, are given. An advantage of the fusion scheme by the general weighted measure in relation to the known fusion scheme by voting decisions over individual channels is shown. 相似文献

10.

Algorithms and timing for identification of objects from 2-D images

W. Lin D. A. Fraser 《Concurrency and Computation》1991,3(4):325-331

相似文献

11.

PDF文件中可识别图像的提取

王津涛康晓东李玫王冬梅赵强《计算机工程与设计》2006,27(9):1539-1541

PDF（portable documentformat）文件是用于电子文档分发的理想格式,是全球电子文档分发的开放式标准.从PDF文件中提取可供识别的图像,有利于图像识别和信息处理.详细介绍了一种从PDF文件中提取可识别图像的方案. 相似文献

12.

Representation of enclosing surfaces from simple voxelized objects by means of a chain code

Eduardo Lemus Ernesto Bribiesca Edgar Garduño 《Pattern recognition》2014

相似文献

13.

Extraction of rules from natural objects for automated mechanicalprocessing

Gamage L.B. Gosine R.G. de Silva C.W. 《IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society》1996,26(1):105-120

In process applications, fast and accurate extraction of complex information from an object for the purpose of mechanical processing of that object, is often required. In this paper, a general rule-based approach is developed using a database of measurable geometric “features” and associated complex information. The rules relate the features to the complex processing information. During the on-line processing, the object features are measured and passed into the rule base. The output from the rule base is the complex information that is needed to process the object. A methodology is developed to generate probabilistic rules for the rule base using multivariate probability densities. A knowledge integration scheme is also developed which combines statistical knowledge with expert knowledge in order to improve the reliability and efficiency of information extraction. The rule generation methodology is implemented in a knowledge-based vision system for process information recognition. As an illustrative example, the problem of efficient head removal in an automated salmon processing plant is considered 相似文献

14.

Inferring region salience from binary and gray-level images

Yossi CohenAuthor VitaeRonen BasriAuthor Vitae 《Pattern recognition》2003,36(10):2349-2362

We introduce a method that uses contour fragments to highlight regions of interest. Our method obtains as input either a binary image or the gradient map of a gray-level image. It produces a saliency map that reflects for every point in the image our belief that it belongs to a salient region. Saliency is determined by criteria such as closure, convexity, and size. In addition, gaps in the boundaries of regions diminish their saliency. Explicit scale parameter determines the size of interest. The method is implemented by a convolution of the input edge image with a linear filter that specifies the region of influence of a contour point over the image. Experiments demonstrate the utility of the method for saliency and segmentation. 相似文献

15.

Learning what is where from unlabeled images: joint localization and clustering of foreground objects

Ashok Chandrashekar Lorenzo Torresani Richard Granger 《Machine Learning》2014,94(2):261-279

“What does it mean, to see? The plain man’s answer would be, to know what is where by looking.” This famous quote by David Marr (Vision: A Computational Investigation into the Human Representation and Processing of Visual Information, Freeman, New York, 1982) sums up the holy grail of vision: discovering what is present in the world, and where it is, from unlabeled images. In this paper we tackle this challenging problem by proposing a generative model of object formation and describe an efficient algorithm to automatically learn the parameters of the model from a collection of unlabeled images. Our algorithm discovers the objects and their spatial extents by clustering together images containing similar foregrounds. Our approach simultaneously solves for the image clusters, the foreground appearance models and the spatial regions containing the objects by optimizing a single likelihood function defined over the entire image collection. We describe two methods for efficient foreground localization: the first method does not require any bottom-up image segmentation and discovers the foreground region as a contiguous rectangular bounding box. The second method expresses the foreground as a collection of super-pixels generated through a bottom-up segmentation of the image. However, unlike previous methods, objects are not assumed to be encapsulated by a single segment. Evaluation on standard benchmarks and comparison with prior methods demonstrate that our approach achieves state-of-the-art results on the problem of unsupervised foreground localization and clustering. 相似文献

16.

SimLocator: robust locator of similar objects in images

Yan Kong Weiming Dong Xing Mei Xiaopeng Zhang Jean-Claude Paul 《The Visual computer》2013,29(9):861-870

Similar objects commonly appear in natural images, and locating and cutting out these objects can be tedious when using classical interactive image segmentation methods. In this paper, we propose SimLocator, a robust method oriented to locate and cut out similar objects with minimum user interaction. After extracting an arbitrary object template from the input image, candidate locations of similar objects are roughly detected by distinguishing the shape and color features of each image. A novel optimization method is then introduced to select accurate locations from the two sets of candidates. Additionally, a matting-based method is used to improve the results and to ensure that all similar objects are located in the image. Finally, a method based on alpha matting is utilized to extract the precise object contours. To ensure the performance of the matting operation, this work has developed a new method for foreground extraction. Experiments show that SimLocator is more robust and more convenient to use compared to other more advanced repetition detection and interactive image segmentation methods, in terms of locating similar objects in images. 相似文献

17.

Segmentation and classification of hyperspectral images using watershed transformation

Y. Tarabalka Author Vitae J. Chanussot^{Author Vitae} 《Pattern recognition》2010,43(7):2367-2379

Hyperspectral imaging, which records a detailed spectrum of light for each pixel, provides an invaluable source of information regarding the physical nature of the different materials, leading to the potential of a more accurate classification. However, high dimensionality of hyperspectral data, usually coupled with limited reference data available, limits the performances of supervised classification techniques. The commonly used pixel-wise classification lacks information about spatial structures of the image. In order to increase classification performances, integration of spatial information into the classification process is needed. In this paper, we propose to extend the watershed segmentation algorithm for hyperspectral images, in order to define information about spatial structures. In particular, several approaches to compute a one-band gradient function from hyperspectral images are proposed and investigated. The accuracy of the watershed algorithms is demonstrated by the further incorporation of the segmentation maps into a classifier. A new spectral-spatial classification scheme for hyperspectral images is proposed, based on the pixel-wise Support Vector Machines classification, followed by majority voting within the watershed regions. Experimental segmentation and classification results are presented on two hyperspectral images. It is shown in experiments that when the number of spectral bands increases, the feature extraction and the use of multidimensional gradients appear to be preferable to the use of vectorial gradients. The integration of the spatial information from the watershed segmentation in the hyperspectral image classifier improves the classification accuracies and provides classification maps with more homogeneous regions, compared to pixel-wise classification and previously proposed spectral-spatial classification techniques. The developed method is especially suitable for classifying images with large spatial structures. 相似文献

18.

System for reconstruction of three-dimensional micro objects from multiple photographic images

Koutarou Atsushi Hidetada Sueyasu Yusuke Funayama Takashi MaekawaAuthor vitae 《Computer aided design》2011,(8):1045-1055

We introduce a system to reconstruct a three-dimensiojnal (3D) polygonal model of 3D micro objects with outer dimensions ranging from several hundred microns to several millimeters from multiple two-dimensional (2D) images of an object taken from different views. The data acquisition system consists of a digital microscope that captures still images at a resolution of 1600 × 1200 pixels and a computer-controlled turntable. We employ the shape-from-silhouette (SFS) method to construct a voxel-based 3D model from silhouette images. The concave shapes are further carved by using the space carving technique. In order to make the resulting model compatible with a commercial CAD/CAM system, the voxel model is converted into a triangular mesh using the marching cubes algorithm. Because the mesh generated from the voxel model by using the marching cubes algorithm inherits the staircase effect, the mesh is adjusted to recover the object precisely by using silhouette images. Finally, we evaluate the accuracy of the proposed method. The reconstructed models of complex micro objects indicate the effectiveness of the 3D shape reconstruction system for micro objects. 相似文献

19.

Reconstruction of images and recognition of polyhedral objects

Ya. A. Furman R. V. Eruslanov I. L. Egoshina 《Pattern Recognition and Image Analysis》2012,22(1):196-209

An indirect way of reconstructing the coordinates of points on the surface of a 3D object by its planar parallel projections is proposed. The approach is based on the substitution of the object by another (virtual) object, for which this operation can be carried out simply, whereas the correctness of the obtained results is controlled. The specificities of obtaining a mathematical model of reconstructed objects with a polyhedral shape, the issues of normalization of the angular discrepancies between the recognized and the etalon objects, and the solution of the problem of their recognition based on the introduced model are considered. 相似文献

20.

Retrieval of trademark images by means of size functions

A. Cerri M. Ferri D. Giorgi 《Graphical Models》2006,68(5-6):451

相似文献