排序方式: 共有42条查询结果,搜索用时 96 毫秒
1.
Face recognition in surveillance systems is important for security applications, especially in nighttime scenarios when the subject is far away from the camera. However, due to the face image quality degradation caused by large camera standoff and low illuminance, nighttime face recognition at large standoff is challenging. In this paper, we report a system that is capable of collecting face images at large standoff in both daytime and nighttime, and present an augmented heterogeneous face recognition (AHFR) approach for cross-distance (e.g., 150 m probe vs. 1 m gallery) and cross-spectral (near-infrared probe vs. visible light gallery) face matching. We recover high-quality face images from degraded probe images by proposing an image restoration method based on Locally Linear Embedding (LLE). The restored face images are matched to the gallery by using a heterogeneous face matcher. Experimental results show that the proposed AHFR approach significantly outperforms the state-of-the-art methods for cross-spectral and cross-distance face matching. 相似文献
2.
Parameter-free geometric document layout analysis 总被引:1,自引:0,他引:1
Seong-Whan Lee Dae-Seok Ryu 《IEEE transactions on pattern analysis and machine intelligence》2001,23(11):1240-1256
Automatic transformation of paper documents into electronic documents requires geometric document layout analysis at the first stage. However, variations in character font sizes, text line spacing, and document layout structures have made it difficult to design a general-purpose document layout analysis algorithm for many years. The use of some parameters has therefore been unavoidable in previous methods. The authors propose a parameter-free method for segmenting the document images into maximal homogeneous regions and identifying them as texts, images, tables, and ruling lines. A pyramidal quadtree structure is constructed for multiscale analysis and a periodicity measure is suggested to find a periodical attribute of text regions for page segmentation. To obtain robust page segmentation results, a confirmation procedure using texture analysis is applied to only ambiguous regions. Based on the proposed periodicity measure, multiscale analysis, and confirmation procedure, we could develop a robust method for geometric document layout analysis independent of character font sizes, text line spacing, and document layout structures. The proposed method was experimented with the document database from the University of Washington and the MediaTeam Document Database. The results of these tests have shown that the proposed method provides more accurate results than previous ones 相似文献
3.
Sang-Cheol Park Author Vitae Author Vitae Seong-Whan Lee Author Vitae 《Pattern recognition》2004,37(4):767-779
In this paper, we propose a new method for estimating camera motion parameters based on optical flow models. Camera motion parameters are generated using linear combinations of optical flow models. The proposed method first creates these optical flow models, and then linear decompositions are performed on the input optical flows calculated from adjacent images in the video sequence, which are used to estimate the coefficients of each optical flow model. These coefficients are then applied to the parameters used to create each optical flow model, and the camera motion parameters implied in the adjacent images can be estimated through a linear composition of the weighted parameters.We demonstrated that the proposed method estimates the camera motion parameters accurately and at a low computational cost as well as robust to noise residing in the video sequence being analyzed. 相似文献
4.
The neuroimaging community heavily relies on statistical inference to explain measured brain activity given the experimental paradigm. Undeniably, this method has led to many results, but it is limited by the richness of the generative models that are deployed, typically in a mass-univariate way. Such an approach is suboptimal given the high-dimensional and complex spatiotemporal correlation structure of neuroimaging data.Over the recent years, techniques from pattern recognition have brought new insights into where and how information is stored in the brain by prediction of the stimulus or state from the data. Pattern recognition is intrinsically multivariate and the underlying models are data-driven. Moreover, the predictive setting is more powerful for many applications, including clinical diagnosis and brain–computer interfacing. This special issue features a number of papers that identify and tackle remaining challenges in this field. The specific problems at hand constitute opportunities for future research in pattern recognition and neurosciences. 相似文献
5.
In temporal data analysis, noisy data is inevitable in both testing and training. This noise can seriously influence the performance of the temporal data analysis. To address this problem, we propose a novel method, termed Selective Temporal Filtering that builds a noise-free model for classification during training and identifies key-feature vectors that are noise-filtered data from the input sequence during testing. The use of these key-feature vectors makes the classifier robust to noise within the input space. The proposed method is validated on a synthetic-dataset and a database of American Sign Language. Using key-feature vectors results in robust performance with respect to the noise content. Futhermore, we are able to show that the proposed method not only outperforms Conditional Random Fields and Hidden Markov Models in noisy environments, but also in a well-controlled environment where we assume no significant noise vectors exist. 相似文献
6.
Chang-Yu Lu Myung-Cheol Roh Seung-Yeon Kang Seong-Whan Lee 《Pattern Analysis & Applications》2012,15(2):175-187
The amount of user created contents has been increasing rapidly and is associated with a serious copyright problem. Automatic logo detection and recognition in videos is a natural and efficient way of overcoming the copyright problem. However, logos have varying characteristics, which make logo detection and recognition very difficult. Moreover, logo transitions between two different logos exist in one video comprising several video contents. This disrupts the automatic logo detection and recognition. Therefore, in order to improve logo detection, it is necessary to take into account the logo transitions explicitly. This paper proposes an accurate logo transition detection method for recognizing logos in digital video contents. The proposed method accurately segments a video according to logo and efficiently recognizes various types of logos. The experimental results demonstrate the effectiveness of the proposed method for logo detection and video segmentation according to logo. 相似文献
7.
Sang-Woong Lee Author VitaeAuthor Vitae Seong-Whan Lee Author Vitae 《Pattern recognition》2007,40(5):1605-1620
Recently, the importance of face recognition has been increasingly emphasized since popular CCD cameras are distributed to various applications. However, facial images are dramatically changed by lighting variations, so that facial appearance changes caused serious performance degradation in face recognition. Many researchers have tried to overcome these illumination problems using diverse approaches, which have required a multiple registered images per person or the prior knowledge of lighting conditions. In this paper, we propose a new method for face recognition under arbitrary lighting conditions, given only a single registered image and training data under unknown illuminations. Our proposed method is based on the illuminated exemplars which are synthesized from photometric stereo images of training data. The linear combination of illuminated exemplars can represent the new face and the weighted coefficients of those illuminated exemplars are used as identity signature. We make experiments for verifying our approach and compare it with two traditional approaches. As a result, higher recognition rates are reported in these experiments using the illumination subset of Max-Planck Institute face database and Korean face database. 相似文献
8.
9.
10.
Automatic document processing: A survey 总被引:8,自引:0,他引:8