期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Incremental learning of bidirectional principal components for face recognition

Chuan-Xian Ren Author Vitae Author Vitae 《Pattern recognition》2010,43(1):318-330

Recently, bidirectional principal component analysis (BDPCA) has been proven to be an efficient tool for pattern recognition and image analysis. Encouraging experimental results have been reported and discussed in the literature. However, BDPCA has to be performed in batch mode, it means that all the training data has to be ready before we calculate the projection matrices. If there are additional samples need to be incorporated into an existing system, it has to be retrained with the whole updated training set. Moreover, the scatter matrices of BDPCA are formulated as the sum of K (samples size) image covariance matrices, this leads to the incremental learning directly on the scatters impossible, thus it presents new challenge for on-line training.In fact, there are two major reasons for building incremental algorithms. The first reason is that in some cases, when the number of training images is very large, the batch algorithm cannot process the entire training set due to large computational or space requirements of the batch approach. The second reason is when the learning algorithm is supposed to operate in a dynamical settings, that all the training data is not given in advance, and new training samples may arrive at any time, and they have to be processed in an on-line manner. Through matricizations of third-order tensor, we successfully transfer the eigenvalue decomposition problem of scatters to the singular value decomposition (SVD) of corresponding unfolded matrices, followed by complexity and memory analysis on the novel algorithm. A theoretical clue for selecting suitable dimensionality parameters without losing classification information is also presented in this paper. Experimental results on FERET and CMU PIE (pose, illumination, and expression) databases show that the IBDPCA algorithm gives a close approximation to the BDPCA method, but using less time. 相似文献

2.

Multiscale facial structure representation for face recognition under varying illumination

Taiping Zhang Author Vitae Author Vitae Yuan Yuan Author Vitae Author Vitae Zhaowei Shang Author Vitae Author Vitae Fangnian Lang Author Vitae 《Pattern recognition》2009,42(2):251-258

Facial structure of face image under lighting lies in multiscale space. In order to detect and eliminate illumination effect, a wavelet-based face recognition method is proposed in this paper. In this work, the effect of illuminations is effectively reduced by wavelet-based denoising techniques, and meanwhile the multiscale facial structure is generated. Among others, the proposed method has the following advantages: (1) it can be directly applied to single face image, without any prior information of 3D shape or light sources, nor many training samples; (2) due to the multiscale nature of wavelet transform, it has better edge-preserving ability in low frequency illumination fields; and (3) the parameter selection process is computationally feasible and fast. Experiments are carried out upon the Yale B and CMU PIE face databases, and the results demonstrate that the proposed method achieves satisfactory recognition rates under varying illumination conditions. 相似文献

3.

Deep learning face representation by fixed erasing in facial landmarks

Lei Jie Zhang BaiYan Ling HeFei 《Multimedia Tools and Applications》2019,78(19):27703-27718

Multimedia Tools and Applications - Face verification (FV) is a challenging problem, because occlusion, posture, illumination, aging will affect the accuracy of FV. Deep convolutional neural... 相似文献

4.

Hybrid-boost learning for multi-pose face detection and facial expression recognition

Hsiuao-Ying Chen Chung-Lin Huang Chih-Ming Fu 《Pattern recognition》2008,41(3):1173-1185

This paper proposes a hybrid-boost learning algorithm for multi-pose face detection and facial expression recognition. To speed-up the detection process, the system searches the entire frame for the potential face regions by using skin color detection and segmentation. Then it scans the skin color segments of the image and applies the weak classifiers along with the strong classifier for face detection and expression classification. This system detects human face in different scales, various poses, different expressions, partial-occlusion, and defocus. Our major contribution is proposing the weak hybrid classifiers selection based on the Harr-like (local) features and Gabor (global) features. The multi-pose face detection algorithm can also be modified for facial expression recognition. The experimental results show that our face detection system and facial expression recognition system have better performance than the other classifiers. 相似文献

5.

2D representation of facial surfaces for multi-pose 3D face recognition

Yan-Ning ZhangZhe Guo Yong Xia Zeng-Gang LinDavid Dagan Feng 《Pattern recognition letters》2012,33(5):530-536

The increasing availability of 3D facial data offers the potential to overcome the intrinsic difficulties faced by conventional face recognition using 2D images. Instead of extending 2D recognition algorithms for 3D purpose, this letter proposes a novel strategy for 3D face recognition from the perspective of representing each 3D facial surface with a 2D attribute image and taking the advantage of the advances in 2D face recognition. In our approach, each 3D facial surface is mapped homeomorphically onto a 2D lattice, where the value at each site is an attribute that represents the local 3D geometrical or textural properties on the surface, therefore invariant to pose changes. This lattice is then interpolated to generate a 2D attribute image. 3D face recognition can be achieved by applying the traditional 2D face recognition techniques to obtained attribute images. In this study, we chose the pose invariant local mean curvature calculated at each vertex on the 3D facial surface to construct the 2D attribute image and adopted the eigenface algorithm for attribute image recognition. We compared our approach to state-of-the-art 3D face recognition algorithms in the FRGC (Version 2.0), GavabDB and NPU3D database. Our results show that the proposed approach has improved the robustness to head pose variation and can produce more accurate 3D multi-pose face recognition. 相似文献

6.

Incremental complete LDA for face recognition

Gui-Fu Lu Jian Zou Yong Wang 《Pattern recognition》2012,45(7):2510-2521

The complete linear discriminant analysis (CLDA) algorithm has been proven to be an effective tool for face recognition. The CLDA method can make full use of the discriminant information of the training samples. However, the original implementation of CLDA may not suitable for incremental learning problem. In this paper, we first propose a new implementation of CLDA, which is theoretically equivalent to the original implementation of CLDA but is more efficient than the original one. Then, based on our proposed novel implementation of CLDA, we propose the incremental CLDA method which can accurately update the discriminant vectors of CLDA when new samples are inserted into the training set. Experiments on ORL, AR and PIE face databases show the efficiency of our proposed CLDA algorithms over the original implementation of CLDA. 相似文献

7.

Bimode model for face recognition and face representation

Hui YanAuthor Vitae Jian YangAuthor VitaeJingyu YangAuthor Vitae 《Neurocomputing》2011,74(5):741-748

Tensorface based approaches decompose an image into its constituent factors (i.e., person, lighting, viewpoint, etc.), and then utilize these factor spaces for recognition. However, tensorface is not a preferable choice, because of the complexity of its multimode. In addition, a single mode space, except the person-space, could not be used for recognition directly. From the viewpoint of practical application, we propose a bimode model for face recognition and face representation. This new model can be treated as a simplified model representation of tensorface. However, their respective algorithms for training are completely different, due to their different definitions of subspaces. Thanks to its simpler model form, the proposed model requires less iteration times in the process of training and testing. Moreover bimode model can be further applied to an image reconstruction and image synthesis via an example image. Comprehensive experiments on three face image databases (PEAL, YaleB frontal and Weizmann) validate the effectiveness of the proposed new model. 相似文献

8.

Discriminative sparse representation for face recognition

Zhihong Zhang Yuanheng Liang Lu Bai Edwin R. Hancock 《Multimedia Tools and Applications》2016,75(7):3973-3992

Recently Sparse Representation (or coding) based Classification (SRC) has gained great success in face recognition. In SRC, the testing image is expected to be best represented as a sparse linear combination of training images from the same class, and the representation fidelity is measured by the ?₂-norm or ?₁-norm of the coding residual. However, SRC emphasizes the sparsity too much and overlooks the spatial information during local feature encoding process which has been demonstrated to be critical in real-world face recognition problems. Besides, some work considers the spatial information but overlooks the different discriminative ability in different face regions. In this paper, we propose to weight spatial locations based on their discriminative abilities in sparse coding for robust face recognition. Specifically, we learn the weights at face locations according to the information entropy in each face region, so as to highlight locations in face images that are important for classification. Furthermore, in order to construct a robust weights to fully exploit structure information of each face region, we employed external data to learn the weights, which can cover all possible face image variants of different persons, so the robustness of obtained weights can be guaranteed. Finally, we consider the group structure of training images (i.e. those from the same subject) and added an ?_2,1-norm (group Lasso) constraint upon the formulation, which enforcing the sparsity at the group level. Extensive experiments on three benchmark face datasets demonstrate that our proposed method is much more robust and effective than baseline methods in dealing with face occlusion, corruption, lighting and expression changes, etc. 相似文献

9.

Deep representation learning for face hallucination

Lu Tao Wang Yu Xu Ruobo Liu Wei Fang Wenhua Zhang Yanduo 《Multimedia Tools and Applications》2022,81(5):6305-6330

Multimedia Tools and Applications - Recently, deep learning, as a novel emerging algorithm, offers an end-to-end effective paradigm for super-resolution. Various successful practices with the deep... 相似文献

10.

Incremental template updating for face recognition in home environments

Annalisa Franco Author Vitae Dario Maio Author Vitae Davide Maltoni Author Vitae 《Pattern recognition》2010,43(8):2891-2903

The extreme variability of faces in smart environment applications, due to continuous changes in terms of pose, illumination and subject appearance (hairstyle, make-up, etc.), requires the relevant mode of variations of the subject's faces to be encoded in the templates and to be continuously updated based on new inputs. This work proposes a new video-based template updating approach suitable for home environments where the image acquisition process is totally unconstrained but a large amount of face data is available for continuous learning. A small set of labeled images is initially used to create the templates and the updating is then totally unsupervised. Although the method is here presented in conjunction with a subspace-based face recognition approach, it can be easily adapted to deal with different kinds of face representations. A thorough performance evaluation is carried out to show the efficacy and reliability of the proposed technique. 相似文献

11.

Incremental face recognition for large-scale social network services

Kwontaeg Choi Kar-Ann Toh Hyeran Byun 《Pattern recognition》2012,45(8):2868-2883

Due to the rapid growth of social network services such as Facebook and Twitter, incorporation of face recognition in these large-scale web services is attracting much attention in both academia and industry. The major problem in such applications is to deal efficiently with the growing number of samples as well as local appearance variations caused by diverse environments for the millions of users over time. In this paper, we focus on developing an incremental face recognition method for Twitter application. Particularly, a data-independent feature extraction method is proposed via binarization of a Gabor filter. Subsequently, the dimension of our Gabor representation is reduced considering various orientations at different grid positions. Finally, an incremental neural network is applied to learn the reduced Gabor features. We apply our method to a novel application which notifies new photograph uploading to related users without having their ID being identified. Our extensive experiments show that the proposed algorithm significantly outperforms several incremental face recognition methods with a dramatic reduction in computational speed. This shows the suitability of the proposed method for a large-scale web service with millions of users. 相似文献

12.

Incremental linear discriminant analysis for face recognition. 总被引：3，自引：0，他引：3

Haitao Zhao Pong Chi Yuen 《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》2008,38(1):210-221

Dimensionality reduction methods have been successfully employed for face recognition. Among the various dimensionality reduction algorithms, linear (Fisher) discriminant analysis (LDA) is one of the popular supervised dimensionality reduction methods, and many LDA-based face recognition algorithms/systems have been reported in the last decade. However, the LDA-based face recognition systems suffer from the scalability problem. To overcome this limitation, an incremental approach is a natural solution. The main difficulty in developing the incremental LDA (ILDA) is to handle the inverse of the within-class scatter matrix. In this paper, based on the generalized singular value decomposition LDA (LDA/GSVD), we develop a new ILDA algorithm called GSVD-ILDA. Different from the existing techniques in which the new projection matrix is found in a restricted subspace, the proposed GSVD-ILDA determines the projection matrix in full space. Extensive experiments are performed to compare the proposed GSVD-ILDA with the LDA/GSVD as well as the existing ILDA methods using the face recognition technology face database and the Carneggie Mellon University Pose, Illumination, and Expression face database. Experimental results show that the proposed GSVD-ILDA algorithm gives the same performance as the LDA/GSVD with much smaller computational complexity. The experimental results also show that the proposed GSVD-ILDA gives better classification performance than the other recently proposed ILDA algorithms. 相似文献

13.

Bidirectional representation for face recognition across pose

Jinrong Cui 《Neural computing & applications》2013,23(5):1437-1442

Conventional representation methods try to express the test sample as a weighting sum of training samples and exploit the deviation between the test sample and the weighting sum of the training samples from each class (also referred to as deviation between the test sample and each class) to classify the test sample. In particular, the methods assign the test sample to the class that has the smallest deviation among all the classes. This paper analyzes the relationship between face images under different poses and, for the first time, devises a bidirectional representation method-based pattern classification (BRBPC) method for face recognition across pose. BRBPC includes the following three steps: the first step uses the procedure of conventional representation methods to express the test sample and calculates the deviation between the test sample and each class. The second step first expresses the training sample of a class as a weighting sum of the test sample and the training samples from all the other classes and then obtains the corresponding deviation (referred to as complementary deviation). The third step uses the score-level fusion to integrate the scores, that is, deviations generated from the first and second steps for final classification. The experimental results show that BRBPC classifies more accurately than conventional representation methods. 相似文献

14.

Multi-resolution dictionary collaborative representation for face recognition

Liu Zhen Wu Xiao-Jun Shu Zhenqiu 《Pattern Analysis & Applications》2021,24(4):1793-1803

Pattern Analysis and Applications - In this paper, a multi-resolution dictionary collaborative representation(MRDCR) method for face recognition is proposed. Unlike most of the traditional sparse... 相似文献

15.

Local sparse representation projections for face recognition

Zhihui Lai Yajing Li Minghua Wan Zhong Jin 《Neural computing & applications》2013,23(7-8):2231-2239

How to define the sparse affinity weight matrices is still an open problem in existing manifold learning algorithm. In this paper, we propose a novel supervised learning method called local sparse representation projections (LSRP) for linear dimensionality reduction. Differing from sparsity preserving projections (SPP) and the recent manifold learning methods such as locality preserving projections (LPP), LSRP introduces the local sparse representation information into the objective function. Although there are no labels used in the local sparse representation, it still can provide better measure coefficients and significant discriminant abilities. By combining the local interclass neighborhood relationships and sparse representation information, LSRP aims to preserve the local sparse reconstructive relationships of the data and simultaneously maximize the interclass separability. Comprehensive comparison and extensive experiments show that LSRP achieves higher recognition rates than principle component analysis, linear discriminant analysis and the state-of-the-art techniques such as LPP, SPP and maximum variance projections. 相似文献

16.

Deep metric learning for open-set human action recognition in videos

Gutoski Matheus Lazzaretti André Eugênio Lopes Heitor Silvério 《Neural computing & applications》2021,33(4):1207-1220

Neural Computing and Applications - Human action recognition (HAR) is a topic widely studied in computer vision and pattern recognition. Despite the success of recent models for this issue, most of... 相似文献

17.

Texture-independent recognition of facial expressions in image snapshots and videos

Bogdan Raducanu Fadi Dornaika 《Machine Vision and Applications》2013,24(4):811-820

This paper addresses the static and dynamic recognition of basic facial expressions. It has two main contributions. First, we introduce a view- and texture-independent scheme that exploits facial action parameters estimated by an appearance-based 3D face tracker. We represent the learned facial actions associated with different facial expressions by time series. Second, we compare this dynamic scheme with a static one based on analyzing individual snapshots and show that the former performs better than the latter. We provide evaluations of performance using three subspace learning techniques: linear discriminant analysis, non-parametric discriminant analysis and support vector machines. 相似文献

18.

A Kernel-based sparse representation method for face recognition

Ningbo Zhu Shengtao Li 《Neural computing & applications》2014,24(3-4):845-852

Sparse Representation Method has been proved to outperform conventional face recognition (FR) methods and is widely applied in recent years. A novel Kernel-based Sparse Representation Method (KBSRM) is proposed in this paper. In order to cope with the possible complex variation of the face images caused by varying facial expression and pose, the KBSRM first uses a kernel-induced distance to determine N nearest neighbors of the testing sample from all the training samples. Then, in the second step, the KBSRM represents the testing sample as a linear combination of the determinate N nearest neighbors and performs the classification by the representation result. It can be inferred that the N nearest training samples selected are closer to the test sample than the rest, so using the N nearest neighbors to represent the testing sample can make the ultimate classification more accurate. A number of FR experiments show that the KBSRM can achieve a better classification result than the algorithm mentioned in Xu et al. (Neural Comput Appl doi:10.1007/s00521-012-0833-5). 相似文献

19.

Supervised neighborhood regularized collaborative representation for face recognition

Hongmei Chi Haifeng Xia Xin Tang Yinghao Zhang Xiaofen Xia 《Multimedia Tools and Applications》2018,77(22):29509-29529

How to represent a test sample is very crucial for linear representation based classification. The famous sparse representation focuses on employing linear combination of small samples to represent the query sample. However, the local structure and label information of data are neglected. Recently, locality-constrained collaborative representation (LCCR) has been proposed and integrates a kind of locality-constrained term into the collaborative representation scheme. For each test sample, LCCR mainly considers its neighbors to deal with noise and LCCR is robust to various corruptions. However, the nearby samples may not belong to the same class. To deal with this situation, in this paper, we not only utilize the positive effect of neighbors, but also consider the side effect of neighbors. A novel supervised neighborhood regularized collaborative representation (SNRCR) is proposed, which employs the local structure of data and the label information of neighbors to improve the discriminative capability of the coding vector. The objective function of SNRCR obtains the global optimal solution. Many experiments are conducted over six face data sets and the results show that SNRCR outperforms other algorithms in most case, especially when the size of training data is relatively small. We also analyze the differences between SNRCR and LCCR. 相似文献

20.

Learning locality-constrained collaborative representation for robust face recognition

Xi Peng Lei Zhang Zhang Yi Kok Kiong Tan 《Pattern recognition》2014

The models of low-dimensional manifold and sparse representation are two well-known concise models that suggest that each data can be described by a few characteristics. Manifold learning is usually investigated for dimension reduction by preserving some expected local geometric structures from the original space into a low-dimensional one. The structures are generally determined by using pairwise distance, e.g., Euclidean distance. Alternatively, sparse representation denotes a data point as a linear combination of the points from the same subspace. In practical applications, however, the nearby points in terms of pairwise distance may not belong to the same subspace, and vice versa. Consequently, it is interesting and important to explore how to get a better representation by integrating these two models together. To this end, this paper proposes a novel coding algorithm, called Locality-Constrained Collaborative Representation (LCCR), which introduce a kind of local consistency into coding scheme to improve the discrimination of the representation. The locality term derives from a biologic observation that the similar inputs have similar codes. The objective function of LCCR has an analytical solution, and it does not involve local minima. The empirical studies based on several popular facial databases show that LCCR is promising in recognizing human faces with varying pose, expression and illumination, as well as various corruptions and occlusions. 相似文献