共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.
4.
汉字具有丰富的字体类型,并且不同的字体在汉字结构上有显著的不同,现在的OCR技术侧重字的识别,而对字体识别的关注较少。提出文字相关的单字符字体识别方法,利用文字相关的先验信息及字体结构特征,对字体的相似性度量采用向量空间模型,并针对常用66款简体字进行实验,得到了较好的平均识别率。 相似文献
5.
采用了一种基于SURF的人脸识别方法,其中所提取出的SURF特征向量对于图像的尺度与旋转变化均具有较好的适应性;通过采用LDA算法有效地缩短了运算时间;此外还采用Kmeans聚类方法对特征向量进行分组处理;最后通过综合图像的局部与全局特征信息便可完成对人脸的分类工作。实验结果证明,最终所获取的LDA-SURF特征向量对于一般图像中人脸的平移、亮度变化、遮挡和噪声等具有良好的不变性。 相似文献
6.
Fei Wang Author Vitae Jingdong Wang Author Vitae Author Vitae James Kwok Author Vitae 《Pattern recognition》2007,40(10):2786-2797
Face recognition is a challenging task in computer vision and pattern recognition. It is well-known that obtaining a low-dimensional feature representation with enhanced discriminatory power is of paramount importance to face recognition. Moreover, recent research has shown that the face images reside on a possibly nonlinear manifold. Thus, how to effectively exploit the hidden structure is a key problem that significantly affects the recognition results. In this paper, we propose a new unsupervised nonlinear feature extraction method called spectral feature analysis (SFA). The main advantages of SFA over traditional feature extraction methods are: (1) SFA does not suffer from the small-sample-size problem; (2) SFA can extract discriminatory information from the data, and we show that linear discriminant analysis can be subsumed under the SFA framework; (3) SFA can effectively discover the nonlinear structure hidden in the data. These appealing properties make SFA very suitable for face recognition tasks. Experimental results on three benchmark face databases illustrate the superiority of SFA over traditional methods. 相似文献
7.
Mirehi Narges Tahmasbi Maryam Targhi Alireza Tavakoli 《Multimedia Tools and Applications》2019,78(10):13361-13386
Multimedia Tools and Applications - Hand Gestures Recognition (HGR) is one of the main areas of research for Human Computer Interaction applications. Most existing approaches are based on local or... 相似文献
8.
Yang Jinhyeok Kim Heebeom Kwak Hyobin Kim Injung 《International Journal on Document Analysis and Recognition》2019,22(4):407-416
International Journal on Document Analysis and Recognition (IJDAR) - We propose a large-scale Hangul font recognizer that is capable of recognizing 3300 Hangul fonts. Large-scale Hangul font... 相似文献
9.
Ding X Chen L Wu T 《IEEE transactions on pattern analysis and machine intelligence》2007,29(2):195-204
A novel algorithm for font recognition on a single unknown Chinese character, independent of the identity of the character, is proposed in this paper. We employ a wavelet transform on the character image and extract wavelet features from the transformed image. After a Box-Cox transformation and LDA (linear discriminant analysis) process, the discriminating features for font recognition are extracted and classified through a MQDF (Modified quadric distance function) classifier with only one prototype for each font class. Our experiments show that our algorithm can achieve a recognition rate of 90.28 percent on a single unknown character and 99.01 percent if five characters are used for font recognition. Compared with existing methods, all of which are based on a text block, our method can provide a higher recognition rate and is more flexible and robust, since it is based on a single unknown character. Additionally, our method demonstrates that it is possible to extract subtle yet discriminative signals embedded in a much larger noisy background 相似文献
10.
Pattern Analysis and Applications - At present, deep learning has made great progress in the field of glyph modeling. However, existing methods of font generation have some problems, such as... 相似文献
11.
Godfrey C. Onwubolu 《Journal of Intelligent Manufacturing》1999,10(3-4):289-299
A backpropagation neural network (BPN) is applied to the problem of feature recognition from a boundary representation (B-rep) solid model to facilitate process planning of manufactured products. It is based on the use of the face complexity code to represent the features and a neural network for the analysis of the recognition. The face complexity code is a measure of the face complexity of a feature based on the convexity or concavity of the surrounding geometry. The codes for various features are fed to the network for analysis. A backpropagation network is implemented for recognition of features and tested on published results to measure its performance. Any two or more features having significant differences in face complexity codes were used as exemplars for training the network. A new feature presented to the network is associated with one of the existing clusters, if they are similar, or the network creates a new cluster, if otherwise. Experimental results show that the network was consistent in recognizing features, hence is appropriate for application to the problem of feature recognition in automated manufacturing environment. 相似文献
12.
利用改进的主成分分析(Principal Component Analysis,PCA)方法,通过研究不同的车辆特征(如全局特征、各种局部特征)对静态图像车辆识别效果的影响,提出了一种新的静态图像车辆识别算法。该算法可有效降低光照和背景噪声对识别的影响,实现对存在部分遮挡的车辆检测。实验结果表明,该算法具有良好的鲁棒性和车辆识别率。 相似文献
13.
目的 人体行为识别是计算机视觉领域的一个重要研究课题,具有广泛的应用前景.针对局部时空特征和全局时空特征在行为识别问题中的局限性,提出一种新颖、有效的人体行为中层时空特征.方法 该特征通过描述视频中时空兴趣点邻域内局部特征的结构化分布,增强时空兴趣点的行为鉴别能力,同时,避免对人体行为的全局描述,能够灵活地适应行为的类内变化.使用互信息度量中层时空特征与行为类别的相关性,将视频识别为与之具有最大互信息的行为类别.结果 实验结果表明,本文的中层时空特征在行为识别准确率上优于基于局部时空特征的方法和其他方法,在KTH数据集和日常生活行为(ADL)数据集上分别达到了96.3%和98.0%的识别准确率.结论 本文的中层时空特征通过利用局部特征的时空分布信息,显著增强了行为鉴别能力,能够有效地识别多种复杂人体行为. 相似文献
14.
15.
A new algorithm using invariant spectral features for segmenting colour images is presented in this paper. Input data are three primary images obtained from a colour sensor. The input colour image is transformed to IHS (Intensity, Hue, Saturation) colour space. This colour space transform compensates for illumination variations and delivers image pixel values with low variance for individual colour regions, hence contributing to simplified segmentation. The hue and saturation images are then separately filtered and combined. The resulting image is segmented by means of a threshold process. An opening operation on the segmented image completes the algorithm. Experimental results obtained for several images are presented. Issues related to illumination and sensors are also addressed. 相似文献
16.
This paper is concerned with recognition of hand-written and/or printed multifont alpha-numeric Bengali characters. It is assumed that characters are present in an isolated fashion. In the present work characters have been represented in terms of the primitives and structural constraints between the primitives imposed by the junctions present in the characters. The primitives have been characterized on the basis of the significant curvature events like curvature maxima, curvature minima and inflexion points observed along their extent. Curvature properties have been extracted after thinning the smoothed character images and filtering the thinned images using a Gaussian kernel. The unknown samples are classified using a two-stage feed forward neural net based recognition scheme. Experimental results have established the effectiveness of the technique 相似文献
17.
R. S. Fyath K. N. Darraj M. S. Alam M. N. Islam M. M. Alkhatib 《Optical Memory & Neural Networks》2007,16(3):125-135
A new joint transform correlation (JTC) technique, named two-channel JTC (TJTC), is proposed in this paper for optical pattern
recognition applications. The TJTC technique independently evaluates the autocorrelation and crosscorrelation values of the
reference and the target images and employs a modified decision algorithm. In addition, optical threshold operation and fringe-adjusted
filter are incorporated in the proposed technique to enhance the correlation output and to improve the discrimination performance.
The proposed technique shows better recognition performance compared to existing JTC techniques. Computer simulation are presented
to investigate the salient features of the proposed TJTC technique with noise-free as well as noisy input scenes.
The text was submitted by the authors in English. 相似文献
18.
M. Laadjel A. Bouridane O. Nibouche F. Kurugollu S. Al-Maadeed 《Journal of Real-Time Image Processing》2013,8(3):253-263
This paper presents a bimodal biometric recognition system based on the extracted features of the human palmprint and iris using a new graph-based approach termed Fisher locality preserving projections (FLPP). This new technique employs two graphs with the first being used to characterize the within-class compactness and the second dedicated to the augmentation of the between-class separability. By applying the FLPP, only the most discriminant and stable palmprint and iris features are retained. FLPP was implemented on the frequency domain by transforming the extracted region of interest extraction of both biometric modalities using Fourier transform. Subsequently, the palmprint and iris features vectors obtained are matched with their counterpart in the templates databases and the obtained scores are fused to produce a final decision. The proposed combination of palmprint and iris patterns has shown an excellent performance compared to unimodal palmprint biometric recognition. The system was evaluated on a database of 108 subjects and the experimental results show that our system performs very well and achieves a high accuracy expressed by an equal error rate of 0.00%. 相似文献
19.
基于MFCC和LPCC的说话人识别 总被引:8,自引:0,他引:8
MFCC参数和LPCC参数是说话人识别中两种最常用的特征参数,研究了MFCC和LPCC参数提取的算法原理及差分倒谱参数的提取方法,采用MFCC、LPCC及其一阶、二阶差分作为特征参数,通过k均值算法与三层BP神经网络来进行说话人识别.实验结果表明,该方法可以有效提高识别率,同时也验证MFCC参数的鲁棒性优于LPCC参数. 相似文献
20.
基于Gabor局部相对特征的掌纹识别 总被引:1,自引:0,他引:1
Gabor变换是掌纹识别中提取纹理特征的一个重要工具,但其性能易受图像的变化以及不均衡噪声等因素影响,因此提出了一种基于Gabor局部相对特征的掌纹识别算法。该算法对原始图像进行微尺度不变Gabor滤波;结合分形学的思想,将滤波后的图像分成大小相等的子域,每个子域又分成多个相同的子块,计算每个子块与它所在子域的相对方差,将所有子块的相对方差排列组成表征图像的特征向量进行识别。该算法将微尺度不变与局部相对特性统一,所提取的特征对各种变化有很强的鲁棒性,提高了识别精度和效率。实验使用北京交通大学BJTU_PalmprintDB证明该算法的有效性。 相似文献