共查询到20条相似文献,搜索用时 15 毫秒
1.
Font recognition is useful for improving optical text recognition systems’ accuracy and time, and to restore the documents’ original formats. This paper addresses a need for Arabic font recognition research by introducing an Arabic font recognition database consisting of 40 fonts, 10 sizes (ranging from 8 to 24 points) and 4 styles (viz. normal, bold, italic, and bold–italic). The database is split into three sets (viz. training, validation, and testing). The database is freely available to researchers.1 Moreover, we introduce a baseline font recognition system for benchmarking purposes, and report identification rates on our KAFD database and the Arabic Printed Text Image (APTI) database with 20 and 10 fonts, respectively. The best recognition rates are achieved using log-Gabor filters. 相似文献
2.
3.
Katerin Romeo-Pakker Abderrahim Ameur Christian Olivier Yves Lecourtier 《Machine Vision and Applications》1995,8(4):232-240
In this paper, a structural method of recognising Arabic handwritten characters is proposed. The major problem in cursive text recognition is the segmentation into characters or into representative strokes. When we segment the cursive portions of words, we take into account the contextual properties of the Arabic grammar and the junction segments connecting the characters to each other along the writing line. The problem of overlapping characters is resolved with a contour-following algorithm associated with the labelling of the detected contours. In the recognition phase, the characters are gathered into ten families of candidate characters with similar shapes. Then a heterarchical analysis follows that checks the pattern via goal-directed feedback control. 相似文献
4.
5.
6.
7.
N. Mezghani A. Mitiche M. Cheriet 《International Journal on Document Analysis and Recognition》2005,7(4):201-210
The purpose of this study is to investigate a new representation of shape and its use in handwritten online character recognition by a Kohonen associative memory. This representation is based on the empirical distribution of features such as tangents and tangent differences at regularly spaced points along the character signal. Recognition is carried out by a Kohonen neural network trained using the representation. In addition to the Euclidean distance traditionally used in the Kohonen training algorithm to measure the similarities among feature vectors, we also investigate the Kullback–Leibler divergence and the Hellinger distance, functions that measure distance between distributions. Furthermore, we perform operations (pruning and filtering) on the trained memory to improve its classification potency. We report on extensive experiments using a database of online Arabic characters produced without constraints by a large number of writers. Comparative results show the pertinence of the representation and the superior performance of the scheme. 相似文献
8.
9.
10.
11.
提出一种新的维吾尔语文字识别研究方法。首先,建立字符样本库,并对库中文字图像归一化。然后,将测试图像与样本图像进行垂直和水平双方向投影相关性检测,对与测试图像双投影相关性较高的样本字符进行笔画数特征提取,得到预分类结果。最后,将测试图像与预分类结果进行SIFT关键点检测、方向描述子生成与配准,与测试图片匹配点对最多的预分类结果为识别结果,并输出该结果标记符号对应的维吾尔语字符。实验结果表明:该方法能减少字符样本的数量,并有效解决测试图像尺度与几何形变的差异造成的匹配困难问题。 相似文献
12.
This paper investigates the unique pharyngeal and uvular consonants of Arabic from the point of view of automatic speech recognition (ASR). Comparisons of the recognition error rates for these phonemes are analyzed in five experiments that involve different combinations of native and non-native Arabic speakers. The most three confusing consonants for every investigated consonant are discussed. All experiments use the Hidden Markov Model Toolkit (HTK) and the Language Data Consortium (LDC) WestPoint Modern Standard Arabic (MSA) database. Results confirm that these Arabic distinct consonants are a major source of difficulty for Arabic ASR. While the recognition rate for certain of these unique consonants such as // can drop below 35% when uttered by non-native speakers, there is advantage to include non-native speakers in ASR. Besides, regional differences in pronunciation of MSA by native Arabic speakers require the attention of Arabic ASR research. 相似文献
13.
为解决变光照下人脸识别的识别率低问题,提出一种最佳相关滤波和2DPCA相融合的光照人脸识别方法。通过采用特定类2DPCA重构人脸图像,生成一对相关滤波器;测试人脸图像通过相关性滤器将投影到二维子空间中,并根据预先设定的峰旁瓣比阈值进行人脸识别;最后采用PIE和YaleB人脸库进行仿真实验。相比其他人脸识别方法,该方法获得了更高的人脸识别率,鲁棒性更强。 相似文献
14.
Tusar Kanti MISHRA Banshidhar MAJHI Pankaj K SA Sandeep PANDA 《Frontiers of Computer Science in China》2014,(6):916-922
In this paper, an efficient scheme for recognition of handwritten Odia numerals using hidden markov model (HMM) has been proposed. Three different feature vectors for each of the numeral is generated through a polygonal approximation of object contour. Subsequently, aggregated feature vector for each numeral is derived from these three primary feature vectors using a fuzzy inference system. The final feature vector is divided into three levels and interpreted as three different states for HMM. Ten different three-state ergodic hidden markov models (HMMs) are thus constructed corresponding to ten numeral classes and parameters are calculated from these models. For the recognition of a probe numeral, its log-likelihood against these models are computed to decide its class label. The proposed scheme is implemented on a dataset of 2500 handwritten samples and a recognition accuracy of 96.3% has been achieved. The scheme is compared with other competent schemes. 相似文献
15.
利用局部特征点方法进行视频中的动作识别研究,通过对尺度空间理论以及多种经典的局部特征点检测方法的深入分析,将SIFT算法引入到视频研究领域,提出了一种全新而高效的视频特征点检测算法。在此基础上,设计了合理的机器学习方法对局部特征点进行训练,利用训练出的通用动作模式在视频中进行动作识别。 相似文献
16.
基于结构特征的纸币号码识别方法 总被引:1,自引:0,他引:1
针对实际工程应用中对纸币号码识别在处理速度和识别率方面的高标准要求,提出了一种基于结构特征的纸币号码识别方法。该方法对采集到的图像先进行图像放大、二值化和字符分割等预处理操作,然后根据字符的结构特征,对字符进行分类、识别。该方法不需要图像去噪、倾斜校正和归一化等操作,节省大量时间。实验结果表明,该方法识别率可达98.13%,而识别时间仅为3.8 ms,显示出该方法具备识别率高和识别速度快的优势,能够满足实际需要,已经成功应用于清分机中。 相似文献
17.
Neural networks in the visual system may be performing sparse coding of learnt local features that are qualitatively very
similar to the receptive fields of simple cells in the primary visual cortex, V1. In conventional sparse coding, the data
are described as a combination of elementary features involving both additive and subtractive components. However, the fact
that features can ‘cancel each other out’ using subtraction is contrary to the intuitive notion of combining parts to form
a whole. Thus, it has recently been argued forcefully for completely non-negative representations. This paper presents Non-Negative
Sparse Coding (NNSC) applied to the learning of facial features for face recognition and a comparison is made with the other
part-based techniques, Non-negative Matrix Factorization (NMF) and Local-Non-negative Matrix Factorization (LNMF). The NNSC
approach has been tested on the Aleix–Robert (AR), the Face Recognition Technology (FERET), the Yale B, and the Cambridge
ORL databases, respectively. In doing so, we have compared and evaluated the proposed NNSC face recognition technique under
varying expressions, varying illumination, occlusion with sunglasses, occlusion with scarf, and varying pose. Tests were performed
with different distance metrics such as the L
1-metric, L
2-metric, and Normalized Cross-Correlation (NCC). All these experiments involved a large range of basis dimensions. In general,
NNSC was found to be the best approach of the three part-based methods, although it must be observed that the best distance
measure was not consistent for the different experiments. 相似文献
18.
针对现有人体行为识别技术存在的准确率不高和易受环境干扰等缺点,提出一种基于空时特征融合的人体行为识别方法。通过OpenPose提取人体骨骼关节的位置信息用于构造空时融合特征,该特征综合各类行为的空域和时域信息,使得特征表示更具区分度。利用核化主成分分析算法进行特征维度缩减,利用XGBoost算法进行特征分类,获得识别结果。该方法在Multiview Action 3D数据集上进行测试,得到了94.52%的识别率,较现有的其它许多人体行为识别方法表现更好。 相似文献
19.
Guangpeng ZhangYunhong Wang 《Pattern recognition letters》2011,32(7):1009-1019
A novel resolution invariant local feature based method is proposed for 3D face recognition. Scale space extrema on shape index images and texture images are detected and matched, through which resolution and noise insensitive face matching is achieved without complex preprocessing and normalization. An outlier removal strategy is designed to eliminate incorrect matching points while keeping relevant ones. Six different scale invariant similarity measures are proposed and fused at the score level, which increases the robustness against expression variations. Systematical experiments are conducted on the FRGC v2.0 database, achieving in the neutral vs. all experiment a verification rate of 90.7% with un-normalized similarity scores, and 96.3% with normalized similarity scores at False Acceptance Rate (FAR) of 0.1%, and 96.2% rank-1 identification rate, which are comparable to the state of the art, and promising considering the significantly reduced preprocessing requirement. 相似文献