首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The recognition of the Arabic characters is a crucial task in computer vision and Natural Language Processing fields. Some major complications in recognizing handwritten texts include distortion and pattern variabilities. So, the feature extraction process is a significant task in NLP models. If the features are automatically selected, it might result in the unavailability of adequate data for accurately forecasting the character classes. But, many features usually create difficulties due to high dimensionality issues. Against this background, the current study develops a Sailfish Optimizer with Deep Transfer Learning-Enabled Arabic Handwriting Character Recognition (SFODTL-AHCR) model. The projected SFODTL-AHCR model primarily focuses on identifying the handwritten Arabic characters in the input image. The proposed SFODTL-AHCR model pre-processes the input image by following the Histogram Equalization approach to attain this objective. The Inception with ResNet-v2 model examines the pre-processed image to produce the feature vectors. The Deep Wavelet Neural Network (DWNN) model is utilized to recognize the handwritten Arabic characters. At last, the SFO algorithm is utilized for fine-tuning the parameters involved in the DWNN model to attain better performance. The performance of the proposed SFODTL-AHCR model was validated using a series of images. Extensive comparative analyses were conducted. The proposed method achieved a maximum accuracy of 99.73%. The outcomes inferred the supremacy of the proposed SFODTL-AHCR model over other approaches.  相似文献   

2.
The paper discusses the segmentation of words into characters, which is an essential task in the development process of character recognition systems, as poorly segmented characters will automatically be unrecognized. The segmentation of offline handwritten Arabic text poses a greater challenge because of its cursive nature and different writing styles. In this article, we propose a new approach to segment handwritten Arabic characters using an efficient analysis of the vertical projection histogram. Our approach was tested using a set of handwritten Arabic words from the IFN/ENIT database, and promising results were obtained.  相似文献   

3.
4.
This paper presents a handwritten document recognition system based on the convolutional neural network technique. In today’s world, handwritten document recognition is rapidly attaining the attention of researchers due to its promising behavior as assisting technology for visually impaired users. This technology is also helpful for the automatic data entry system. In the proposed system prepared a dataset of English language handwritten character images. The proposed system has been trained for the large set of sample data and tested on the sample images of user-defined handwritten documents. In this research, multiple experiments get very worthy recognition results. The proposed system will first perform image pre-processing stages to prepare data for training using a convolutional neural network. After this processing, the input document is segmented using line, word and character segmentation. The proposed system get the accuracy during the character segmentation up to 86%. Then these segmented characters are sent to a convolutional neural network for their recognition. The recognition and segmentation technique proposed in this paper is providing the most acceptable accurate results on a given dataset. The proposed work approaches to the accuracy of the result during convolutional neural network training up to 93%, and for validation that accuracy slightly decreases with 90.42%.  相似文献   

5.
6.
N. Tripathy  U. Pal 《Sadhana》2006,31(6):755-769
Segmentation of handwritten text into lines, words and characters is one of the important steps in the handwritten text recognition process. In this paper we propose a water reservoir concept-based scheme for segmentation of unconstrained Oriya handwritten text into individual characters. Here, at first, the text image is segmented into lines, and the lines are then segmented into individual words. For line segmentation, the document is divided into vertical stripes. Analysing the heights of the water reservoirs obtained from different components of the document, the width of a stripe is calculated. Stripe-wise horizontal histograms are then computed and the relationship of the peak-valley points of the histograms is used for line segmentation. Based on vertical projection profiles and structural features of Oriya characters, text lines are segmented into words. For character segmentation, at first, the isolated and connected (touching) characters in a word are detected. Using structural, topological and water reservoir concept-based features, characters of the word that touch are then segmented. From experiments we have observed that the proposed “touching character” segmentation module has 96.7% accuracy for two-character touching strings.  相似文献   

7.
In recent years, Deep Learning models have become indispensable in several fields such as computer vision, automatic object recognition, and automatic natural language processing. The implementation of a robust and efficient handwritten text recognition system remains a challenge for the research community in this field, especially for the Arabic language, which, compared to other languages, has a dearth of published works. In this work, we presented an efficient and new system for offline Arabic handwritten text recognition. Our new approach is based on the combination of a Convolutional Neural Network (CNN) and a Bidirectional Long-Term Memory (BLSTM) followed by a Connectionist Temporal Classification layer (CTC). Moreover, during the training phase of the model, we introduce an algorithm of data augmentation to increase the quality of data. Our proposed approach can recognize Arabic handwritten texts without the need to segment the characters, thus overcoming several problems related to this point. To train and test (evaluate) our approach, we used two Arabic handwritten text recognition databases, which are IFN/ENIT and KHATT. The Experimental results show that our new approach, compared to other methods in the literature, gives better results.  相似文献   

8.
许秦蓉 《包装工程》2014,35(21):80-85
目的在脱机手写体文字识别系统中,由于自由书写的字符不可避免地受到图像背景不均匀、图像倾斜和字符粘连及大小不一等因素的影响,为了确保字符切分和识别的正确性,对EMS表单中手写体汉字字符图像预处理方法进行探讨,展示了EMS表单图像预处理的全过程。方法采用最小二乘法作拟合直线的方法,对目标图像进行定位和分割,用基于大津阈值的分块阈值算法处理目标图像的背景不均问题,并减少噪声干扰。结果该图像预处理方法在1020张真实EMS图像上进行测试,识别正确率达到了86.3%。结论该方法有一定的灵活性和抗干扰性,减少了图像噪声对汉字字符切分和识别的影响。  相似文献   

9.
10.
11.
12.
In the digestion of amino acids, carbohydrates, and lipids, as well as protein synthesis from the consumed food, the liver has many diverse responsibilities and functions that are to be performed. Liver disease may impact the hormonal and nutritional balance in the human body. The earlier diagnosis of such critical conditions may help to treat the patient effectively. A computationally efficient AW-HARIS algorithm is used in this paper to perform automated segmentation of CT scan images to identify abnormalities in the human liver. The proposed approach can recognize the abnormalities with better accuracy without training, unlike in supervisory procedures requiring considerable computational efforts for training. In the earlier stages, the CT images are pre-processed through an Adaptive Multiscale Data Condensation Kernel to normalize the underlying noise and enhance the image’s contrast for better segmentation. Then, the preliminary phase’s outcome is being fed as the input for the Anisotropic Weighted–-Heuristic Algorithm for Real-time Image Segmentation algorithm that uses texture-related information, which has resulted in precise outcome with acceptable computational latency when compared to that of its counterparts. It is observed that the proposed approach has outperformed in the majority of the cases with an accuracy of 78%. The smart diagnosis approach would help the medical staff accurately predict the abnormality and disease progression in earlier ailment stages.  相似文献   

13.
姜继春  王晓红  许秦蓉 《包装工程》2014,35(19):114-118
目的在不受光照条件的影响下,利用H-Cb混合颜色模型,提取快递单底单图像手写体文字信息。方法首先将图像从RGB颜色空间分别转换到HSI颜色空间和YCbCr颜色空间;然后将改进的YCbCr颜色空间的Cb颜色分量与HSI颜色空间的H颜色分量进行信息融合;最后对提取出的手写体文字信息进行阈值和反相处理,并将该算法提取结果与基于YCbCr颜色空间Cb颜色分量阈值分割方法和基于Lab颜色空间的手写文字聚类算法的提取结果,在分割效果、文字识别率上进行对比。结果利用H-Cb混合颜色模型检测出的手写体文字更准确,具有更高的识别率,在理想文字切分条件下识别率达96%。结论使用H-Cb混合颜色模型提取手写文字受光照条件影响小,提取出的图像噪声小、识别率高,算法简单可行,为彩色图像的检测与判定技术提供了支撑。  相似文献   

14.
基于 Lab 颜色空间的手写文字提取算法研究   总被引:3,自引:1,他引:2  
目的研究颜色空间聚类在彩色手写体文字提取方面的应用。方法分别在Lab,LUV,YCbCr颜色空间以及YIQ颜色空间下,进行手写体文字图像聚类效果的分析比较,并结合空间域滤波增强与边缘检测技术提取出所需要的手写体文字信息。结果所选择研究对象在Lab颜色空间下对手写体文字具有较好的提取效果,有利于后续的文字识别。结论颜色空间聚类方法能有效避免灰度转换造成颜色信息丢失而引起的误判,在保证原有阈值分割算法快速、简单的前提下,能够对彩色图像进行更为准确的分割。  相似文献   

15.
16.
17.
This paper proposes a novel three-dimensional convolution neural network-based modified bidirectional long short-term memory with pelican optimization (3D CNN based MBiLSTM with PO) algorithm for multiclass ovarian tumor detection. Initially, the International Collaboration on Cancer Reporting endometrial cancer dataset images are provided in pre-processing phase, which uses a pre-emphasis filter to process the input image. In the segmentation phase, pre-processed data is then partitioned into diverse subgroups (i.e., pixels), which minimizes the complexity of images. In this paper, a factorization-based active contour technique is employed in the effective segmentation of images. The segmented features are then extracted and classified using the 3D CNN-MBiLSTM with PO algorithm. Finally, the experimental results are conducted and compared with various other approaches for various performance metrics. Each metric is evaluated with respect to the different number of iterations. The accuracy, sensitivity, and specificity have reached a higher value of 98.5%, 96%, and 98.25%, respectively.  相似文献   

18.
基于 YCbCr 颜色空间的快递单手写文字分割   总被引:3,自引:2,他引:1  
目的在YCbCr颜色空间下,利用Cb颜色分量信息结合阈值分割方法,提取快递单图像手写体文字信息。方法首先将图像从RGB颜色空间转换到YCbCr颜色空间下,然后在Cb颜色分量图像下进行图像阈值分割处理操作,最后对提取出的手写体文字信息进行中值滤波去噪处理,并将该算法提取的结果与基于YCbCr颜色空间使用K均值聚类方法提取的结果在分割效果、分割时间与文字识别率上进行对比。结果利用Cb颜色分量提取出的手写体文字信息更清晰,具有更快的处理速度和更高的识别率,快递单图像平均处理时间为1.36 s,识别率为89%。结论单独利用Cb颜色分量信息提取手写文字就可得到较好的提取效果,算法简单、可行。  相似文献   

19.
20.
Lip-reading technologies are rapidly progressing following the breakthrough of deep learning. It plays a vital role in its many applications, such as: human-machine communication practices or security applications. In this paper, we propose to develop an effective lip-reading recognition model for Arabic visual speech recognition by implementing deep learning algorithms. The Arabic visual datasets that have been collected contains 2400 records of Arabic digits and 960 records of Arabic phrases from 24 native speakers. The primary purpose is to provide a high-performance model in terms of enhancing the preprocessing phase. Firstly, we extract keyframes from our dataset. Secondly, we produce a Concatenated Frame Images (CFIs) that represent the utterance sequence in one single image. Finally, the VGG-19 is employed for visual features extraction in our proposed model. We have examined different keyframes: 10, 15, and 20 for comparing two types of approaches in the proposed model: (1) the VGG-19 base model and (2) VGG-19 base model with batch normalization. The results show that the second approach achieves greater accuracy: 94% for digit recognition, 97% for phrase recognition, and 93% for digits and phrases recognition in the test dataset. Therefore, our proposed model is superior to models based on CFIs input.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号