首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
赵飞  谢里阳  李佳 《计算机应用》2011,31(6):1631-1633
针对由照相机扫描仪等文档获取设备拍摄的文档图像可能存在倾斜,进而导致光学字符识别(OCR)软件不能正确识别的情况,采用了一种以文档图像投影栅缝宽为目标函数的优化方法,栅缝宽最大值对应的投影角度的相反数即为文档图像的倾斜角。利用栅线宽函数扩大了检测范围,并提高了检测速度;利用反投影法和均布列预投影等方法,减少了计算量;利用二分法提高了算法的检测精度。通过一些包含少量插图的文档图像的倾斜角检测实验,验证了该方法的有效性。  相似文献   

2.
计算机光学乐谱识别技术是将传统的纸质型乐谱转化为计算机能够“读懂”的数字音乐,在计算机音乐领域中具有重要的应用价值、乐谱识别系统的输入是乐谱扫描图像,而扫描过程中出现的图像倾斜现象,会给识别过程中的谱线定位和谱段切割带来诸多困难,必须对图像作有效的倾斜校正以保证系统的性能。为此,提出了一种快速的乐谱图像倾角检测方法。该方法首先利用乐谱文档的自身结构特点,对图像进行预处理,滤除乐谱图像中不具备方向性的干扰像素,然后通过多组图像水平投影队列间的交叉相关性计算对倾角进行检测。其特点是在确保检测倾角精度的同时具有非常高的执行效率。实验结果表明这一方法是有效、实用的。  相似文献   

3.
为解决朝鲜语古籍数字化中朝汉文种混排字符切分困难的问题,提出一种朝鲜语古籍图像的文字切分算法。针对古籍列与列之间存在不连续间隔线、倾斜或者粘连等问题,提出一种基于连通域投影的列切分方法。利用连通域的删除、合并、拆分等操作对文字进行切分。使用一种多步切分法完成了具有文字大小不一,横向、纵向混合排版特点图像的字符切分工作。对于粘连字,采用改进的滴水算法进行有效切分。实验结果表明所提出的算法能够很好地完成朝、汉文种混排,文字大小不一,排版情况复杂的朝鲜语古籍图像的文字切分工作。该算法的列切分准确率为97.69%,字切分准确率为87.79%。  相似文献   

4.
The digitalization processes of documents produce frequently images with small rotation angles. The skew angles in document images degrade the performance of optical character recognition (OCR) tools. Therefore, skew detection of document images plays an important role in automatic document analysis systems. In this paper, we propose a Rectangular Active Contour Model (RAC Model) for content region detection and skew angle calculation by imposing a rectangular shape constraint on the zero-level set in Chan–Vese Model (C-V Model) according to the rectangular feature of content regions in document images. Our algorithm differs from other skew detection methods in that it does not rely on local image features. Instead, it uses global image features and shape constraint to obtain a strong robustness in detecting skew angles of document images. We experimented on different types of document images. Comparing the results with other skew detection algorithms, our algorithm is more accurate in detecting the skews of the complex document images with different fonts, tables, illustrations, and layouts. We do not need to pre-process the original image, even if it is noisy, and at the same time the rectangular content region of a document image is also detected.  相似文献   

5.
基于视窗的OCR页面图像倾斜检测方法   总被引:2,自引:0,他引:2       下载免费PDF全文
文档在扫描输入过程中,所生成的页面图像一般都存在一定的角度倾斜,当页面图像倾斜角度过大时,将对进一步的版面分析以及字符识别产生不良影响。为了快速准确地检测页面图像倾斜角度和降低计算量,提出了一种基于视窗变换的页面图像倾斜检测方法,该算法首先对视窗中的文字及图片的细节部分进行模糊,然后对其边沿进行直线拟合,以便快速检测页面图像倾斜角度。实验结果表明,该方法能快速准确地检测出各类页面图像的倾斜角度,并具有良好的适应性。  相似文献   

6.
A new method is proposed to solve the document identification and skew detection problem. It can be applied to a widely used subclass of documents which resemble in style an application form. Unlike other approaches, we make no assumptions about the nature and/or style of the printed form. An attempt is made to solve the problem in the most general sense. The method presented here does not rely on any special features such as patterns of line crossings, or dominant lines, or even special symbols found only on specially designed forms. The Power Spectral Density of the horizontal projection profile of the form is used as a shift invariant feature vector. The Karhunen-Loeve transform is employed to de-correlate and reduce the length of the feature vectors in the training set. Training is done in such a way that no rotations of the unknown form are necessary during recognition. The eigenvectors of the covariance matrix of the power spectral densities for the training set, along with learning vector quantization, were used for training, and the Euclidean distance, for recognition. A limitation related to the amount of skew that the system can handle is alleviated with the use of a known skew detection method.  相似文献   

7.
A Document Skew Detection Method Using the Hough Transform   总被引:4,自引:0,他引:4  
Document image processing has become an increasingly important technology in the automation of office documentation tasks. Automatic document scanners such as text readers and OCR (Optical Character Recognition) systems are an essential component of systems capable of those tasks. One of the problems in this field is that the document to be read is not always placed correctly on a flatbed scanner. This means that the document may be skewed on the scanner bed, resulting in a skewed image. This skew has a detrimental effect on document on document analysis, document understanding, and character segmentation and recognition. Consequently, detecting the skew of a document image and correcting it are important issues in realising a practical document reader. In this paper we describe a new algorithm for skew detection. We then compare the performance and results of this skew detection algorithm to other publidhed methods form O'Gorman, Hinds, Le, Baird, Posel and Akuyama. Finally, we discuss the theory of skew detection and the different apporaches taken to solve the problem of skew in documents. The skew correction algorithm we propose has been shown to be extremenly fast, with run times averaging under 0.25 CPU seconds to calculate the angle on the DEC 5000/20 workstation. Received: 21 November 1998, Received in revised form: 25 August 1999, Accepted: 20 October 1999  相似文献   

8.
In the digital world, a wide range of handwritten and printed documents should be converted to digital format using a variety of tools, including mobile phones and scanners. Unfortunately, this is not an optimal procedure, and the entire document image might be degraded. Imperfect conversion effects due to noise, motion blur, and skew distortion can lead to significant impact on the accuracy and effectiveness of document image segmentation and analysis in Optical Character Recognition (OCR) systems. In Document Image Analysis Systems (DIAS), skew estimation of images is a crucial step. In this paper, a novel, fast, and reliable skew detection algorithm based on the Radon Transform and Curve Length Fitness Function (CLF), so-called Radon CLF, was proposed. The Radon CLF model aims to take advantage of the properties of Radon spaces. The Radon CLF explores the dominating angle more effectively for a 1D signal than it does for a 2D input image due to an innovative fitness function formulation for a projected signal of the Radon space. Several significant performance indicators, including Mean Square Error (MSE), Mean Absolute Error (MAE), Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Measure (SSIM), Accuracy, and run-time, were taken into consideration when assessing the performance of our model. In addition, a new dataset named DSI5000 was constructed to assess the accuracy of the CLF model. Both two- dimensional image signal and the Radon space have been used in our simulations to compare the noise effect. Obtained results show that the proposed method is more effective than other approaches already in use, with an accuracy of roughly 99.87% and a run-time of 0.048 (s). The introduced model is far more accurate and time-efficient than current approaches in detecting image skew.  相似文献   

9.
基于改进Hough变换的文本图像倾斜校正方法   总被引:2,自引:0,他引:2  
文本图像在扫描输入时产生的倾斜现象会对后续的页面分割及光学字符识别(OCR)处理产生很大的影响,而传统的标准Hough变换虽然具有对噪声不敏感,不依赖于直线连续性的优点,但由于计算量偏大,速度慢,在实用时有较大的局限性。提出一种基于改进的Hough变换的文本图像倾斜校正方法,通过在变分辨率图像中采用不同的文本方向提取算法,及选择合理投票门限等改进Hough变换的措施,减小了由图像区域及文字笔画粗细所产生的对倾角判定的不利影响,并使用基于偏移值的方法实现页面倾斜的快速校正。实验结果表明,该算法实现了大范围高精度的文本图像倾角的快速检测,具有较强的实用性。  相似文献   

10.
In this paper, we strive towards the development of efficient techniques in order to segment document pages resulting from the digitization of historical machine-printed sources. This kind of documents often suffer from low quality and local skew, several degradations due to the old printing matrix quality or ink diffusion, and exhibit complex and dense layout. To face these problems, we introduce the following innovative aspects: (i) use of a novel Adaptive Run Length Smoothing Algorithm (ARLSA) in order to face the problem of complex and dense document layout, (ii) detection of noisy areas and punctuation marks that are usual in historical machine-printed documents, (iii) detection of possible obstacles formed from background areas in order to separate neighboring text columns or text lines, and (iv) use of skeleton segmentation paths in order to isolate possible connected characters. Comparative experiments using several historical machine-printed documents prove the efficiency of the proposed technique.  相似文献   

11.
Marginal noise is a common phenomenon in document analysis which results from the scanning of thick documents or skew documents. It usually appears in the front of a large and dark region around the margin of document images. Marginal noise might cover meaningful document objects, such as text, graphics and forms. The overlapping of marginal noise with meaningful objects makes it difficult to perform the task of segmentation and recognition of document objects. This paper proposes a novel approach to remove marginal noise. The proposed approach consists of two steps which are marginal noise detection and marginal noise deletion. Marginal noise detection will reduce an original document image into a smaller image, and then find marginal noise regions according to the shape length and location of the split blocks. After the detection of marginal noise regions, different removal methods are performed. A local thresholding method is proposed for the removal of marginal noise in gray-scale document images, whereas a region growing method is devised for binary document images. Experimenting with a wide variety of test samples reveals the feasibility and effectiveness of our proposed approach in removing marginal noises.  相似文献   

12.
目的 在光学字符识别中,文本图像经常会出现一定角度的倾斜.为将倾斜的文本图像校正,以便于字符识别中的后续处理,快速准确地检测倾斜文本图像的倾角是非常重要的.方法 对基于投影轮廓的算法进行改进,提出了一种两级投影直方图方差的算法(TPHV).首先在预定的角度范围内以一定角度步长对选定的图像区域做多方向投影,以获取投影直方图;然后计算各角度投影直方图的均方差,求出所有投影直方图方差的最大差分,将对应的投影角度作为倾角的粗略估值,最后以粗略估值为中心,以第1次投影步长为半径的角度范围内,再次以给定的检测精度为步长进行投影,重复第1次投影的工作,求出投影直方图方差的最大值,以对应的角度作为图像倾角的检测值.结果 该算法能够处理各种复杂的文本图像;对于诸如2 480×3 508像素的较大图像,可在200 ms左右的时间内完成倾角的检测;可检测的倾角范围不受限制;对相关网站提供的5组共500幅测试图像检测误差绝对值均值不超过0.5°,最大值不超过0.7°,检测误差的方差不超过0.1.结论 实验结果表明,该算法具有明显优势:速度快,倾斜角度检测精度高,误差集中,检测范围大,对噪声不敏感,具有广泛的适用性,适合于复杂的排版方式.  相似文献   

13.
基于纹理梯度的文档图像的倾斜校正方法   总被引:3,自引:0,他引:3  
文档图像的倾斜校正在光学字符识别以及文档理解系统研究中有着重要的意义,国内外学者提出了很多实现方法,但各种方法都存在一定的局限性.通过对基于Hough变换和投影的倾斜校正方法的分析,提出了一种基于文档图像纹理方向的倾斜校正方法:文档图像中的文本纹理整体表现出一定的方向性,使文本图像能保持水平,通过纹理方向性分析,找出纹理的主要方向,进而求得文档的倾斜角度.通过一个复杂版面的二值文档图像的检测校正实验表明,方法提高了倾斜校正的校正范围,而且具有较好的有效性和鲁棒性.  相似文献   

14.
一种基于Hough变换的文档图像倾斜纠正方法   总被引:10,自引:2,他引:8  
李政  杨扬  颉斌  王宏 《计算机应用》2005,25(3):583-585
在对文本扫描输入的过程中,文本图像不可避免地会发生倾斜,倾斜校正将为图文分割、文字识别等后续处理工作创造良好的条件。提出了一种基于Hough变换的检测图像倾斜度的方法,为了克服Hough变换计算量大的缺点,该方法首先选取局部代表性子区域并提取其图像水平边缘,然后对提取的水平边缘进行两级Hough变换,从而实现了准确性与快速性的很好结合。  相似文献   

15.
16.
提出一种新的维吾尔语文字识别研究方法。首先,建立字符样本库,并对库中文字图像归一化。然后,将测试图像与样本图像进行垂直和水平双方向投影相关性检测,对与测试图像双投影相关性较高的样本字符进行笔画数特征提取,得到预分类结果。最后,将测试图像与预分类结果进行SIFT关键点检测、方向描述子生成与配准,与测试图片匹配点对最多的预分类结果为识别结果,并输出该结果标记符号对应的维吾尔语字符。实验结果表明:该方法能减少字符样本的数量,并有效解决测试图像尺度与几何形变的差异造成的匹配困难问题。  相似文献   

17.
为了克服因人脸图像检测引起的配准不稳定性和小样本引起的维数灾难,由一副二维人脸图像通过上下左右平移生成4个图像,把生成的图像与原来的图像一起加入训练样本集,构成新的训练图像集。基于二维图像,结合图像局部结构信息,设计了准则函数,获得双投影矩阵,抽取人脸特征。对待识别人脸图像,由它的扰动图像设计识别方法。与传统的人脸识别方法相比,该方法的识别效果更好;Yale和ORL人脸数据库上的实验结果验证了该方法的有效性。  相似文献   

18.
基于投影的文档图像倾斜校正方法   总被引:5,自引:0,他引:5       下载免费PDF全文
针对文档图像的倾斜校正问题,提出了一种新的基于投影的文档图像倾斜角检测方法。首先采用一种高效的像素遍历算法对文档图像从不同角度进行投影,然后对投影数据进行累加求和,通过比较不同角度下的累加和来确定倾斜角度。该方法在投影过程中只需对文档图像进行极少部分投影,因而大大减少了运算量。基于该方法的特点,提出了由“粗”到“精”的投影策略,在确保检测精度的同时大幅提高了检测速度。实验结果表明,方法非常有效,可以获得很高的检测精度。  相似文献   

19.
We present here an enhanced algorithm (e-PCP) for skew detection in scanned documents, based on the work on Piecewise Covering by Parallelogram (PCP) for robust determination of skew angles [C.-H. Chou, S.-Y. Chu, F. Chang, Estimation of skew angles for scanned documents based on piecewise covering by parallelograms, Pattern Recognition 40 (2007) 443-455]. Our algorithm achieves even better robustness for detection of skew angle than the original PCP algorithm. We have shown accurate determination of skew angles in document images where the original PCP algorithm fails. Further, the increased robustness of performance is achieved with reduced number of computation compared to the originally proposed PCP algorithm. The e-PCP algorithm also outputs a confidence measure which is important in automated systems to filter cases where the estimated skew angle may not be very accurate and thus can be handled by manual intervention. The proposed algorithm was tested extensively on all categories of real time documents and comparisons with PCP method is also provided. Useful details regarding faster execution of the proposed algorithm is provided in Appendix.  相似文献   

20.
基于直线连续性的页面倾斜检测与校正   总被引:14,自引:0,他引:14  
在文档扫描过程中,输入的文档图像不可避免地会发生倾斜现象,而布局分析及字符识别算法对页面倾斜都十分敏感,因此倾斜检测和校正是文档分析预处理的重要环节,文中提出了一个基于直线连续性的倾斜检测方法。它将字符连通区包围盒底边中心点作为特征点,利用文本行中特征点与基线的关系,计算出基线的方向,即为页面倾斜方向,接着,介绍了一种基于偏移值的倾斜校正方法,实验证明,该算法速度快,准确度高。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号