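A Fast Correction Method for Warped Document Images Based on Header Lines
Cite this article: Zeng Fanfeng, Duan Yangbo. A Fast Correction Method for Warped Document Images Based on Header Lines[J]. 图学学报 (Journal of Graphics), 2016, 37(1): 79. DOI: 10.11996/JG.j.2095-302X.2016010079
Authors: Zeng Fanfeng (曾凡锋), Duan Yangbo (段漾波)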
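Abstract: When optical character recognition (OCR) is applied to document images, warping caused by the curvature of the book lowers the recognition rate. A fast correction method is proposed for warped document images that contain header and footer lines. First, the header lines in the image are detected and located, and their coordinates are saved. A proportional (equal-ratio) algorithm then computes how far each point on the header line must move up or down to be corrected. Next, the image is scanned with this distance as a parameter to compute how far each target pixel between the header and footer lines must move, and the pixels are moved during the same scan to reconstruct the image, which yields the corrected image. Experimental results show that the method works well on warped document images containing header and footer lines and that the OCR recognition rate rises substantially after correction.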

Keywords: computer application; warped document; header and footer line; proportional distance; image correction

A Correcting Method Based on Header and Footer Line for Warped Document Images
Zeng Fanfeng, Duan Yangbo. A Correcting Method Based on Header and Footer Line for Warped Document Images[J]. Journal of Graphics, 2016, 37(1): 79. DOI: 10.11996/JG.j.2095-302X.2016010079
Authors:Zeng Fanfeng  Duan Yangbo
Abstract: The recognition rate of OCR (optical character recognition) is low for warped document images. For warped document images with header and footer lines, this paper proposes a fast method to increase the OCR rate. First, the header line is detected in the document image and its location is stored. Then the distance that each point on the line must move upward or downward is calculated using a geometric (proportional) algorithm. After that, the image is scanned using this distance as a parameter, and the distance that every target pixel needs to move is calculated. At the same time, all pixels are moved to reconstruct the image, and a well-corrected image is obtained. Experiments demonstrate that this correction method is efficient and that the OCR rate of warped document images with header lines can be significantly improved.
Keywords: computer application; warped document; header and footer line; geometric distance; image correction
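
The abstract gives no formulas, so the following is only a minimal sketch of one plausible reading of the proportional correction step, not the authors' implementation. It assumes a 2-D grayscale image, assumes the header and footer line positions have already been detected for every column, assumes each line should be straightened to its mean row, and assumes the per-pixel shift is linearly interpolated between the header shift and the footer shift. The function name `dewarp_columns` and all parameter names are hypothetical.

```python
import numpy as np


def dewarp_columns(img, header_y, footer_y, background=255):
    """Straighten a warped page column by column (illustrative sketch).

    img       : 2-D grayscale array of shape (height, width)
    header_y  : detected header-line row for each column (length = width)
    footer_y  : detected footer-line row for each column (length = width)
    background: value used where no source pixel is available (assumed white)
    """
    img = np.asarray(img)
    h, w = img.shape
    header_y = np.asarray(header_y, dtype=float)
    footer_y = np.asarray(footer_y, dtype=float)

    # Assumption: each line is straightened to its (rounded) mean row.
    header_ref = float(np.rint(header_y.mean()))
    footer_ref = float(np.rint(footer_y.mean()))
    if footer_ref == header_ref:
        raise ValueError("header and footer lines must lie on different rows")

    # Vertical shift needed at the header and at the footer, per column.
    dh = header_ref - header_y
    df = footer_ref - footer_y

    # Relative position of every target row between the two straight lines,
    # clipped so rows outside the lines reuse the nearest line's shift.
    rows = np.arange(h, dtype=float)
    t = np.clip((rows - header_ref) / (footer_ref - header_ref), 0.0, 1.0)

    # Proportional shift for every target pixel, shape (height, width).
    shift = (1.0 - t)[:, None] * dh[None, :] + t[:, None] * df[None, :]

    # Inverse mapping: for each target pixel, find the source row to copy.
    src = np.rint(rows[:, None] - shift).astype(int)
    valid = (src >= 0) & (src < h)
    cols = np.broadcast_to(np.arange(w), (h, w))

    out = np.full_like(img, background)
    out[valid] = img[src[valid], cols[valid]]
    return out
```

The sketch maps target pixels back to source pixels rather than pushing source pixels forward, which avoids holes in the output; the abstract does not say which direction the original method uses.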
This article is indexed in CNKI and other databases.