首页 | 本学科首页   官方微博 | 高级检索  
     

局部高亮干扰文本图像的二值化方法研究
引用本文:孙洁娣,温江涛,李书茉,任瑞军.局部高亮干扰文本图像的二值化方法研究[J].光电工程,2012,39(11):75-80.
作者姓名:孙洁娣  温江涛  李书茉  任瑞军
作者单位:1. 燕山大学信息科学与工程学院,河北秦皇岛,066004
2. 燕山大学河北省测试计量技术及仪器重点实验室,河北秦皇岛,066004
3. 中国石油天然气管道通信电力工程总公司,河北廊坊,065000
基金项目:国家自然基金资助项目 (61102110)
摘    要:本文提出一种新的基于Curvelet变换的文本图像二值化处理方法,以消除文本图像中局部高亮度区域对二值化图像质量的影响.首先对具有局部高亮度区域干扰的原始文本图像进行Curvelet变换,得到图像在曲波域的Curvelet系数集;然后根据各Curvelet系数所表征的图像特征,对Curvelet系数进行非线性增强,以优化文本图像的直方图分布;对增强的Curvelet系数集进行反变换,得到直方图优化后的时域图像,进而应用Otsu方法实现文本图像二值化.应用本文方法对具有带状及点状局部高亮度区域的文本图像进行二值化处理,并采用ABBYYFineReaderl0对二值图像进行OCR识别.实验结果表明,通过本文提出的处理方法所得到的二值化图像,其字符的OCR识别准确率最高可达94.81%,优于其他四种典型的图像二值化处理方法.

关 键 词:文本图像二值化  局部高亮干扰  多尺度处理  Curvelet变换
收稿时间:2012/4/20

Binarization Method for the Document Images with Local Highlight Interference
SUN Jie-di , WEN Jiang-tao , LI Shu-mo , REN Rui-jun.Binarization Method for the Document Images with Local Highlight Interference[J].Opto-Electronic Engineering,2012,39(11):75-80.
Authors:SUN Jie-di  WEN Jiang-tao  LI Shu-mo  REN Rui-jun
Abstract:A novel binarization method for document images based on Curvelet transform is presented. The interference caused by local high lightness is eliminated to get a better image quality. Firstly, the Curvelet transformation is applied to the document images with local high lightness area, and the Curvelet coefficients can be got. Then, according to the feature of images represented by Curvelet coefficients, the Curvelet coefficients are enhanced nonlinearly to optimize the histogram distribution. Curvelet coefficients are transformed inversely to get the images, and then the Otsu method is applied to get the binary image. According to the binarized image, the OCR recognition results are got by the ABBYY FineReader10. Experimental results show that the highest recognition accuracy of characters could reach 94.81%. The performance of this method is better than the other four typical binarization methods.
Keywords:document image binarization  local highlight interference  multi-scale processing  Curvelet transform
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号