首页 | 本学科首页   官方微博 | 高级检索  
     

文本页面图像的图文分割与分类算法
引用本文:王加俊,黄贤武,郭玮玮,仲兴荣.文本页面图像的图文分割与分类算法[J].中国图象图形学报,2004,9(5):571-577.
作者姓名:王加俊  黄贤武  郭玮玮  仲兴荣
作者单位:苏州大学电子信息学院 苏州215021 (王加俊,黄贤武,郭玮玮),苏州大学电子信息学院 苏州215021(仲兴荣)
基金项目:江苏省教育厅自然科学基金项目 ( L 0 112 41992 5 ),江苏省自然科学基金项目 ( BK2 0 0 113 7)
摘    要:为了能对包含不规则图片区和表格的倾斜文本页面图像进行图文分割与分类,提出了一种新的图文分割和分类算法。该算法先采用数学形态学和分级霍夫变换来进行文本倾斜的检测和校正;然后为了使算法能够对包含不规则图片区的文本页面图像进行处理,提出在传统的投影轮廓切割算法中,引入中点切割的过程,以便利用一系列的矩形来近似地逼近不规则的图片区。对于分割后的图像,则提出利用黑白像素比(Rbw)和近邻像素间的交叉相关性(Rcc)两个特征来作为分类的判据。实验结果证明,算法速度快、可靠性高。该算法只适用于二值图像。

关 键 词:文本图像  图文分割  分类算法  形态学  霍夫变换  二值图像  电子文件
文章编号:1006-8961(2004)05-0571-07

Page Segmentation and Classification Algorithm for Document Images
WANG Jia-jun,HUANG Xian-wu,Guo Wei-wei,ZHONG Xing-rong,WANG Jia-jun,HUANG Xian-wu,Guo Wei-wei,ZHONG Xing-rong,WANG Jia-jun,HUANG Xian-wu,Guo Wei-wei,ZHONG Xing-rong and WANG Jia-jun,HUANG Xian-wu,Guo Wei-wei,ZHONG Xing-rong.Page Segmentation and Classification Algorithm for Document Images[J].Journal of Image and Graphics,2004,9(5):571-577.
Authors:WANG Jia-jun  HUANG Xian-wu  Guo Wei-wei  ZHONG Xing-rong  WANG Jia-jun  HUANG Xian-wu  Guo Wei-wei  ZHONG Xing-rong  WANG Jia-jun  HUANG Xian-wu  Guo Wei-wei  ZHONG Xing-rong and WANG Jia-jun  HUANG Xian-wu  Guo Wei-wei  ZHONG Xing-rong
Abstract:In this paper, a system valid of the segmentation and classification of skewed document images with irregular graph regions and form regions is proposed. In this system, the skew angle of the document images is detected with a novel algorithm based on the morphological operation of Hit-or-Miss and the hierarchical Hough transform. The former(Hit-or-Miss operation) is for the detection of the baseline points while the latter(Hough transform) is for the detection of the skew angle of the baseline which is also of the page image. To make the system valid for the document images with irregular graph regions involved, we proposed to introduce a middle point cut process to the traditional projection profile cut algorithm so that the irregular graph regions can be approximated with a lot of small rectangles. The segmented regions are classified with two features of the black to white ratio and the cross correlation between adjacent pixels of the sub-blocks. Experimental results have proved the fastness and the reliability of the system proposed in this paper.
Keywords:document image  morphological operation  image segmentation  hough  transform
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中国图象图形学报》浏览原始摘要信息
点击此处可从《中国图象图形学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号