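Skew Correction Based on Page Foreground and the Least Squares Method
Cite this article: CHEN Bo, WANG Jia-jun, WU Chen. Skew Correction Based on Page Foreground and the Least Squares Method[J]. Computer Engineering, 2007, 33(15): 202-204.
Authors: CHEN Bo  WANG Jia-jun  WU Chen
Affiliations: School of Electronics and Information, Soochow University, Suzhou 215021 (CHEN Bo, WANG Jia-jun); School of Electronics and Information, Jiangsu University of Science and Technology, Zhenjiang 212003 (WU Chen)
Abstract: Because page layouts are complex, this paper proposes a skew correction method based on the page foreground and the least squares method. The method describes the foreground pixels of the page with specific patterns, uses coarse pattern classification to separate out any images, graphics, and tables on the page, and merges the remaining patterns to obtain the largest text pattern structure. It then fits the baseline direction, which is the page skew direction, by least squares to the baseline feature points contained in that structure. Experiments show that the method is effective and fast, and that the pattern structures it produces can be reused for layout analysis.

Keywords: skew correction  pattern structure  baseline feature points  layout analysis
Article ID: 1000-3428(2007)15-0202-03
Revision date: 2006-08-10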

Document Image Skew Correction Based on Page Layout Foreground and Least Square Method
CHEN Bo, WANG Jia-jun, WU Chen. Document Image Skew Correction Based on Page Layout Foreground and Least Square Method[J]. Computer Engineering, 2007, 33(15): 202-204.
Authors: CHEN Bo  WANG Jia-jun  WU Chen
Affiliation: 1. School of Electronics and Information, Soochow University, Suzhou 215021; 2. School of Electronics and Information, Jiangsu University of Science and Technology, Zhenjiang 212003
Abstract: To handle the complexity of document page layouts, this paper proposes a skew correction method based on the page foreground and the least squares method. Foreground pixels are described by specific patterns. Halftones, graphics, and tables are excluded from the document image by coarse pattern classification. The remaining text patterns are merged to obtain the largest text pattern structure. The skew angle is then computed by a least-squares fit to the baseline feature points found in this largest pure-text pattern structure. Experimental results show that the proposed algorithm is fast and effective. A notable advantage of the method is that the patterns obtained during skew angle detection can be reused for further layout analysis.
Keywords: skew correction  pattern structure  characteristic dots on baseline  page layout analysis
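
The abstract's final step, fitting a line to baseline feature points by least squares and reading the skew angle from its slope, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `fit_baseline_angle`, the point format, and the synthetic test data are assumptions made for illustration, and the pattern classification and merging steps that would produce the points are not shown.

```python
import math


def fit_baseline_angle(points):
    """Fit y = a*x + b to (x, y) points by ordinary least squares.

    Returns the angle of the fitted line in degrees. The points stand in
    for baseline feature points; how they are extracted is not covered here.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Centered sums give the closed-form least-squares slope.
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    slope = sxy / sxx
    return math.degrees(math.atan(slope))


# Hypothetical data: points on a baseline tilted by 2 degrees.
true_slope = math.tan(math.radians(2.0))
points = [(float(x), 10.0 + true_slope * x) for x in range(0, 500, 25)]
angle = fit_baseline_angle(points)
```

In this sketch the page would then be rotated by the negative of `angle` to remove the skew. Ordinary least squares on y as a function of x is a reasonable choice only for small skew angles, where baselines are far from vertical.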
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.