
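A Form Frame-Line Detection Algorithm Based on Directional Single-Connected Chain
Cite this article: ZHENG Ye-feng, LIU Chang-song, DING Xiao-qing, PAN Shi-yan. A Form Frame-Line Detection Algorithm Based on Directional Single-Connected Chain[J]. Journal of Software (软件学报), 2002, 13(4): 790-796.
Authors: ZHENG Ye-feng (郑冶枫), LIU Chang-song (刘长松), DING Xiao-qing (丁晓青), PAN Shi-yan (潘世言)
Affiliation: Department of Electronic Engineering, Tsinghua University, Beijing 100084
Funding: National Natural Science Foundation of China (69972024); National High-Tech Research and Development Program of China (863 Program) (863-306-ZT03-03-1)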
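Abstract: Form frame-line detection is the basis of form recognition. Existing form frame-line detection algorithms are either slow or not robust, and they do not make full use of the constraint information between form frame lines. This paper proposes a bottom-up form frame-line detection algorithm based on a newly defined image structure primitive, the "directional single-connected chain" (DSCC). In this algorithm, a DSCC is a sequence of black-pixel runs and serves as a well-suited vector primitive. Merging DSCCs under certain form frame-line constraints effectively removes pseudo frame lines and completes broken ones, which improves the robustness of the algorithm and allows form frame lines to be extracted accurately and quickly. By filtering out noise DSCCs and speeding up DSCC merging, the speed of the algorithm is increased by a factor of 3 to 10, which meets practical requirements. Experiments show that the algorithm offers speed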

Keywords: form recognition  image analysis  line detection  OCR (optical character recognition)  intelligent document processing
Article ID: 1000-9825/2002/13(04)0790-07
Received: 2000-05-11
Revised: 2000-05-11

A Form Frame-Line Detection Algorithm Based on Directional Single-Connected Chain
ZHENG Ye-feng, LIU Chang-song, DING Xiao-qing and PAN Shi-yan. A Form Frame-Line Detection Algorithm Based on Directional Single-Connected Chain[J]. Journal of Software, 2002, 13(4): 790-796.
Authors: ZHENG Ye-feng, LIU Chang-song, DING Xiao-qing and PAN Shi-yan
Abstract: Existing form frame-line detection algorithms are either time-consuming or not robust. Furthermore, none of them makes full use of the constraint information between form frame lines. This paper proposes a novel bottom-up form frame-line detection algorithm based on the directional single-connected chain (DSCC). Defined as an array of black-pixel run-lengths, the DSCC works well as an image structure element, or vector, in this vectorization algorithm. By merging multiple DSCCs under some constraints, form frame lines can be extracted automatically yet quickly. With the help of the constraints between form frame lines, the robustness of the approach is increased drastically by removing pseudo lines and completing broken lines. By filtering out DSCCs created by noise and speeding up the merging of DSCCs, the speed of the algorithm becomes comparable with that of the well-known projection method. Experimental results show that the algorithm is fast and resistant to moderately serious line breaks and to skew of any angle.
Keywords: form recognition  image analysis  line detection  optical character recognition (OCR)  intelligent document processing
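
The abstract describes a DSCC only as an array of black-pixel run-lengths, so the following Python sketch is an illustration under stated assumptions and not the authors' implementation. It assumes that a horizontal chain is built from one-pixel-wide vertical runs, that two runs in adjacent columns are connected when their row ranges overlap, and that a chain continues only while the connection is one-to-one on both sides. The function names `vertical_runs`, `overlaps`, and `horizontal_dsccs` are hypothetical. The constraint-based merging and noise filtering mentioned in the abstract are not shown.

```python
def vertical_runs(image):
    """Map each column x to its black-pixel runs as (y_start, y_end) tuples.

    `image` is a list of rows; a truthy value marks a black pixel.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    runs = {}
    for x in range(width):
        y = 0
        while y < height:
            if image[y][x]:
                start = y
                while y < height and image[y][x]:
                    y += 1
                runs.setdefault(x, []).append((start, y - 1))
            else:
                y += 1
    return runs


def overlaps(a, b):
    """True if two (y_start, y_end) runs share at least one row (assumed connectivity rule)."""
    return a[0] <= b[1] and b[0] <= a[1]


def horizontal_dsccs(image):
    """Group vertical runs into chains that are single-connected from left to right.

    Assumption: a run is linked to the next column only if it touches exactly
    one run there and that run touches exactly one run in the current column.
    """
    runs = vertical_runs(image)
    width = len(image[0]) if image else 0
    chain_of = {}  # (x, run) -> chain that already contains this run
    chains = []
    for x in range(width):
        for run in runs.get(x, []):
            chain = chain_of.get((x, run))
            if chain is None:  # no one-to-one link from the left: start a new chain
                chain = [(x, run)]
                chains.append(chain)
            right = [r for r in runs.get(x + 1, []) if overlaps(run, r)]
            if len(right) == 1:
                left = [r for r in runs.get(x, []) if overlaps(r, right[0])]
                if len(left) == 1:
                    chain.append((x + 1, right[0]))
                    chain_of[(x + 1, right[0])] = chain
    return chains


if __name__ == "__main__":
    line = [
        [0, 0, 0, 0, 0],
        [1, 1, 1, 1, 1],
        [0, 0, 0, 0, 0],
    ]
    for chain in horizontal_dsccs(line):
        print(len(chain), chain[0], chain[-1])
```

In this sketch a clean horizontal stroke yields one long chain, while a gap or a branch ends the chain, so long chains are candidate frame-line segments and very short ones are candidates for noise filtering.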