首页 | 本学科首页   官方微博 | 高级检索  
     

一种表格框线检测和字线分离算法
引用本文:刘长松,潘世言,郑冶枫,丁晓青.一种表格框线检测和字线分离算法[J].电子与信息学报,2002,24(9):1190-1196.
作者姓名:刘长松  潘世言  郑冶枫  丁晓青
作者单位:清华大学电子工程系智能技术与系统国家重点实验室,北京,100084
基金项目:国家863计划,国家自然科学基金
摘    要:该文提出了一种基于有向单连通链的表格框线检测算法,能够合理地利用单连通链边沿的全局统计特性和单连通链之间的局部位置关系,精确地提取表格框线,具有抗倾斜,抗断裂,抗字线交叠等优点。在此基础上,提出了一种能够分离交叠字线的表格框线去除算法,并成功应用于实际的表格识别系统中。

关 键 词:表格识别    图像分析    直线检测    字符识别
收稿时间:2000-10-8
修稿时间:2000年10月8日

A frame line detection and removal algorithm for form document recognition
Liu Changsong,Pan Shiyan,Zheng Ycfeng,Ding Xiaoqing.A frame line detection and removal algorithm for form document recognition[J].Journal of Electronics & Information Technology,2002,24(9):1190-1196.
Authors:Liu Changsong  Pan Shiyan  Zheng Ycfeng  Ding Xiaoqing
Affiliation:State Key Lab. of Intell. Tech. & Sys., Dept. of EE Tsinghua Univ.,Beijing 100084 China
Abstract:A new frame line detection algorithm based on the structural image element-Directional Single-Connected Chain (DSCC) is proposed. Taking advantages of the global statistical property of the edges of the DSCCs, and their local mutual relations, the algorithm is able to accurately extract frame lines from scanned form images. It demonstrates the desired performance of insensitive to line slant, breaks as well as touches from character strokes inside the form cells. Based on this algorithm, a frame line removal approach is presented, by which the frame line can be removed without affecting the touched character strokes.
Keywords:Form recognition  linage analysis  Line detection  Character recognition  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号