首页 | 本学科首页   官方微博 | 高级检索  
     


A simple and effective table detection system from document images
Authors:S Mandal  S P Chowdhury  A K Das  Bhabatosh Chanda
Affiliation:(1) CST Department, Bengal Engineering College (DU), Sibpur, Howrah;(2) ECS Unit, Indian Statistical Unit, Calcutta, 700 035, India
Abstract:The requirement of detection and identification of tables from document images is crucial to any document image analysis and digital library system. In this paper we report a very simple but extremely powerful approach to detect tables present in document pages. The algorithm relies on the observation that the tables have distinct columns which implies that gaps between the fields are substantially larger than the gaps between the words in text lines. This deceptively simple observation has led to the design of a simple but powerful table detection system with low computation cost. Moreover, mathematical foundation of the approach is also established including formation of a regular expression for ease of implementation.
Keywords:Table detection  Document image segmentation  Digital document library
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号