A simple and effective table detection system from document images |
| |
Authors: | S. Mandal S. P. Chowdhury A. K. Das Bhabatosh Chanda |
| |
Affiliation: | (1) CST Department, Bengal Engineering College (DU), Sibpur, Howrah;(2) ECS Unit, Indian Statistical Unit, Calcutta, 700 035, India |
| |
Abstract: | The requirement of detection and identification of tables from document images is crucial to any document image analysis and digital library system. In this paper we report a very simple but extremely powerful approach to detect tables present in document pages. The algorithm relies on the observation that the tables have distinct columns which implies that gaps between the fields are substantially larger than the gaps between the words in text lines. This deceptively simple observation has led to the design of a simple but powerful table detection system with low computation cost. Moreover, mathematical foundation of the approach is also established including formation of a regular expression for ease of implementation. |
| |
Keywords: | Table detection Document image segmentation Digital document library |
本文献已被 SpringerLink 等数据库收录! |