首页 | 本学科首页   官方微博 | 高级检索  
     


Coupled snakelets for curled text-line segmentation from warped document images
Authors:Syed Saqib Bukhari  Faisal Shafait  Thomas M Breuel
Affiliation:1. Image Understanding and Pattern Recognition Research, Department of Computer Science Technical, University of Kaiserslautern, 67663, Kaiserslautern, Germany
2. Multimedia Analysis and Data Mining Competence Center, German Research Center for Artificial Intelligence (DFKI), 67663, Kaiserslautern, Germany
Abstract:Camera-captured, warped document images usually contain curled text-lines because of distortions caused by camera perspective view and page curl. Warped document images can be transformed into planar document images for improving optical character recognition accuracy and human readability using monocular dewarping techniques. Curled text-lines segmentation is a crucial initial step for most of the monocular dewarping techniques. Existing curled text-line segmentation approaches are sensitive to geometric and perspective distortions. In this paper, we introduce a novel curled text-line segmentation algorithm by adapting active contour (snake). Our algorithm performs text-line segmentation by estimating pairs of x-line and baseline. It estimates a local pair of x-line and baseline on each connected component by jointly tracing top and bottom points of neighboring connected components, and finally each group of overlapping pairs is considered as a segmented text-line. Our algorithm has achieved curled text-line segmentation accuracy of above 95% on the DFKI-I (CBDAR 2007 dewarping contest) dataset, which is significantly better than previously reported results on this dataset.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号