Text detection in chart images
Authors: N. Vassilieva, Y. Fomina
Affiliation: 1. HP Labs, 1 Artillerijskaya str., St. Petersburg, 191104, Russia
2. Studio Mobile, 18A Bolshoy Prospekt, St. Petersburg, 197198, Russia
Abstract: Common OCR (Optical Character Recognition) systems fail to detect and recognize short text strings of only a few characters, particularly when the text line is not horizontal. Such text regions are typical of chart images. In this paper we present an algorithm that detects small text regions regardless of string orientation, font size, or font style. We propose using this algorithm as a preprocessing step for text recognition with a common OCR engine. In our experiments, using the proposed algorithm to detect text location, size, and orientation before running an OCR system yields up to 20 times better text recognition rate and 15 times higher text recognition precision. The experiments were performed on a benchmark set of 1000 chart images created with the XML/SWF Chart tool, which together contain about 14000 text regions.
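The abstract reports two separate measures, recognition rate and recognition precision, without defining them. The sketch below shows one plausible way to compute both for a single chart image. The definitions are assumptions for illustration, not taken from the paper: rate is treated as the share of ground-truth strings that were recognized correctly, and precision as the share of reported strings that are correct. The function name `recognition_scores` and the sample strings are hypothetical.

```python
from collections import Counter


def recognition_scores(ground_truth, recognized):
    """Return (rate, precision) for one image.

    ground_truth: list of strings actually present in the chart.
    recognized:   list of strings reported by the OCR engine.

    Assumed definitions (not taken from the paper):
      rate      = correct matches / number of ground-truth strings
      precision = correct matches / number of reported strings
    """
    truth = Counter(ground_truth)
    found = Counter(recognized)
    # Multiset intersection: each ground-truth string can be matched only once.
    correct = sum((truth & found).values())
    rate = correct / len(ground_truth) if ground_truth else 0.0
    precision = correct / len(recognized) if recognized else 0.0
    return rate, precision


# Hypothetical example: four labels in the chart, three strings reported by OCR.
truth = ["Sales", "2009", "Q1", "Q2"]
ocr_out = ["Sales", "Q1", "Q7"]
rate, precision = recognition_scores(truth, ocr_out)
```

Under these assumed definitions, a detector that finds more text regions mainly raises the rate, while one that suppresses false regions mainly raises the precision, which is why the two figures can improve by different factors.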
This article is indexed in SpringerLink and other databases.