Bidirectional extraction and recognition of scene text with layout consistency
Authors: Ryota Hinami, Xinhao Liu, Naoki Chiba, Shin’ichi Satoh
Affiliation: 1. The University of Tokyo, Tokyo, Japan; 2. Tokyo Institute of Technology, Tokyo, Japan; 3. Rakuten, Inc., Tokyo, Japan; 4. National Institute of Informatics, Tokyo, Japan
Abstract: Text recognition in natural scene images is a challenging task that has recently attracted increasing research attention. In this paper, we propose a method for recognizing text that exploits the layout consistency of a text string. We estimate the layout (the four lines of a text string) from the results of an initial character extraction and recognition pass. On the basis of layout consistency across a word, we then perform character extraction and recognition again using the four lines, which is more accurate than the first pass. Our layout estimation method differs from previous methods in that it exploits character recognition results and uses a class-conditional layout model. This yields more accurate and robust layout estimation, which can in turn be used to refine character extraction and recognition. We call this two-way process (from extraction and recognition to layout, and from layout to extraction and recognition) “bidirectional” to distinguish it from previous feedback refinement approaches. Experimental results demonstrate that the bidirectional process improves word recognition performance.
Keywords:
This article is indexed in SpringerLink and other databases.