Bidirectional extraction and recognition of scene text with layout consistency期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Bidirectional extraction and recognition of scene text with layout consistency

Authors:	Ryota Hinami Xinhao Liu Naoki Chiba Shin’ichi Satoh

Affiliation:	1.The University of Tokyo,Tokyo,Japan;2.Tokyo Institute of Technology,Tokyo,Japan;3.Rakuten, Inc.,Tokyo,Japan;4.National Institute of Informatics,Tokyo,Japan

Abstract:	Text recognition in natural scene images is a challenging task that has recently been garnering increased research attention. In this paper, we propose a method for recognizing text by utilizing the layout consistency of a text string. We estimate the layout (four lines of a text string) using initial character extraction and recognition result. On the basis of the layout consistency across a word, we perform character extraction and recognition again using four lines, which is more accurate than the first process. Our layout estimation method is different from previous methods in terms of exploiting character recognition results and its use of a class-conditional layout model. More accurate and robust estimation is achieved, and it can be used to refine character extraction and recognition. We call this two-way process—from extraction and recognition to layout, and from layout to extraction and recognition—“bidirectional” to discriminate it from previous feedback refinement approaches. Experimental results demonstrate that our bidirectional processes provide a boost to the performance of word recognition.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏