A Novel Scene Text Recognition Method Based on Deep Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

A Novel Scene Text Recognition Method Based on Deep Learning

Authors:	Maosen Wang Shaozhang Niu Zhenguang Gao

Affiliation:	1Beijing Key Lab of Intelligent Telecommunication Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing, 100876, China. 2Department of Computer Science, Framingham State University, Framingham, MA 01772, USA.

Abstract:	Scene text recognition is one of the most important techniques in pattern recognition and machine intelligence due to its numerous practical applications. Scene text recognition is also a sequence model task. Recurrent neural network (RNN) is commonly regarded as the default starting point for sequential models. Due to the non-parallel prediction and the gradient disappearance problem, the performance of the RNN is difficult to improve substantially. In this paper, a new TRDD network architecture which base on dilated convolution and residual block is proposed, using Convolutional Neural Networks (CNN) instead of RNN realizes the recognition task of sequence texts. Our model has the following three advantages in comparison to existing scene text recognition methods: First, the text recognition speed of the TRDD network is much fast than the state-of-the-art scene text recognition network based recurrent neural networks (RNN). Second, TRDD is easier to train, avoiding the problem of exploding and vanishing, which is major issue for RNN. Third, both using larger dilated factors and increasing the filter size are all viable ways to change receptive field size. We benchmark the TRDD on four standard datasets, it has higher recognition accuracy and faster recognition speed based on the smaller model. It is hopefully used in the real-time application.

Keywords:	Scene text recognition dilated convolution CTC CNN TCN

	点击此处可从《计算机、材料和连续体（英文）》浏览原始摘要信息
	点击此处可从《计算机、材料和连续体（英文）》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏