Building efficient CNN architecture for offline handwritten Chinese character recognition |
| |
Authors: | Zhiyuan Li,Nanjun Teng,Min Jin,Huaxiang Lu |
| |
Affiliation: | 1. Lab of High-speed Circuit and Neural Networks,Institute of Semiconductors, CAS,Beijing,China;2.University of Chinese Academy of Sciences,Beijing,China;3.Beijing Key Laboratory of Semiconductor Neural Network Intelligent Sensing and Computing Technology,Beijing,China;4.CAS Center for Excellence in Brain Science and Intelligence Technology,Beijing,China |
| |
Abstract: | Deep convolutional neural networks-based methods have brought great breakthrough in image classification, which provides an end-to-end solution for handwritten Chinese character recognition (HCCR) problem through learning discriminative features automatically. Nevertheless, state-of-the-art CNNs appear to incur huge computational cost and require the storage of a large number of parameters especially in fully connected layers, which is difficult to deploy such networks into alternative hardware devices with limited computation capacity. To solve the storage problem, we propose a novel technique called weighted average pooling for reducing the parameters in fully connected layer without loss in accuracy. Besides, we implement a cascaded model in single CNN by adding mid output to complete recognition as early as possible, which reduces average inference time significantly. Experiments are performed on the ICDAR-2013 offline HCCR dataset. It is found that our proposed approach only needs 6.9 ms for classifying a character image on average and achieves the state-of-the-art accuracy of 97.1% while requires only 3.3 MB for storage. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|