首页 | 本学科首页   官方微博 | 高级检索  
     

结合密集神经网络与长短时记忆模型的中文识别
引用本文:张艺玮,赵一嘉,王馨悦,董兰芳.结合密集神经网络与长短时记忆模型的中文识别[J].计算机系统应用,2018,27(11):35-41.
作者姓名:张艺玮  赵一嘉  王馨悦  董兰芳
作者单位:中国科学技术大学 计算机科学与技术学院, 合肥 230022,辽宁省实验中学, 沈阳 110031,中国科学技术大学 计算机科学与技术学院, 合肥 230022,中国科学技术大学 计算机科学与技术学院, 合肥 230022
摘    要:文本图像识别是计算机视觉领域一项重要任务,而其中的中文识别因种类繁多、结构复杂以及类间相近等特点很具挑战性.为改善这一问题,使用文本行端到端的识别模型.首次提出利用密集卷积神经网络(DenseNet)提取文本图像底层特征,同时避免手工设计、统计图像特征的繁琐;将整行图像特征直接送入双向长短时记忆模型(BLSTM)进行局部相关性分析,减少字符定位分割这一步骤;最后采用时域连接模型(CTC)解码获得识别的文本信息.实验表明所提出的模型可以高效的进行图像文本行的识别,并对图像的多种形变具有较好的鲁棒性.

关 键 词:中文识别  端到端  密集卷积神经网络  双向长短时记忆模型  时域连接模型
收稿时间:2018/4/11 0:00:00
修稿时间:2018/5/11 0:00:00

Chinese Recognition Based on Dense Convolutional Network and Bidirectional Long Short-Term Memory Model
ZHANG Yi-Wei,ZHAO Yi-Ji,WANG Xin-Yue and DONG Lan-Fang.Chinese Recognition Based on Dense Convolutional Network and Bidirectional Long Short-Term Memory Model[J].Computer Systems& Applications,2018,27(11):35-41.
Authors:ZHANG Yi-Wei  ZHAO Yi-Ji  WANG Xin-Yue and DONG Lan-Fang
Affiliation:School of Computer Science and Technology, University of Science and Technology of China, Hefei 230022, China,Liaoning Provincial Shiyan High School, Shenyang 110031, China,School of Computer Science and Technology, University of Science and Technology of China, Hefei 230022, China and School of Computer Science and Technology, University of Science and Technology of China, Hefei 230022, China
Abstract:Text recognition is an important task in computer vision. The recognition of Chinese texts is challenging because of its wide range, complicated structure, and similar classes. In order to improve this problem, an end-to-end recognition model of text is used. The proposed model uses Dense convolutional Network (DenseNet) to extract features of text images, avoiding artificial design and statistics features. Then, the features are sent to Bidirectional Long Short-Term Memory model (BLSTM) for correlation analysis of local data. This step avoids the character segmentation. Finally, the Connectionist Temporal Classifier (CTC) is used to decode the text information. Experiments show that the proposed model can effectively recognize text images, and has strong robustness to various deformed images.
Keywords:Chinese recognition  end-to-end  Dense convolutional Network (DenseNet)  Bidirectional Long Short-Term Memory (BLSTM)  Connectionist Temporal Classifier (CTC)
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号