首页 | 本学科首页   官方微博 | 高级检索  
     

面向精准价格牌识别的多任务循环神经网络
引用本文:牟永强,范宝杰,孙超,严蕤,郭怡适.面向精准价格牌识别的多任务循环神经网络[J].自动化学报,2022,48(2):608-614.
作者姓名:牟永强  范宝杰  孙超  严蕤  郭怡适
作者单位:1.广州图匠数据科技有限公司人工智能实验室 广州 510310
摘    要:为了促进智能新零售在线下业务场景的发展, 提高作为销售关键信息价格牌的识别精度. 本文对价格牌识别问题进行研究, 有效地提高了价格牌的识别精度, 并解决小数点定位不准确的难题. 通过深度卷积神经网络提取价格牌的深度语义表达特征, 将提取到的特征图送入多任务循环网络层进行编码, 然后根据解码网络设计的注意力机制解码出价格数字, 最后将多个分支的结果整合并输出完整价格. 本文所提出的方法能够非常有效地提高线下零售场景价格牌的识别精度, 并解决了一些领域难题如小数点的定位问题, 此外, 为了验证本文方法的普适性, 在其他场景数据集上进行了对比实验, 相关结果也验证了本文方法的有效性.

关 键 词:卷积神经网络    循环神经网络    文本识别    多任务学习    价格牌识别
收稿时间:2019-09-06

Towards Accurate Price Tag Recognition Algorithm With Multi-task RNN
MOU Yong-Qiang,FAN Bao-Jie,SUN Chao,YAN Rui,GUO Yi-Shi.Towards Accurate Price Tag Recognition Algorithm With Multi-task RNN[J].Acta Automatica Sinica,2022,48(2):608-614.
Authors:MOU Yong-Qiang  FAN Bao-Jie  SUN Chao  YAN Rui  GUO Yi-Shi
Affiliation:1.AI-Labs, Guangzhou Image Data Technology Co., Ltd., Guangzhou 5103102.College of Information Engineering, Guangdong University of Technology, Guangzhou 510006
Abstract:In order to promote the development of smart new retail in the offline scenario and improve the recognition accuracy of price tags,which is a key sales information,this paper studies this application scene to improve the recognition accuracy effectively of the price tag and solves the difficulty of locating decimal point.The deep semantic expression features of price tag are extracted by the deep convolutional neural network,sent to the multi-task recurrent network layer for encoding,and then the price number is decoded according to the attention mechanism of the decoding network,and finally the results of multiple branches are integrated to output the complete price.The method we proposed can effectively improve the price recognition accuracy in the smart new retail scenario and solve some challenge problems such as locating the position of decimal point.In addition,in order to verify the universality of the method,comparative experiments are carried out on datasets of other scenarios,and the related results also verify the effectiveness of the method in this paper.
Keywords:Convolutional neural network  recurrent neural network  text recognition  multi-task learning  price tag recognition
本文献已被 维普 等数据库收录!
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号