特征增强和双线性特征向量融合的移动端工业货箱文本检测 Feature enhancement and bilinear feature vector fusion for text detection of mobile industrial containers期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

特征增强和双线性特征向量融合的移动端工业货箱文本检测

引用本文：	胡海洋,厉泽品,李忠金. 特征增强和双线性特征向量融合的移动端工业货箱文本检测[J]. 电信科学, 2022, 38(7): 75-87. DOI: 10.11959/j.issn.1000-0801.2022139

作者姓名：	胡海洋厉泽品李忠金

作者单位：	杭州电子科技大学计算机学院,浙江杭州 310018;浙江省脑机协同智能重点实验室,浙江杭州 310018

基金项目：	国家自然科学基金资助项目(61572162);国家自然科学基金资助项目(61802095);浙江省重点研发计划项目(2018C01012);浙江省自然科学基金资助项目(LQ17F020003)

摘要：	在实际工业环境下,光线昏暗、文本不规整、设备有限等因素,使得文本检测成为一项具有挑战性的任务。针对此问题,设计了一种基于双线性操作的特征向量融合模块,并联合特征增强与半卷积组成轻量级文本检测网络RGFFD(ResNet18+GhostModule+特征金字塔增强模块(feature pyramid enhancement module,FPEM)+特征融合模块(feature fusion module,FFM)+可微分二值化(differenttiable binarization,DB))。其中,Ghost模块内嵌特征增强模块,提升特征提取能力,双线性特征向量融合模块融合多尺度信息,添加自适应阈值分割算法提高DB模块分割能力。在实际工厂环境下,采用嵌入式设备UP2 board对货箱编号进行文本检测,RGFFD检测速度达到6.5 f/s。同时在公共数据集ICDAR2015、Total-text上检测速度分别达到39.6 f/s和49.6 f/s,在自定义数据集上准确率达到88.9%,检测速度为30.7 f/s。
关键词：	半卷积特征融合文本检测特征增强特征向量融合
Feature enhancement and bilinear feature vector fusion for text detection of mobile industrial containers

Haiyang HU,Zepin LI,Zhongjin LI. Feature enhancement and bilinear feature vector fusion for text detection of mobile industrial containers[J]. Telecommunications Science, 2022, 38(7): 75-87. DOI: 10.11959/j.issn.1000-0801.2022139

Authors:	Haiyang HU Zepin LI Zhongjin LI

Affiliation:	1. School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China;2. Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, China

Abstract:	In the real factory environment, due to factors such as dim light, irregular text, and limited equipment, text detection becomes a challenging task.Aiming at this problem, a feature vector fusion module based on bilinear operation was designed and combined with feature enhancement and semi-convolution to form a lightweight text detection network RGFFD (ResNet18 + Ghost Module + FPEM(feature pyramid enhancement module)) + FFM(feature fusion module) + DB (differentiable binarization)).Among them, the Ghost module was embedded with a feature enhancement module to improve the feature extraction capability, the bilinear feature vector fusion module fused multi-scale information, and an adaptive threshold segmentation algorithm was added to improve the segmentation capability of the DB module.In the real industrial environment, the RGFFD detection speed reached 6.5 f/s, when using the embedded device UP2 board for text detection of container numbers.At the same time, the detection speed on the public datasets ICDAR2015 and Total-text reached 39.6 f/s and 49.6 f/s, respectively.The accuracy rate on the custom dataset reached 88.9%, and the detection speed was 30.7 f/s.

Keywords:	text detection semi-convolution feature vector fusion feature enhancement feature fusion
本文献已被维普万方数据等数据库收录！
	点击此处可从《电信科学》浏览原始摘要信息
	点击此处可从《电信科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏