首页 | 本学科首页   官方微博 | 高级检索  
     


BURSTS: A bottom-up approach for robust spotting of texts in scenes
Affiliation:1. Academy for Engineering and Technology, Fudan University, Shanghai 200433, China;2. Shanghai Engineering Research Center of AI & Robotics, China;3. Engineering Research Center of AI & Robotics, Ministry of Education, China;4. School of Information Science and Technology, Fudan University, Shanghai 200433, China;5. Department of Industrial and Manufacturing Systems Engineering, University of Michigan-Dearborn, Dearborn, MI 48128, USA;1. Department of Cardiology, China-Japan Union Hospital of Jilin University, Changchun, China;2. Jilin Provincial Key Laboratory for Genetic Diagnosis of Cardiovascular Disease, Changchun, China;3. Jilin Provincial Molecular Biology Research Center for Precision Medicine of Major Cardiovascular Disease, Changchun, China;4. Department of Clinical Laboratory, China-Japan Union Hospital of Jilin University, Changchun, China;1. Department of Computer Science and Technology, Jilin University, Changchun 130012, China;2. Key Laboratory of Symbolic Computation and Knowledge Engineering for Ministry of Education, Jilin University, Changchun 130012, China;3. Editorial Department of Journal (Engineering and Technology Edition), Jilin University, Jilin, Changchun 130012, China;1. Department of Electronic and Information Engineering, Beijing Jiaotong University, Beijing 100044, China;2. College of information, Liaoning University, Liaoning 110036, China
Abstract:In this paper, we present a bottom-up approach for robust spotting of texts in scenes. In the proposed technique, character candidates are first detected using our proposed character detector, which leverages on the strengths of an Extremal Region (ER) detector and an Aggregate Channel Feature (ACF) detector for high character detection recall. The real characters are then identified by using a novel convolutional neural network (CNN) filter for high character detection precision. A hierarchical clustering algorithm is designed which combines multiple visual and geometrical features to group characters into word proposal regions for word recognition. The proposed technique has been evaluated on several scene text spotting datasets and experiments show superior spotting performance.
Keywords:Text spotting  CNN  Extremal region  Clustering  41A05  41A10  65D05  65D17
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号