Hypercolumn-array based image representation and its application to shape-based object detection
Affiliation: 1. School of Computer Science, Laboratory of Cognitive Modeling and Algorithms, Shanghai Key Laboratory of Data Science, Fudan University, Shanghai, China; 2. School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
Abstract: Biological and psychological evidence increasingly indicates that high-level geometrical and topological features are key to shape-based object recognition in the brain. Motivated by the performance of biological visual systems, we simulate the mechanism of hypercolumns in mammalian cortical area V1, which respond selectively to oriented bar stimuli. We design a regularly arranged hypercolumn array to extract and represent linear or near-linear stimuli in an image. Each unit of the array covers stimuli of various orientations within a small area, and multiple units together produce a low-dimensional vector that describes shape. Based on the neighborhood relations among units in the array, we construct a graph in which each node represents a short line segment with a specific position and slope, so that a contour segment in the image corresponds to a route in the graph. The graph converts an image, which typically consists of unstructured raw data, into structured, semantically enriched data. For object detection, we search along routes in the graph and compare them with a shape template. The graph raises the level of image representation, reduces the number of segment combinations that must be considered, improves the efficiency of object search, and facilitates the incorporation of high-level knowledge. This work provides a systematic infrastructure for shape-based object recognition.
Keywords: Orientation column; Shape representation; Object recognition; Primary visual cortex
This article is indexed in ScienceDirect and other databases.
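
The graph construction outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Segment` type, the 8-neighborhood rule, the `max_turn` orientation threshold, and the function names are all assumptions made for illustration. Each node is a short line segment with a grid position and an orientation, edges join segments in neighboring array units whose orientations are close, and a contour candidate is a route through the graph.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Segment:
    """Hypothetical node: a short line segment reported by one array unit."""
    row: int      # grid row of the unit
    col: int      # grid column of the unit
    angle: float  # orientation in degrees, in [0, 180)


def angle_diff(a, b):
    """Smallest difference between two orientations (period 180 degrees)."""
    d = abs(a - b) % 180.0
    return min(d, 180.0 - d)


def build_graph(segments, max_turn=30.0):
    """Link segments in adjacent units whose orientations differ by at most
    max_turn degrees (an assumed threshold, not taken from the paper)."""
    by_cell = {}
    for s in segments:
        by_cell.setdefault((s.row, s.col), []).append(s)
    graph = {s: set() for s in segments}
    for s in segments:
        for dr, dc in product((-1, 0, 1), repeat=2):
            if dr == 0 and dc == 0:
                continue  # skip the unit itself
            for t in by_cell.get((s.row + dr, s.col + dc), []):
                if angle_diff(s.angle, t.angle) <= max_turn:
                    graph[s].add(t)
    return graph


def routes_from(graph, start, max_len=5):
    """Yield every simple route that begins at start and has at most
    max_len nodes, using an explicit depth-first stack."""
    stack = [(start,)]
    while stack:
        path = stack.pop()
        yield path
        if len(path) < max_len:
            for nxt in graph[path[-1]]:
                if nxt not in path:
                    stack.append(path + (nxt,))


# Three nearly collinear segments and one perpendicular outlier.
a, b, c = Segment(0, 0, 0.0), Segment(0, 1, 10.0), Segment(0, 2, 20.0)
d = Segment(1, 0, 90.0)
graph = build_graph([a, b, c, d])
routes = set(routes_from(graph, a))
```

In this toy input the three nearly collinear segments chain into a single route while the perpendicular segment stays isolated. Under these assumptions, a detector would then score each route against a shape template, for example by comparing sequences of turning angles.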