首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度卷积特征的场景全局与局部表示方法
引用本文:林潮威,李菲菲,陈虬.基于深度卷积特征的场景全局与局部表示方法[J].电子科技,2022,35(4):20-27.
作者姓名:林潮威  李菲菲  陈虬
作者单位:上海理工大学 光电信息与计算机工程学院,上海 200093
基金项目:上海市高校特聘教授东方学者岗位计划项目
摘    要:场景识别是计算机视觉研究中的一项基本任务.与图像分类不同,场景识别需要综合考虑场景的背景信息、局部场景特征以及物体特征等因素,导致经典卷积神经网络在场景识别上性能欠佳.为解决此问题,文中提出了一种基于深度卷积特征的场景全局与局部表示方法.此方法对场景图片的卷积特征进行变换从而为每张图片生成一个综合的特征表示.使用CAM...

关 键 词:场景识别  卷积神经网络  卷积特征  特征变换  类激活图  长短期记忆  注意力机制  端到端网络
收稿时间:2020-11-21

Globaland Local Scene Representation Method Based on Deep Convolutional Features
LIN Chaowei,LI Feifei,CHEN Qiu.Globaland Local Scene Representation Method Based on Deep Convolutional Features[J].Electronic Science and Technology,2022,35(4):20-27.
Authors:LIN Chaowei  LI Feifei  CHEN Qiu
Affiliation:School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China
Abstract:Scene Recognition is a fundamental task in computer vision. Different from image classification, scene recognition needs to take a comprehensive consideration of factors such as global layout information, local scene features, and object features, which leads to the poor performance of classic convolutional neural network for scene recognition. In order to solve this issue, this study proposes a global and local scene representation method based on deep convolutional features. The proposed method transforms deep convolutional features of scene image to generate a comprehensive representation for each image. Specifically, CAM is used to discovery local key regions, and LSTM is used to encode convolutional features extracted from local key regions to produce the local representation for scene images. Attention mechanism is adopted to fuse scene features and object features to form a global representation for scene images. Finally, the evaluation experiments are conducted on MIT indoor 67 data set and the results show that the test accuracy is up to 87.59% using the proposed method.
Keywords:scene recognition  convolutional neural networks  convolutional features  feature transform  CAM  LSTM  attention mechanism  end-to-end network  
本文献已被 万方数据 等数据库收录!
点击此处可从《电子科技》浏览原始摘要信息
点击此处可从《电子科技》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号