基于深度卷积特征的场景全局与局部表示方法 Globaland Local Scene Representation Method Based on Deep Convolutional Features期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于深度卷积特征的场景全局与局部表示方法

引用本文：	林潮威,李菲菲,陈虬.基于深度卷积特征的场景全局与局部表示方法[J].电子科技,2022,35(4):20-27.

作者姓名：	林潮威李菲菲陈虬

作者单位：	上海理工大学光电信息与计算机工程学院,上海 200093

基金项目：	上海市高校特聘教授东方学者岗位计划项目

摘要：	场景识别是计算机视觉研究中的一项基本任务.与图像分类不同,场景识别需要综合考虑场景的背景信息、局部场景特征以及物体特征等因素,导致经典卷积神经网络在场景识别上性能欠佳.为解决此问题,文中提出了一种基于深度卷积特征的场景全局与局部表示方法.此方法对场景图片的卷积特征进行变换从而为每张图片生成一个综合的特征表示.使用CAM...
关键词：	场景识别卷积神经网络卷积特征特征变换类激活图长短期记忆注意力机制端到端网络
收稿时间：	2020-11-21
Globaland Local Scene Representation Method Based on Deep Convolutional Features

LIN Chaowei,LI Feifei,CHEN Qiu.Globaland Local Scene Representation Method Based on Deep Convolutional Features[J].Electronic Science and Technology,2022,35(4):20-27.

Authors:	LIN Chaowei LI Feifei CHEN Qiu

Affiliation:	School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China

Abstract:	Scene Recognition is a fundamental task in computer vision. Different from image classification, scene recognition needs to take a comprehensive consideration of factors such as global layout information, local scene features, and object features, which leads to the poor performance of classic convolutional neural network for scene recognition. In order to solve this issue, this study proposes a global and local scene representation method based on deep convolutional features. The proposed method transforms deep convolutional features of scene image to generate a comprehensive representation for each image. Specifically, CAM is used to discovery local key regions, and LSTM is used to encode convolutional features extracted from local key regions to produce the local representation for scene images. Attention mechanism is adopted to fuse scene features and object features to form a global representation for scene images. Finally, the evaluation experiments are conducted on MIT indoor 67 data set and the results show that the test accuracy is up to 87.59% using the proposed method.

Keywords:	scene recognition convolutional neural networks convolutional features feature transform CAM LSTM attention mechanism end-to-end network
本文献已被万方数据等数据库收录！
	点击此处可从《电子科技》浏览原始摘要信息
	点击此处可从《电子科技》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏