首页 | 本学科首页   官方微博 | 高级检索  
     

基于多粒度视频信息和注意力机制的视频场景识别
引用本文:袁韶祖,王雷全,吴春雷.基于多粒度视频信息和注意力机制的视频场景识别[J].计算机系统应用,2020,29(5):252-256.
作者姓名:袁韶祖  王雷全  吴春雷
作者单位:中国石油大学(华东)计算机科学与技术学院,青岛 266580;中国石油大学(华东)计算机科学与技术学院,青岛 266580;中国石油大学(华东)计算机科学与技术学院,青岛 266580
基金项目:山东省重点研发计划(2019GGX101015);中央高校自主创新科研计划(17CX02041A,18CX02136A)
摘    要:视频场景识别是机器学习和计算机视觉一个重要的研究领域.但是当前对于视频场景识别的探索工作还远远不够,而且目前提出的模型大都使用视频级的特征信息,忽略了多粒度的视频特征关联.本文提出了一种基于多粒度的视频特征的注意力机制的模型架构,可以动态高效的利用各维度视频信息之间存在的丰富的语义关联,提高识别准确度.本文在中国多媒体大会(CCF ChinaMM 2019)最新推出的VideoNet数据集上进行了实验,实验结果表明基于多粒度的视频特征的注意力机制的模型与传统方法相比具有明显的优越性.

关 键 词:视频场景识别  注意力机制  多粒度视频信息  卷积神经网络  检测网络
收稿时间:2019/10/16 0:00:00
修稿时间:2019/11/15 0:00:00

Video Scene Recognition with Multi-Granularity Video Features and Attention Mechanism
YUAN Shao-Zu,WANG Lei-Quan,WU Chun-Lei.Video Scene Recognition with Multi-Granularity Video Features and Attention Mechanism[J].Computer Systems& Applications,2020,29(5):252-256.
Authors:YUAN Shao-Zu  WANG Lei-Quan  WU Chun-Lei
Affiliation:College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
Abstract:Video scene recognition has attracted much attention in the field of machine learning and computer vision. It is not only an important practical application, but also a challenge for image understanding in the field of computer vision. Nevertheless, current exploration of video scene recognition has not been unable to meet the needs of production environment. And most proposed models only use video-level feature information, while ignore association of multi-granularity video feature. In this study, we propose an architecture of attention mechanism with multi-granularity video features, which can make use of the rich semantic association among the various dimensions of video information dynamically and efficiently, and improve the performance of the model. The experiments are conducted on the latest VideoNet dataset released by CCF China MM 2019. The result shows that the proposed model based on attention mechanism model with multi-granularity video features outperforms the previous methods.
Keywords:video scene recognition  attention mechanism  multi-granularity video features  CNN  detection network
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号