首页 | 本学科首页   官方微博 | 高级检索  
     

融入注意力机制的深度学习动作识别
引用本文:张 宇,张 雷.融入注意力机制的深度学习动作识别[J].电讯技术,2021,61(10):1205-1212.
作者姓名:张 宇  张 雷
作者单位:北京建筑大学 电气与信息工程学院,北京100044
基金项目:智能机器人与系统北京高精尖创新中心建设项目(00921917001);北京市重点实验室项目(BZ0337)
摘    要:针对现有的深度学习方法在人体动作识别中易出现过拟合、易受到干扰信息影响、特征表达能力不足的问题,提出了一种融入注意力机制的深度学习动作识别方法.该方法在数据预处理中提出了视频数据增强算法,降低了模型过拟合的风险,然后在视频帧采样过程中对现有的采样算法进行了改进,有效抑制了干扰信息的影响,并在特征提取部分提出了融入注意力的残差网络,提高了模型的特征提取能力;之后,利用长短时记忆(Long Short-Term Memory,LSTM)网络解决了空间特征的时序关联问题;最后,通过Softmax完成了相应动作的分类.实验结果表明,在UCF YouTube、KTH和HMDB-51数据集上,所提方法的识别率分别为96.72%、98.06%和64.81%.

关 键 词:动作识别  深度学习  残差网络  注意力机制  长短时记忆网络

Deep learning action recognition with attention mechanism
ZHANG Yu,ZHANG Lei.Deep learning action recognition with attention mechanism[J].Telecommunication Engineering,2021,61(10):1205-1212.
Authors:ZHANG Yu  ZHANG Lei
Affiliation:School of Electrical and Information Engineering,Beijing University of Civil Engineering and Architecture,Beijing 100044,China
Abstract:In order to solve the problem that the existing deep learning methods are prone to overfitting,easily affected by interference information,and lack of feature expression ability in human action recognition,a deep learning action recognition method with attention mechanism is proposed.This method proposes a video data enhancement algorithm in the data preprocessing to reduce the risk of model overfitting,and improves the existing sampling algorithm in the video frame sampling process to effectively suppress the influence of interference information,and in the feature extraction part,residual network with attention is proposed to enhance the feature extraction ability of the model.Then,Long short term memory(LSTM) network is used to solve the timing sequential correlation problem of spatial features.Finally,the classification of the corresponding actions is completed by Softmax.The experimental results show that the recognition rates of the proposed method are 96.72%,98.06% and 64.81% on UCF YouTube,KTH and HMDB 51 datasets,respectively.
Keywords:action recognition  deep learning  residual network  attention mechanism  long short term memory network
本文献已被 万方数据 等数据库收录!
点击此处可从《电讯技术》浏览原始摘要信息
点击此处可从《电讯技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号