Review of Human Action Recognition Based on Deep Learning
Original title: 基于深度学习的人体动作识别综述
Cite this article: QIAN Huifang, YI Jianping, FU Yunhu. Review of Human Action Recognition Based on Deep Learning[J]. Journal of Frontiers of Computer Science and Technology (计算机科学与探索), 2021, 15(3): 438-455.
Authors: QIAN Huifang (钱慧芳), YI Jianping (易剑平), FU Yunhu (付云虎)
Affiliation: School of Electronics and Information, Xi'an Polytechnic University, Xi'an 710048, China
Abstract: Human action recognition is one of the important topics in video understanding and is widely used in video surveillance, human-computer interaction, motion analysis, and video information retrieval. Based on the characteristics of the backbone network, this paper introduces the latest research results in action recognition from three perspectives: 2D convolutional neural networks, 3D convolutional neural networks, and spatiotemporal decomposition networks. It then qualitatively analyzes and compares the advantages and disadvantages of these three classes of methods. Next, the commonly used action video datasets are comprehensively summarized in two groups, scene-related and temporal-related, with emphasis on the characteristics and usage of each dataset. The common pre-training strategies in action recognition tasks are then introduced, with emphasis on how pre-training affects the performance of action recognition models. Finally, starting from the latest research trends, future directions for action recognition are discussed from six perspectives: fine-grained action recognition, streamlined models, few-shot learning, unsupervised learning, adaptive networks, and video super-resolution action recognition.

Keywords: human action recognition; 2D convolutional neural network (2D CNN); 3D convolutional neural network (3D CNN); spatiotemporal decomposition network; pre-training
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).