
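Multi-modal Action Recognition Based on Deep Learning Framework
Cite this article: HAN Min-jie. Multi-modal Action Recognition Based on Deep Learning Framework[J]. Computer and Modernization, 2017, 0(7): 48.
Author: HAN Min-jie
Funding: National Natural Science Foundation of China (61672285)
Abstract: This paper proposes a multi-modal action recognition method based on deep neural networks. A different deep network is chosen for each modality according to the characteristics of its information, so that each network suits the video information it processes, and the networks are then combined to extract multi-modal features for action recognition. Two modalities of human action are considered, static and dynamic. A Microsoft Kinect multi-sensor camera captures conventional video together with the corresponding depth skeleton-joint information. A convolutional neural network models the static information, and a recurrent neural network models the dynamic information. Finally, the features extracted by the two models are fused for action recognition and classification. Experimental results on the MSR 3D action database show that the proposed method classifies actions well.

Keywords: deep learning    multi-modality    action recognition
Received: 2017-07-20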

Multi-modal Action Recognition Based on Deep Learning Framework
HAN Min-jie.Multi-modal Action Recognition Based on Deep Learning Framework[J].Computer and Modernization,2017,0(7):48.
Authors:HAN Min-jie
Abstract: This paper proposes an approach to multi-modal action recognition based on deep neural networks. Because each modality of video information has different characteristics, a different deep network is used for each, and the networks are combined to exploit multi-modal features. We mainly consider the static and dynamic modalities of human action. A Microsoft Kinect multi-sensor camera captures conventional video and the corresponding depth skeleton data simultaneously. For the static RGB information we use a Convolutional Neural Network, while for the dynamic information we use a Recurrent Neural Network. Finally, we fuse the features extracted by the two networks and train the action classifier. Experimental results on the MSR 3D datasets show the effectiveness of our method.
Keywords:deep learning  multi-modality  action recognition  
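
The abstract does not say which recurrent cell or which fusion rule is used, so the following is only a minimal sketch of the dynamic branch and the fusion step under stated assumptions: a plain Elman-style recurrence over skeleton frames, fusion by simple concatenation, and a linear softmax classifier. The CNN branch is replaced by a placeholder feature vector, all weights are random and untrained, and every name and size (`rnn_features`, `fuse`, `NUM_JOINTS`, `HIDDEN`, and so on) is hypothetical rather than taken from the paper.

```python
import math
import random


def rnn_features(frames, w_in, w_rec):
    """Run an Elman-style recurrence over skeleton frames and
    return the final hidden state as the dynamic feature vector."""
    hidden = [0.0] * len(w_rec)
    for frame in frames:
        # The comprehension reads the previous hidden state before reassignment.
        hidden = [
            math.tanh(
                sum(w * x for w, x in zip(w_in[i], frame))
                + sum(w * h for w, h in zip(w_rec[i], hidden))
            )
            for i in range(len(w_rec))
        ]
    return hidden


def fuse(static_features, dynamic_features):
    """Feature-level fusion by concatenation (an assumption, not the paper's stated rule)."""
    return list(static_features) + list(dynamic_features)


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(fused, weights, biases):
    """Linear layer followed by softmax over action classes."""
    scores = [sum(w * x for w, x in zip(row, fused)) + b
              for row, b in zip(weights, biases)]
    return softmax(scores)


def random_matrix(rows, cols, rng):
    """Small random weights standing in for trained parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
NUM_JOINTS, HIDDEN, STATIC_DIM, NUM_CLASSES = 20, 8, 16, 5  # hypothetical sizes

# 30 frames of (x, y, z) joint coordinates; random stand-in for Kinect skeleton data.
frames = [[rng.uniform(-1, 1) for _ in range(NUM_JOINTS * 3)] for _ in range(30)]
# Placeholder for the feature vector a CNN would extract from the RGB frames.
static_features = [rng.uniform(0, 1) for _ in range(STATIC_DIM)]

dynamic_features = rnn_features(
    frames,
    random_matrix(HIDDEN, NUM_JOINTS * 3, rng),
    random_matrix(HIDDEN, HIDDEN, rng),
)
fused = fuse(static_features, dynamic_features)
probs = classify(fused, random_matrix(NUM_CLASSES, len(fused), rng), [0.0] * NUM_CLASSES)
predicted = max(range(NUM_CLASSES), key=probs.__getitem__)
```

In practice both branches would be trained networks in a deep learning framework. The sketch only shows how a per-sequence skeleton feature and a per-video appearance feature can be joined into one vector before classification.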