基于时空张量融合的人体骨架行为自适应识别方法 Adaptive recognition method of human skeleton action with spatial-temporal tensor fusion期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于时空张量融合的人体骨架行为自适应识别方法

引用本文：	建中华,南静,刘鑫,代伟.基于时空张量融合的人体骨架行为自适应识别方法[J].仪器仪表学报,2023,44(6):74-85.

作者姓名：	建中华南静刘鑫代伟

作者单位：	1. 中国矿业大学人工智能研究院;2. 中国矿业大学信息与控制工程学院;1. 中国矿业大学人工智能研究院，2. 中国矿业大学信息与控制工程学院

基金项目：	国家自然科学基金面上项目(61973306)、江苏省优秀青年基金(BK20200086)、江苏省研究生科研与实践创新计划项目(KYCX22_2558)资助

摘要：	针对人体行为的空间复杂性和时间差异性问题,提出了一种基于时空张量融合的人体骨架行为自适应识别方法。首先充分利用人体行为骨架序列的帧内空间关系和帧间时间关系,构建相邻帧时空特征张量;其次通过计算相邻帧时空特征张量的差异性获取关键相邻帧时空特征张量并组成行为时空特征张量;之后利用行为时空特征张量的空间特征差异和多尺度时间卷积构建行为时空特征张量自适应注意力机制,完成行为时空特征融合;最后,使用深度随机配置网络根据行为时空特征融合张量识别人体行为。使用 NTU RGB-D 数据集进行实验仿真,识别准确率达到 84. 57% ,并且设计相应的系统进行实际应用验证, 结果表明本文所提方法是一种适合应对人体行为空间复杂性和时间差异性问题的人体行为识别方法。
关键词：	人体行为识别人体骨架注意力机制关键帧时空特征
Adaptive recognition method of human skeleton action with spatial-temporal tensor fusion

Jian Zhonghu,Nan Jing,Liu Xin,Dai Wei.Adaptive recognition method of human skeleton action with spatial-temporal tensor fusion[J].Chinese Journal of Scientific Instrument,2023,44(6):74-85.

Authors:	Jian Zhonghu Nan Jing Liu Xin Dai Wei

Abstract:	To address space complexity and time difference of human action, an adaptive recognition method of human skeleton action with spatial-temporal tensor fusion is proposed. Firstly, the spatial-temporal feature tensors of adjacent frames are established by making full use of the intra-frame spatial relationship and the inter-frame temporal relationship of the human action skeleton sequences. Secondly, the difference of spatial-temporal feature tensors of adjacent frames is calculated to achieve the spatial-temporal feature tensors of key adjacent frames and compose the behavior spatial-temporal feature tensors. Then, the spatial feature difference of the action spatial-temporal feature tensors and the multi-scale temporal convolution is used to construct the adaptive attention mechanism of the behavior spatio-temporal feature tensors to complete the fusion of action spatial-temporal features. Finally, a deep stochastic configuration network is used to recognize human action according to the spatial-temporal feature fusion tensor of action. The NTU RGB-D data set was used for experimental simulation, and the recognition accuracy reached 84. 57%. The corresponding system is designed for practical application verification. The results show that the proposed method is suitable for dealing with the space complexity and time difference of human action recognition.

Keywords:	human action recognition human skeleton attention mechanism keyframe spatial-temporal feature

	点击此处可从《仪器仪表学报》浏览原始摘要信息
	点击此处可从《仪器仪表学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏