多聚点子空间下的时空信息融合及其在行为识别中的应用 Recognizing Action Using Multi-center Subspace Learning-based Spatial-temporal Information Fusion期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

多聚点子空间下的时空信息融合及其在行为识别中的应用

引用本文：	杨天金, 侯振杰, 李兴, 梁久祯, 宦娟, 郑纪翔. 多聚点子空间下的时空信息融合及其在行为识别中的应用. 自动化学报, 2022, 48(11): 2823−2835 doi: 10.16383/j.aas.c190327

作者姓名：	杨天金侯振杰李兴梁久祯宦娟郑纪翔

作者单位：	1.常州大学计算机与人工智能学院阿里云大数据学院软件学院常州 213164;;2.江苏省物联网移动互联技术工程重点实验室淮安 223003

基金项目：	国家自然科学基金(61803050, 61063021), 江苏省物联网移动互联技术工程重点实验室开放课题基金(JSWLW-2017-013), 浙江省公益技术研究社会发展项目(2017C33223)资助

摘要：	基于深度序列的人体行为识别, 一般通过提取特征图来提高识别精度, 但这类特征图通常存在时序信息缺失的问题. 针对上述问题, 本文提出了一种新的深度图序列表示方式, 即深度时空图(Depth space time maps, DSTM). DSTM降低了特征图的冗余度, 弥补了时序信息缺失的问题. 本文通过融合空间信息占优的深度运动图(Depth motion maps, DMM) 与时序信息占优的DSTM, 进行高精度的人体行为研究, 并提出了多聚点子空间学习(Multi-center subspace learning, MCSL)的多模态数据融合算法. 该算法为各类数据构建多个投影聚点, 以此增大样本的类间距离, 降低了投影目标区域维度. 本文在MSR-Action3D数据集和UTD-MHAD数据集上进行人体行为识别. 最后实验结果表明, 本文方法相较于现有人体行为识别方法有着较高的识别率.
关键词：	行为识别信息融合深度时空图多聚点子空间学习
收稿时间：	2019-04-30
Recognizing Action Using Multi-center Subspace Learning-based Spatial-temporal Information Fusion

Yang Tian-Jin, Hou Zhen-Jie, Li Xing, Liang Jiu-Zhen, Huan Juan, Zheng Ji-Xiang. Recognizing action using multi-center subspace learning-based spatial-temporal information fusion. Acta Automatica Sinica, 2022, 48(11): 2823−2835 doi: 10.16383/j.aas.c190327

Authors:	YANG Tian-Jin HOU Zhen-Jie LI Xing LIANG Jiu-Zhen HUAN Juan ZHENG Ji-Xiang

Affiliation:	1. School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou 213164;;2. Jiangsu Key Laboratory of Internet of Things Mobile Internet Technology Engineering, Huai＇an 223003

Abstract:	Human action recognitions from depth map sequences improve the recognition accuracy by extracting feature maps. A new representation of depth map sequences called depth space time map (DSTM) is proposed in this paper for overcoming the lack of temporal information in the feature maps. DSTM reduces the redundancy of action features. We conduct high-precision human action recognitions by fusing depth motion maps (DMM) and DSTM based on a new multi-modal data fusion algorithm called multi-center subspace learning (MCSL). The algorithm constructs multiple projection centers for each class data to expand the samples inter-class distance and reduce the projection target area dimension. Experiments conducted on MSR-Action3D and UTD-MHAD depth database show the effectiveness of the proposed method.

Keywords:	Action recognition information fusion depth space time maps (DSTM) multi-center subspace learning (MCSL)

	点击此处可从《自动化学报》浏览原始摘要信息
	点击此处可从《自动化学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏