首页 | 本学科首页   官方微博 | 高级检索  
     

时序行为提名的上下文信息融合方法
引用本文:王新文,谢林柏,彭力.时序行为提名的上下文信息融合方法[J].计算机科学与探索,2021,15(3):486-494.
作者姓名:王新文  谢林柏  彭力
作者单位:物联网技术应用教育部工程研究中心(江南大学物联网工程学院),江苏无锡214122
基金项目:教育部-中国移动科研基金项目;国家自然科学基金
摘    要:在针对视频的人体活动定位和识别领域中,现有的时序行为提名方法无法很好地解决行为特征长期依赖性而导致提名召回率较低。针对此问题,提出了一种上下文信息融合的时序行为提名方法。该方法首先采用三维卷积网络提取视频单元的时空特征,然后采用双向门控循环网络构建上下文关系预测出时序行为区间。针对门控循环单元(GRU)存在参数较多和梯度消失的问题,通过输入特征控制门结构增强并行计算能力,通过引入加权平均增强历史和当前时刻信息融合能力,提出了一个简化的门控循环单元(S-GRU)。最后在数据集Thumos14上进行实验验证和比较,结果表明基于双向S-GRU循环网络的时序行为提名方法提高了提名召回率。

关 键 词:门控循环网络(GRU)  梯度消失  上下文信息  时序行为提名  时序行为检测

Context Information Fusion Method for Temporal Action Proposals
WANG Xinwen,XIE Linbo,PENG Li.Context Information Fusion Method for Temporal Action Proposals[J].Journal of Frontier of Computer Science and Technology,2021,15(3):486-494.
Authors:WANG Xinwen  XIE Linbo  PENG Li
Affiliation:(Engineering Research Center of Internet of Things Technology Applications(School of Internet of Things Engineering,Jiangnan University),Ministry of Education,Wuxi,Jiangsu 214122,China)
Abstract:In the field of human activity localization and recognition in videos,the existing temporal action proposal methods have not solved the long-term dependence problem better,which results in lower recall rates of proposals.In view of this problem,a method based on context information fusion for temporal action proposals is proposed in this paper.Firstly,the spatiotemporal features of video units are extracted by the 3D convolutional network.Then,the bidirectional recurrent network is used to construct the context relationship for predicting the temporal action proposals.Considering the problems of more parameters and the vanishing gradient in the gated recurrent unit(GRU),a simplified-GRU(S-GRU)is proposed,in which the input features control the gating structure to enhance the parallel computing capability and the weighted average is introduced to enhance the ability of the gated recurrent unit to adaptively fuse the history and current time information.Finally,experimental results on the Thumos14 dataset demonstrate that the method based on the bidirectional S-GRU for temporal action proposals improves the recall rate of proposals.
Keywords:gated recurrent network(GRU)  vanishing gradient  context information  temporal action proposals  temporal action detection
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号