融合多路特征和注意力机制的强化学习关键帧提取算法 Key frame extraction algorithm of reinforcement learning based on multi-channel feature and attention mechanism期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

融合多路特征和注意力机制的强化学习关键帧提取算法

引用本文：	曹春萍,苑凯歌.融合多路特征和注意力机制的强化学习关键帧提取算法[J].计算机应用研究,2022,39(4):1274-1280.

作者姓名：	曹春萍苑凯歌

作者单位：	上海理工大学光电信息与计算机工程学院,上海200093

摘要：	针对现有视频关键帧提取算法对运动类视频中运动特征提取不准导致的漏检和误检问题，提出一种融合多路特征和注意力机制的强化学习关键帧提取算法。该算法首先通过人体姿态识别算法对视频序列进行人体骨骼关节点提取；然后使用S-GCN和ResNet50网络分别提取视频序列中的运动特征和静态特征，并将两者进行加权融合；最后应用注意力机制对特征序列进行视频帧重要性计算，并利用强化学习进行关键帧的提取和优化。实验结果表明，该算法能较好地解决运动类视频在关键帧提取中出现的漏误检问题，在检测含有关键性动作的视频帧时表现较好，算法准确率高、稳定性强。
关键词：	人体骨骼人体姿态识别算法 S-GCN 注意力机制 ResNet50 强化学习
收稿时间：	2021/6/28 0:00:00
修稿时间：	2022/3/14 0:00:00
Key frame extraction algorithm of reinforcement learning based on multi-channel feature and attention mechanism

caochunping and yuankaige.Key frame extraction algorithm of reinforcement learning based on multi-channel feature and attention mechanism[J].Application Research of Computers,2022,39(4):1274-1280.

Authors:	caochunping and yuankaige

Affiliation:	University of Shanghai for Science & Technology,

Abstract:	Aiming at the problem of missing detection and false detection caused by inaccurate motion feature extraction of existing video key frame extraction algorithms, this paper proposed a reinforcement learning key frame extraction algorithm combining multi-channel feature and attention mechanism. The algorithm extracted the human skeleton joint points from the video sequence through the human posture recognition algorithm firstly. Then it used the S-GCN and ResNet50 network to extract the motion features and static features in the video sequence respectively, and performed a weighted fusion of the two. Finally it applied the attention mechanism to calculate the importance of the video frame of the feature sequence, and used reinforcement learning to extract and optimize key frames. The experimental results show that the algorithm can solve the problem of missing and false detection in the key frame extraction of motion video. It performs well in the detection of video frames containing key actions, with high accuracy and strong stability.

Keywords:	human skeleton human posture recognition S-GCN(spatial graph convolutional networks) attention mechanism ResNet50(residual network50) reinforcement learning
本文献已被万方数据等数据库收录！
	点击此处可从《计算机应用研究》浏览原始摘要信息
	点击此处可从《计算机应用研究》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏