LDNet: Lightweight dynamic convolution network for human pose estimation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

LDNet: Lightweight dynamic convolution network for human pose estimation

Affiliation:	1. School of Mechanical Engineering, University of Shanghai for Science and Technology, China;2. School of Mechanical Engineering, Jiangsu University, China;3. College of Engineering, Coventry University, Coventry, UK;4. School of Physics, Engineering & Computer Science, University of Hertfordshire, UK

Abstract:	Lightweight implementation of existing human pose estimation networks limits the model representation capability, and it cannot effectively deal with problems such as changeable poses, complex backgrounds, and occlusion in practical applications. To address this problem, a lightweight human pose estimation network with dynamic convolution, called LDNet, is proposed in this study. First, we start from a lightweight feature extraction head to reduce the number of image preprocessing parameters. Then, we employ a high-resolution parallel subnetwork to predict precise keypoint heatmaps. To reduce the complexity due to high-resolution representations while maintaining good network performance, we propose a lightweight dynamic convolution. It can cope with changing human poses by adaptively learning different convolution parameters. Finally, to further exploit the relationship between the high-level semantic and spatial structure features for accurately locating different keypoints, we propose a keypoint refinement module based on our lightweight dynamic convolution to improve the keypoint detection and location results. Overall, accurate keypoint prediction results are obtained and compared with those of many existing networks, such as HRNet, the number of parameters is reduced by 82.1% and the calculation complexity is reduced by 47.9%. The model achieves an average precision of 73.5% and 88.7% on the COCO 2017 and MPII datasets, respectively. LDNet also shows good prediction accuracy and robustness on the CrowdPose dataset. The proposed network is superior to existing outstanding lightweight networks and is comparable to existing large-scale human pose estimation networks.

Keywords:	Human pose estimation Lightweight dynamic convolution Adaptive learning Keypoint optimization
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏