Multi-modal feature robotic arm grasping pose detection with attention mechanism


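Cite this article:	CHU Hong-yu, LENG Qi-qi, ZHANG Xiao-qiang, CHANG Zhi-yuan, SHAO Yan-hua. Multi-modal feature robotic arm grasping pose detection with attention mechanism[J]. Control and Decision, 2024, 39(3): 777-785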

Author names:	CHU Hong-yu, LENG Qi-qi, ZHANG Xiao-qiang, CHANG Zhi-yuan, SHAO Yan-hua

Author affiliation:	College of Information Engineering, Southwest University of Science and Technology, Mianyang, Sichuan 621010

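Funding:	State Administration of Science, Technology and Industry for National Defense project ([2019]1276); National Natural Science Foundation of China (12175187); Southwest University of Science and Technology Doctoral Fund (19zx7123).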

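Abstract:	To address the low accuracy and long runtime of grasp pose detection for unknown objects in robotic arm grasp detection, a multi-modal feature grasp pose detection network with an attention mechanism is proposed. First, a multi-modal feature fusion module is designed that fuses the multi-modal features while weighting and strengthening them. Then, because relatively shallow residual networks are weak at extracting key features, a convolutional attention module is introduced to further improve the network's feature extraction ability. Finally, the extracted features are regressed directly through fully connected layers to obtain the optimal grasp pose. Experiments show that on the public Cornell grasp dataset the proposed algorithm reaches a detection accuracy of 98.9% on the image-wise split and 98.7% on the object-wise split, at a detection speed of 51 FPS; in 100 real grasping trials on 10 classes of objects, the success rate is 95%.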
Keywords:	target grasping; pose detection; robotic arm; attention mechanism; multi-modal features; deep learning
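The abstract reports accuracy under two evaluation protocols on the Cornell grasp dataset, image-wise and object-wise splitting. The sketch below illustrates the difference between the two on toy data. It is only an illustration: the `(image_id, object_id)` sample format, the 20% test ratio, and the function names are assumptions, not details taken from the paper.

```python
import random
from collections import defaultdict


def image_wise_split(samples, test_ratio=0.2, seed=0):
    """Split by shuffling individual images.

    The same object may appear in both train and test, so this
    measures generalization to new views of known objects.
    """
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def object_wise_split(samples, test_ratio=0.2, seed=0):
    """Split by shuffling object instances.

    Every image of a held-out object goes to the test set, so this
    measures generalization to objects never seen in training.
    """
    rng = random.Random(seed)
    by_object = defaultdict(list)
    for image_id, object_id in samples:
        by_object[object_id].append((image_id, object_id))
    object_ids = sorted(by_object)
    rng.shuffle(object_ids)
    n_test = max(1, int(len(object_ids) * test_ratio))
    train = [s for oid in object_ids[n_test:] for s in by_object[oid]]
    test = [s for oid in object_ids[:n_test] for s in by_object[oid]]
    return train, test


# Hypothetical toy data: 10 objects with 4 images each.
samples = [(f"img_{o}_{k}", o) for o in range(10) for k in range(4)]

train_i, test_i = image_wise_split(samples)
train_o, test_o = object_wise_split(samples)
shared = {o for _, o in train_o} & {o for _, o in test_o}
print(len(train_i), len(test_i), len(train_o), len(test_o), shared)
```

Under the object-wise split the set of shared objects is empty by construction, which is why it is usually read as the test for unknown objects.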
Multi-modal feature robotic arm grasping pose detection with attention mechanism

CHU Hong-yu,LENG Qi-qi,ZHANG Xiao-qiang,CHANG Zhi-yuan,SHAO Yan-hua. Multi-modal feature robotic arm grasping pose detection with attention mechanism[J]. Control and Decision, 2024, 39(3): 777-785

Authors:	CHU Hong-yu, LENG Qi-qi, ZHANG Xiao-qiang, CHANG Zhi-yuan, SHAO Yan-hua

Affiliation:	College of Information Engineering, Southwest University of Science and Technology, Mianyang 621010, China

Abstract:	To address the low accuracy and long detection time of grasp pose detection for unknown objects in robotic arm grasping, a multi-modal feature grasp pose detection network with an attention mechanism is proposed. First, a multi-modal feature fusion module is designed to fuse the multi-modal features while weighting and strengthening them. Then, because a relatively shallow residual network is weak at extracting key features, a convolutional attention module is introduced to further improve the network's feature extraction ability. Finally, the optimal grasp pose is obtained by regressing the extracted features directly through the fully connected layer. Experimental results show that on the Cornell grasp dataset the detection accuracy is 98.9% with image-wise splitting and 98.7% with object-wise splitting, and the detection speed is 51 FPS. In 100 real-world grasps of 10 types of objects, the success rate is 95%.

Keywords:	target grasping; pose detection; robotic arms; attention mechanisms; multi-modal features; deep learning
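The pipeline described in the abstract (fuse multi-modal features, re-weight them with a convolutional attention module, regress the pose through a fully connected layer) can be sketched in NumPy as follows. This is a minimal sketch under stated assumptions, not the authors' network: it assumes RGB and depth feature maps as the two modalities, a CBAM-style channel-then-spatial attention, random untrained weights, and a 5-value pose output. All shapes, names, and sizes are illustrative.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat, w1, w2):
    """feat: (C, H, W). A shared two-layer MLP scores the average- and
    max-pooled channel descriptors; the sigmoid of their sum scales each channel."""
    avg = feat.mean(axis=(1, 2))  # (C,)
    mx = feat.max(axis=(1, 2))    # (C,)

    def mlp(v):
        return w2 @ np.maximum(w1 @ v, 0.0)  # ReLU hidden layer

    weights = sigmoid(mlp(avg) + mlp(mx))    # (C,)
    return feat * weights[:, None, None]


def spatial_attention(feat, kernel):
    """Channel-wise average and max maps are convolved with a (2, k, k)
    kernel (k odd); the sigmoid of the result scales each spatial position."""
    pooled = np.stack([feat.mean(axis=0), feat.max(axis=0)])  # (2, H, W)
    k = kernel.shape[-1]
    pad = k // 2
    padded = np.pad(pooled, ((0, 0), (pad, pad), (pad, pad)))
    height, width = feat.shape[1:]
    out = np.zeros((height, width))
    for i in range(height):
        for j in range(width):
            out[i, j] = np.sum(padded[:, i:i + k, j:j + k] * kernel)
    return feat * sigmoid(out)[None, :, :]


rng = np.random.default_rng(0)
C, H, W = 8, 6, 6                                  # assumed feature sizes
rgb_feat = rng.standard_normal((C, H, W))          # stand-in for RGB features
depth_feat = rng.standard_normal((C, H, W))        # stand-in for depth features

# Fusion by channel concatenation (an assumption), then attention re-weighting.
fused = np.concatenate([rgb_feat, depth_feat], axis=0)  # (2C, H, W)
w1 = rng.standard_normal((4, 2 * C)) * 0.1         # channel reduction to 4 (assumed)
w2 = rng.standard_normal((2 * C, 4)) * 0.1
kernel = rng.standard_normal((2, 7, 7)) * 0.1      # 7x7 spatial kernel (assumed)
refined = spatial_attention(channel_attention(fused, w1, w2), kernel)

# Direct regression: global average pooling followed by one linear layer.
w_fc = rng.standard_normal((5, 2 * C)) * 0.1       # 5 pose values (assumed)
pose = w_fc @ refined.mean(axis=(1, 2))
print(refined.shape, pose.shape)
```

Because both attention maps pass through a sigmoid, every feature value is scaled by a factor between 0 and 1, so the module can only suppress or preserve features relative to one another; the learned weights decide which channels and positions are kept closest to full strength.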
