基于复合协方差函数的多任务模仿学习算法的研究与实现 Multitask Imitation Learning Algorithm Based on Composite Covariance Function期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于复合协方差函数的多任务模仿学习算法的研究与实现

引用本文：	于建均, 韩春晓, 阮晓钢, 刘涛, 徐骢驰, 门玉森. 基于复合协方差函数的多任务模仿学习算法的研究与实现[J]. 北京工业大学学报, 2016, 42(4): 499-507. DOI: 10.11936/bjutxb2015030055

作者姓名：	于建均韩春晓阮晓钢刘涛徐骢驰门玉森

作者单位：	1.北京工业大学电子信息与控制工程学院,北京 100124

基金项目：	国家自然科学基金项目(61375086)，高等学校博士学科点专项科研基金资助课题(20101103110007)

摘要：	针对多任务下机器人模仿学习控制策略的获取问题,构建复合协方差函数,采用高斯过程回归方法对示教机器人的示教行为样本点建立高斯过程回归模型,并对其中的超参数进行优化,从而得出模仿学习控制策略,模仿机器人应用控制策略完成模仿任务.以Braitenberg车为仿真实验研究对象,对其趋光、避障多任务的模仿学习进行研究.仿真实验研究结果表明:与基于单一协方差函数的模仿学习算法相比,基于复合协方差函数的模仿学习算法不仅能够实现单任务环境下的机器人模仿学习,而且能够实现多任务环境下的机器人模仿学习,且精度更高.任务环境改变实验研究结果表明该方法有很好的适应性.
关键词：	机器人模仿学习高斯过程回归复合协方差函数
收稿时间：	2015-03-19
Multitask Imitation Learning Algorithm Based on Composite Covariance Function

YU Jianjun, HAN Chunxiao, RUAN Xiaogang, LIU Tao, XU Congchi, MEN Yusen. Multitask Imitation Learning Algorithm Based on Composite Covariance Function[J]. Journal of Beijing University of Technology, 2016, 42(4): 499-507. DOI: 10.11936/bjutxb2015030055

Authors:	YU Jianjun HAN Chunxiao RUAN Xiaogang LIU Tao XU Congchi MEN Yusen

Affiliation:	1.College of Electronic and Control Engineering,Beijing University of Technology, Beijing 100124, China

Abstract:	To acquire the multitask robot imitation learning control strategy, a Gauss process regression ( GPR) model was established to express the control strategy, a composite covariance function was constructed, and the sample points of the teaching behavior was used to optimized the hyperparameters in the GPR model. The control strategy was applied by the imitation robot to accomplish the imitation task. The Braitenberg vehicles were used as simulation object to research multitask ( phototaxis and obstacle avoidance tasks) imitation learning. Simulation results indicate that compared with the imitation learning algorithm based on the single covariance function, the imitation learning algorithm based on the composite covariance function can not only realize single task imitation learning, but also realize multitask imitation learning, and the precision is higher. The simulation results in various task environments indicate that the method is adaptive.

Keywords:	robot imitation learning Gaussian process regression ( GPR) composite covariance function
本文献已被万方数据等数据库收录！
	点击此处可从《北京工业大学学报》浏览原始摘要信息
	点击此处可从《北京工业大学学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏