基于k–最近邻分类增强学习的除冰机器人抓线控制 Line-grasping control of de-icing robot based on k-nearest neighbor reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于k–最近邻分类增强学习的除冰机器人抓线控制

引用本文：	魏书宁,王耀南,印峰,杨易旻.基于k–最近邻分类增强学习的除冰机器人抓线控制[J].控制理论与应用,2012,29(4):470-476.

作者姓名：	魏书宁王耀南印峰杨易旻

作者单位：	湖南大学电气与信息工程学院,湖南长沙,410082

基金项目：	国家科技支撑计划资助项目(2008BAF36B01).

摘要：	输电线柔性结构特性给除冰机器人越障抓线控制带来极大困难. 本文提出了一种结合k–最近邻(k-nearest neighbor, KNN)分类算法和增强学习算法的抓线控制方法. 利用基于KNN算法的状态感知机制选择机器人当前状态k个最邻近状态并且对之加权. 根据加权结果决定当前最优动作. 该方法可以得到机器人连续状态的离散表达形式, 从而有效解决传统连续状态泛化方法带来的计算收敛性和维数灾难问题. 借助增强学习算法探测和适应环境的能力, 该方法能够克服机器人模型误差和姿态误差，以及环境干扰等因素对抓线控制的影响. 文中给出了算法具体实现步骤, 并给出了应用此方法控制除冰机器人抓线的仿真实验.
关键词：	除冰机器人 k–最近邻分类算法增强学习维数灾难
收稿时间：	2011/1/11 0:00:00
修稿时间：	2011/6/21 0:00:00
Line-grasping control of de-icing robot based on k-nearest neighbor reinforcement learning

WEI Shu-ning,WANG Yao-nan,YIN Feng and YANG Yi-min.Line-grasping control of de-icing robot based on k-nearest neighbor reinforcement learning[J].Control Theory & Applications,2012,29(4):470-476.

Authors:	WEI Shu-ning WANG Yao-nan YIN Feng and YANG Yi-min

Affiliation:	College of Electrical and Information Engineering, Hunan University,College of Electrical and Information Engineering, Hunan University,College of Electrical and Information Engineering, Hunan University,College of Electrical and Information Engineering, Hunan University

Abstract:	The flexible mechanical characteristic of power lines induces difficulties for line-grasping control for de-icing robots. To deal with this difficulty, we propose for de-icing robots a line-grasping control approach which combines the k-nearest neighbor (KNN) algorithm and the reinforcement-learning (RL). In the learning iteration, the state-perception mechanism of the KNN algorithm selects k-nearest states and weights; from k-weighted states, an optimal action is determined. By expressing a continuous state by k-nearest discrete states in this way, this approach effectively ensures the convergence for the computation and avoids the curse of dimensionality occurred in traditional continuous state-space generalization methods. Abilities of RL in perception and adaptation to the environment make the line-grasping control to tolerate possible errors in robot model, errors of robot arm attitudes and interferences from the environment. The design procedures are presented in details. Simulation results of line-grasping control based on this approach are given.

Keywords:	de-icing robot k-nearest neighbor reinforcement learning curse of dimension
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《控制理论与应用》浏览原始摘要信息
	点击此处可从《控制理论与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏