Q-Learning: computation of optimal Q-values for evaluating the learning level in robotic tasks
Authors: Tiziana D'Orazio, Grazia Cicirelli
Affiliation: Department of Computer Science, University of Southern California, Los Angeles, CA 90089-0782, USA
Abstract: A problem with using reinforcement learning (RL) algorithms in real robot applications is the difficulty of measuring the learning level reached after some experience. Among the different RL algorithms, Q-learning is the most widely used for robotic tasks. The aim of this work is to evaluate a priori the optimal Q-values for problems in which the distance between the current state and the goal state of the system can be computed. Starting from the Q-learning update formula, the equations for the maximum Q-weights, for both optimal and non-optimal actions, are derived for delayed and immediate rewards. Deterministic and non-deterministic grid-world environments are used to test the resulting equations in simulation. In addition, the convergence rates of the Q-learning algorithm are compared for different learning rate parameters.
Keywords: Q-learning; optimal Q-values; learning parameters; convergence rate
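
The idea summarized in the abstract can be illustrated with a minimal sketch, which is not the paper's own derivation. It assumes a deterministic 5x5 grid world, a single delayed reward `REWARD` received only on entering a terminal goal cell, a discount factor `GAMMA`, and Manhattan distance to the goal. Under those assumptions the optimal Q-value of an action is `GAMMA ** d * REWARD`, where `d` is the distance from the resulting state to the goal, so a non-optimal action that moves away from the goal simply gets a larger exponent. All names (`step`, `distance`, `closed_form_q`, `q_learning`) and parameter values are hypothetical choices made for this illustration.

```python
import itertools

GAMMA = 0.9      # discount factor (assumed value)
REWARD = 1.0     # delayed reward, given only on entering the goal (assumed)
SIZE = 5         # grid is SIZE x SIZE (assumed)
GOAL = (4, 4)    # terminal goal cell (assumed)
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def step(state, action):
    """Deterministic move; bumping into a wall leaves the state unchanged."""
    row = min(max(state[0] + action[0], 0), SIZE - 1)
    col = min(max(state[1] + action[1], 0), SIZE - 1)
    nxt = (row, col)
    return nxt, (REWARD if nxt == GOAL else 0.0)


def distance(state):
    """Manhattan distance from a state to the goal."""
    return abs(state[0] - GOAL[0]) + abs(state[1] - GOAL[1])


def closed_form_q(state, action):
    """A priori optimal Q-value under the assumptions stated above."""
    nxt, _ = step(state, action)
    return GAMMA ** distance(nxt) * REWARD


def q_learning(alpha=0.5, sweeps=200):
    """Tabular Q-learning, sweeping every state-action pair in turn."""
    states = [s for s in itertools.product(range(SIZE), repeat=2) if s != GOAL]
    q = {(s, a): 0.0 for s in states for a in ACTIONS}
    for _ in range(sweeps):
        for s in states:
            for a in ACTIONS:
                nxt, reward = step(s, a)
                # The goal is terminal, so no future value is added there.
                best_next = 0.0 if nxt == GOAL else max(q[(nxt, b)] for b in ACTIONS)
                # Standard Q-learning update with learning rate alpha.
                q[(s, a)] += alpha * (reward + GAMMA * best_next - q[(s, a)])
    return q


if __name__ == "__main__":
    learned = q_learning()
    gap = max(abs(learned[k] - closed_form_q(*k)) for k in learned)
    print(f"largest gap between learned and closed-form Q-values: {gap:.2e}")
```

Comparing the learned table against `closed_form_q` gives one simple measure of the learning level: the gap shrinks as the learned Q-values approach their a priori optimal values. Rerunning `q_learning` with different `alpha` values and fewer sweeps shows how the learning rate affects how quickly that gap closes in this toy setting.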