R‐learning with multiple state‐action value tables期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

R‐learning with multiple state‐action value tables

Authors:	Koichiro Ishikawa Akito Sakurai Tsutomu Fujinami Susumu Kunifuji

Affiliation:	1. Japan Advanced Institute of Science and Technology, Japan;2. Keio University, Japan

Abstract:	We propose a method to improve the performance of R‐learning, a reinforcement learning algorithm, by using multiple state‐action value tables. Unlike Q‐ or Sarsa learning, R‐learning learns a policy to maximize undiscounted rewards. Multiple state‐action value tables cause substantial explorations as needed and make R‐learning work well. Efficiency of the proposed method is verified through experiments in a simulated environment. © 2007 Wiley Periodicals, Inc. Electr Eng Jpn, 159(3): 34– 47, 2007; Published online in Wiley InterScience ( www.interscience. wiley.com ). DOI 10.1002/eej.20473

Keywords:	reinforcement learning R‐learning autonomous mobile robot