RMRL: improved regret minimisation techniques using learning automata
Authors: Safiye Ghasemi, Mehdi Dehghan Takht-Fooladi, Amir Masoud Rahmani
Affiliation: 1. Department of Engineering, Sepidan Branch, Islamic Azad University, Sepidan, Iran; 2. Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran; 3. Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran; 4. Computer Science, University of Human Development, Sulaimanyah, Iraq
Abstract: Game theory, one of the most progressive areas of AI in the last few years, originates from the same root as AI. In incomplete-information problems, an agent's lack of knowledge about the other players and their decisions makes it necessary to use learning techniques to enhance the decision-making process. Reinforcement learning techniques are studied in this research; regret minimisation (RM) and utility maximisation (UM) techniques, as reinforcement learning approaches, are widely applied to such scenarios to achieve optimum solutions. Unlike UM, RM techniques enable agents to overcome the shortage of information and improve their choices based on regrets instead of utilities. The idea of merging these two techniques is motivated by iteratively applying UM functions to RM techniques. The main contributions are as follows. First, novel updating methods for RM are proposed, based on the UM of reinforcement learning approaches; the proposed methods refine RM to accelerate regret reduction. Second, different procedures, all relying on RM techniques, are devised for a multi-state predator-prey problem. Third, how the approach, called RMRL, enhances different RM techniques in this problem is studied. The estimated results support the validity of the RMRL approach in comparison with some UM and RM techniques.
Keywords: Incomplete-information games; predator-prey problem; regret minimisation; reinforcement learning
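
The abstract's contrast between choosing by regret and choosing by utility can be illustrated with a minimal sketch of standard regret matching next to a greedy utility rule. This is not the RMRL update from the paper, whose details the abstract does not give. The function names are hypothetical, and the sketch assumes the agent can observe the utility every action would have earned in a round, which an incomplete-information setting may not allow.

```python
from typing import List


def regret_matching_strategy(cum_regret: List[float]) -> List[float]:
    """Mixed strategy proportional to positive cumulative regret.

    Falls back to the uniform strategy when no action has positive regret.
    """
    positive = [max(r, 0.0) for r in cum_regret]
    total = sum(positive)
    n = len(cum_regret)
    if total <= 0.0:
        return [1.0 / n] * n
    return [p / total for p in positive]


def update_regrets(cum_regret: List[float],
                   utilities: List[float],
                   chosen: int) -> List[float]:
    """Add this round's regret for every action.

    Assumption: utilities[a] is what action a would have earned this round.
    Regret of a = utilities[a] - utilities[chosen].
    """
    realised = utilities[chosen]
    return [r + (u - realised) for r, u in zip(cum_regret, utilities)]


def greedy_utility_action(avg_utility: List[float]) -> int:
    """Utility-maximising choice: the action with the highest estimated utility."""
    return max(range(len(avg_utility)), key=lambda a: avg_utility[a])


# One round with three actions: the agent played action 0.
regrets = update_regrets([0.0, 0.0, 0.0], utilities=[1.0, 3.0, 2.0], chosen=0)
strategy = regret_matching_strategy(regrets)
```

After this single round the regret-based rule spreads probability over both actions that would have done better than the one played, whereas the greedy utility rule would commit entirely to the single best estimate.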