RMRL: improved regret minimisation techniques using learning automata |
| |
Authors: | Safiye Ghasemi; Mehdi Dehghan Takht-Fooladi; Amir Masoud Rahmani |
| |
Affiliation: | 1. Department of Engineering, Sepidan Branch, Islamic Azad University, Sepidan, Iran; 2. Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran; 3. Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran; 4. Computer Science, University of Human Development, Sulaimanyah, Iraq |
| |
Abstract: | Game theory, one of the most rapidly advancing areas of AI in the last few years, originates from the same root as AI. In incomplete-information problems, players are unaware of the other players and their decisions, which makes learning techniques necessary to enhance the decision-making process. This research studies reinforcement learning techniques; regret minimisation (RM) and utility maximisation (UM), as reinforcement learning approaches, are widely applied to such scenarios to achieve optimum solutions. Unlike UM, RM techniques enable agents to overcome the shortage of information and enhance the performance of their choices based on regrets instead of utilities. The idea of merging these two techniques is motivated by iteratively applying UM functions to RM techniques. The main contributions are as follows. First, some novel updating methods for RM are proposed, based on the UM of reinforcement learning approaches; the proposed methods refine RM to accelerate regret reduction. Second, different procedures, all relying on RM techniques, are devised for a multi-state predator-prey problem. Third, how the approach, called RMRL, enhances different RM techniques in this problem is studied. The estimated results support the validity of the RMRL approach in comparison with some UM and RM techniques. |
| |
Keywords: | Incomplete-information games; predator-prey problem; regret minimisation; reinforcement learning |
|
|