首页 | 本学科首页   官方微博 | 高级检索  
     

强化学习的可解释方法分类研究
引用本文:唐蕾,牛园园,王瑞杰,行本贝,王一婷. 强化学习的可解释方法分类研究[J]. 计算机应用研究, 2024, 41(6)
作者姓名:唐蕾  牛园园  王瑞杰  行本贝  王一婷
作者单位:长安大学信息工程学院,长安大学信息工程学院,长安大学信息工程学院,长安大学信息工程学院,长安大学信息工程学院
摘    要:强化学习能够在动态复杂环境中实现自主学习,这使其在法律、医学、金融等领域有着广泛应用。但强化学习仍面临着全局状态空间不可观测、对奖励函数强依赖和因果关系不确定等诸多问题,导致其可解释性弱,严重影响其在相关领域的推广,会遭遇诸如难以判断决策是否违反社会法律道德的要求,是否准确及值得信任等的限制。为了进一步了解强化学习可解释性研究现状,从可解释模型、可解释策略、环境交互、可视化等方面展开讨论。基于此,对强化学习可解释性研究现状进行系统论述,对其可解释方法进行归类阐述,最后提出强化学习可解释性的未来发展方向。

关 键 词:强化学习   可解释性   策略-值函数   环境交互   视觉解释
收稿时间:2023-09-22
修稿时间:2024-05-13

Classification study of interpretable methods for reinforcement learning
Tang Lei,Niu Yuanyuan,Wang Ruijie,Xing Benbei and Wang Yiting. Classification study of interpretable methods for reinforcement learning[J]. Application Research of Computers, 2024, 41(6)
Authors:Tang Lei  Niu Yuanyuan  Wang Ruijie  Xing Benbei  Wang Yiting
Affiliation:College of Information Engineering, Chang''an University,,,,
Abstract:Reinforcement learning can achieve autonomous learning in dynamic and complex environments, which makes it widely used in fields such as law, medicine, and finance. However, reinforcement learning still faces many problems such as the unobservable global state space, strong dependence on the reward function, and uncertain causality, which results in its weak interpretability, seriously affecting its promotion in related fields. It will encounter limitations such as difficulty in judging whether the decision-making violates social legal and moral requirements, whether it is accurate and trustworthy, etc. In order to further understand the current status of interpretability research in reinforcement learning, this article discussed from the aspects of interpretable models, interpretable strategies, environment interaction and visualization, etc. Based on these, this article systematically discussed the research status of reinforcement learning interpretability, classified and explained its explainable methods, and finally proposed the future development direction of reinforcement learning interpretability.
Keywords:reinforcement learning   interpretability   strategy-value functions   environment interaction   visual interpretation
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号