强化学习的可解释方法分类研究 Classification study of interpretable methods for reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

强化学习的可解释方法分类研究

引用本文：	唐蕾,牛园园,王瑞杰,行本贝,王一婷. 强化学习的可解释方法分类研究[J]. 计算机应用研究, 2024, 41(6)

作者姓名：	唐蕾牛园园王瑞杰行本贝王一婷

作者单位：	长安大学信息工程学院,长安大学信息工程学院,长安大学信息工程学院,长安大学信息工程学院,长安大学信息工程学院

摘要：	强化学习能够在动态复杂环境中实现自主学习，这使其在法律、医学、金融等领域有着广泛应用。但强化学习仍面临着全局状态空间不可观测、对奖励函数强依赖和因果关系不确定等诸多问题，导致其可解释性弱，严重影响其在相关领域的推广，会遭遇诸如难以判断决策是否违反社会法律道德的要求，是否准确及值得信任等的限制。为了进一步了解强化学习可解释性研究现状，从可解释模型、可解释策略、环境交互、可视化等方面展开讨论。基于此，对强化学习可解释性研究现状进行系统论述，对其可解释方法进行归类阐述，最后提出强化学习可解释性的未来发展方向。
关键词：	强化学习可解释性策略-值函数环境交互视觉解释
收稿时间：	2023-09-22
修稿时间：	2024-05-13
Classification study of interpretable methods for reinforcement learning

Tang Lei,Niu Yuanyuan,Wang Ruijie,Xing Benbei and Wang Yiting. Classification study of interpretable methods for reinforcement learning[J]. Application Research of Computers, 2024, 41(6)

Authors:	Tang Lei Niu Yuanyuan Wang Ruijie Xing Benbei Wang Yiting

Affiliation:	College of Information Engineering, Chang''an University,,,,

Abstract:	Reinforcement learning can achieve autonomous learning in dynamic and complex environments, which makes it widely used in fields such as law, medicine, and finance. However, reinforcement learning still faces many problems such as the unobservable global state space, strong dependence on the reward function, and uncertain causality, which results in its weak interpretability, seriously affecting its promotion in related fields. It will encounter limitations such as difficulty in judging whether the decision-making violates social legal and moral requirements, whether it is accurate and trustworthy, etc. In order to further understand the current status of interpretability research in reinforcement learning, this article discussed from the aspects of interpretable models, interpretable strategies, environment interaction and visualization, etc. Based on these, this article systematically discussed the research status of reinforcement learning interpretability, classified and explained its explainable methods, and finally proposed the future development direction of reinforcement learning interpretability.

Keywords:	reinforcement learning interpretability strategy-value functions environment interaction visual interpretation

	点击此处可从《计算机应用研究》浏览原始摘要信息
	点击此处可从《计算机应用研究》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏