首页 | 本学科首页   官方微博 | 高级检索  
     

深度强化学习研究综述
引用本文:赵星宇,丁世飞.深度强化学习研究综述[J].计算机科学,2018,45(7):1-6.
作者姓名:赵星宇  丁世飞
作者单位:中国矿业大学计算机科学与技术学院 江苏 徐州221116,中国矿业大学计算机科学与技术学院 江苏 徐州221116;中国科学院计算技术研究所智能信息处理重点实验室 北京100190
基金项目:本文受国家自然科学基金(61379101,61672522),国家重点基础研究发展计划(2013CB329502)资助
摘    要:作为一种崭新的机器学习方法,深度强化学习将深度学习和强化学习技术结合起来,使智能体能够从高维空间感知信息,并根据得到的信息训练模型、做出决策。由于深度强化学习算法具有通用性和有效性,人们对其进行了广泛的研究,并将其运用到了日常生活的各个领域。首先,对深度强化学习研究进行概述,介绍了深度强化学习的基础理论;然后,分别介绍了基于值函数和基于策略的深度强化学习算法,讨论了其应用前景;最后,对相关研究工作做了总结和展望。

关 键 词:深度强化学习  深度学习  强化学习  人工智能
收稿时间:2017/6/12 0:00:00
修稿时间:2017/8/20 0:00:00

Research on Deep Reinforcement Learning
ZHAO Xing-yu and DING Shi-fei.Research on Deep Reinforcement Learning[J].Computer Science,2018,45(7):1-6.
Authors:ZHAO Xing-yu and DING Shi-fei
Affiliation:School of Computer Science and Technology,China University of Mining and Technology,Xuzhou,Jiangsu 221116,China and School of Computer Science and Technology,China University of Mining and Technology,Xuzhou,Jiangsu 221116,China;Key Laboratory of Intelligent Information Processing,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
Abstract:As a new machine learning method,deep reinforcement learning combines deep learning and reinforcement learning,which makes that the agent can perceive the information from high dimensional space,train model and make decision according to the received information.Deep reinforcement learning has been widely researched and used in va-rious fields of daily life because of its universality and effectiveness.Firstly,an overview of the deep reinforcement lear-ning research was given and the basic theory of deep reinforcement learning was introduced.Then value-based algorithms and policy-based algorithms were introduced.After that,the application prospects of deep reinfercement learning were discussed.Finally,the related researches were summarized and prospected.
Keywords:Deep reinforcement learning  Deep learning  Reinforcement learning  Artificial intelligence
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号