分层强化学习综述 Summarize of hierarchical reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

分层强化学习综述

引用本文：	周文吉,俞扬.分层强化学习综述[J].智能系统学报,2017,12(5):590-594.

作者姓名：	周文吉俞扬

作者单位：	南京大学软件新技术国家重点实验室, 江苏南京 210023

摘要：	强化学习（reinforcement learning）是机器学习和人工智能领域的重要分支，近年来受到社会各界和企业的广泛关注。强化学习算法要解决的主要问题是，智能体如何直接与环境进行交互来学习策略。但是当状态空间维度增加时，传统的强化学习方法往往面临着维度灾难，难以取得好的学习效果。分层强化学习（hierarchical reinforcement learning）致力于将一个复杂的强化学习问题分解成几个子问题并分别解决，可以取得比直接解决整个问题更好的效果。分层强化学习是解决大规模强化学习问题的潜在途径，然而其受到的关注不高。本文将介绍和回顾分层强化学习的几大类方法。
关键词：	人工智能机器学习强化学习分层强化学习深度强化学习马尔可夫决策过程半马尔可夫决策过程维度灾难
Summarize of hierarchical reinforcement learning

ZHOU Wenji,YU Yang.Summarize of hierarchical reinforcement learning[J].CAAL Transactions on Intelligent Systems,2017,12(5):590-594.

Authors:	ZHOU Wenji YU Yang

Affiliation:	National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China

Abstract:	Reinforcement Learning (RL) is an important research area in the field of machine learning and artificial intelligence and has received increasing attentions in recent years. The goal in RL is to maximize long-term total reward by interacting with the environment. Traditional RL algorithms are limited due to the so-called curse of dimensionality, and their learning abilities degrade drastically with increases in the dimensionality of the state space. Hierarchical reinforcement learning (HRL) decomposes the RL problem into sub-problems and solves each of them to improve learning ability. HRL offers a potential way to solve large-scale RL, which has received insufficient attention to date. In this paper, we introduce and review several main HRL methods.

Keywords:	artificial intelligence machine learning reinforcement learning hierarchical reinforcement learning deep reinforcement learning Markov decision process semi-Markov decision process dimensional curse

	点击此处可从《智能系统学报》浏览原始摘要信息
	点击此处可从《智能系统学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏