Advances in reinforcement learning have produced remarkable successes across many domains. Although the multi-agent setting was long overshadowed by its single-agent counterpart during this progress, multi-agent reinforcement learning is now gaining rapid traction, and its latest accomplishments address problems of real-world complexity. This article provides an overview of current developments in the field of multi-agent deep reinforcement learning. We focus primarily on recent literature that combines deep reinforcement learning methods with multi-agent scenarios. To survey the works that constitute the contemporary landscape, the main contents are divided into three parts. First, we analyze the structure of the training schemes used to train multiple agents. Second, we examine the patterns of agent behavior that emerge in cooperative, competitive, and mixed scenarios. Third, we systematically enumerate challenges that arise exclusively in the multi-agent domain and review the methods used to cope with them. To conclude the survey, we discuss advances, identify trends, and outline possible directions for future work in this research area.
Similar literature: In recent years, artificial intelligence (AI) has been used to build complex automated control systems, yet researchers are still pursuing fully autonomous systems that resemble human beings. Many AI researchers argue that there is a strong connection between human learning and machine learning (ML), and that ML algorithms can effectively produce self-learning systems. ML is a subfield of AI, and within it reinforcement learning (RL) is the methodology that most closely resembles the learning mechanism of the human brain; RL therefore plays a key role in the creation of autonomous robotic systems. In recent years, RL has been applied to many robotic platforms (air-based, underwater, land-based, etc.) with considerable success on complex tasks. This paper presents a brief overview of the application of reinforcement learning algorithms in robotic science. The survey offers a comprehensive review organized into segments: (1) the development of RL; (2) types of RL algorithms, such as actor-critic, deep RL, multi-agent RL, and human-centered algorithms; (3) applications of RL in robotics by usage platform (land-based, water-based, and air-based); and (4) the RL algorithms and mechanisms used in robotic applications. Finally, an open discussion raises a range of future research directions in robotics. The objective of this survey is to provide a guide toward more meaningful directions for future research.
Similar literature: Reinforcement learning from expert demonstrations (RLED) lies at the intersection of imitation learning and reinforcement learning and seeks to combine the advantages of both. RLED uses demonstration trajectories to improve sample efficiency in high-dimensional spaces and is a promising approach to behavioral learning from an expert teacher. It considers two possible knowledge sources for guiding the reinforcement learning process: prior knowledge and online knowledge. This survey focuses on novel methods for model-free reinforcement learning guided by demonstrations, which are commonly, but not necessarily, provided by humans. The methods are analyzed and classified according to the impact of the demonstrations, and challenges, applications, and promising directions for improving the surveyed methods are also reviewed.
Similar literature: This article covers deep learning (DL) and deep reinforcement learning (DRL) applied to robotics. Both tools have proven successful at delivering data-driven solutions for robotics tasks and at providing a natural way to build an end-to-end pipeline from a robot's sensing to its actuation, passing through the generation of a policy for the given task. These frameworks can cope with real-world complications such as sensing noise, imprecise actuation, and variability in the scenarios where the robot is deployed. In that vein, and given the growing interest in DL and DRL, the work begins with a brief tutorial on deep reinforcement learning that introduces the main concepts and approaches in the field. It then describes the main, recent, and most promising DL and DRL approaches in robotics, with enough technical detail to convey the core of each work and to motivate interested readers to start their own research in the area. To provide a comparative analysis, several taxonomies are presented in which the references can be classified according to high-level features: the task each work addresses, the type of system, and the learning techniques used. The article concludes by presenting promising research directions in both DL and DRL.
Similar literature: Artificial intelligence (AI) is advancing by leaps and bounds in almost every field of technology, and autonomous vehicle (AV) research is no exception. This paper proposes the use of deep learning (DL) algorithms in the control layer of an autonomous vehicle. More specifically, deep reinforcement learning (DRL) algorithms, namely Deep Q-Network (DQN) and Deep Deterministic Policy Gradient (DDPG), are implemented and their results compared. The aim of the work is to obtain a trained model, using a DRL algorithm, capable of sending control commands to the vehicle so that it follows a given route properly and efficiently. In addition, several agents are presented as solutions for each algorithm, with each agent using different data sources to derive the vehicle control commands. The open-source CARLA simulator is used for this purpose, giving the system the ability to run a multitude of tests without risk in a hyper-realistic urban simulation environment, something unthinkable in the real world. The results show that both DQN and DDPG reach the goal, but DDPG achieves better performance: it produces trajectories very similar to those of a classic controller such as LQR, and in both cases the RMSE is below 0.1 m when following trajectories of 180-700 m. Conclusions and future work are discussed at the end.
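The abstract above reports a cross-track RMSE below 0.1 m as its accuracy criterion. As a rough illustration of how such a metric can be computed from matched waypoints, here is a minimal sketch; the function name, the (x, y) waypoint representation, and the assumption that driven and reference paths are sampled at corresponding points are ours, not taken from the paper:

```python
import math

def trajectory_rmse(actual, reference):
    """Root-mean-square error between a driven trajectory and a
    reference route, both given as equal-length lists of (x, y)
    waypoints sampled at corresponding points along the path."""
    assert len(actual) == len(reference) and len(actual) > 0
    # Squared Euclidean distance between each pair of matched waypoints.
    sq_err = [
        (ax - rx) ** 2 + (ay - ry) ** 2
        for (ax, ay), (rx, ry) in zip(actual, reference)
    ]
    return math.sqrt(sum(sq_err) / len(sq_err))

# Example: a vehicle drifting a constant 0.05 m laterally off a straight route.
route = [(float(x), 0.0) for x in range(200)]
driven = [(float(x), 0.05) for x in range(200)]
print(trajectory_rmse(driven, route))  # 0.05, well under the 0.1 m threshold
```

In practice the driven trajectory would first have to be resampled or projected onto the reference route so that waypoints correspond; the sketch assumes that alignment has already been done.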
Similar literature: Analyzing videos and images captured by unmanned aerial vehicles (aerial drones) is an emerging application attracting significant attention from researchers across computer vision. The major challenge at present is the development of autonomous operations that can complete missions and replace human operators. Based on how drone-captured videos and images are analyzed in computer vision, this paper reviews these applications in three groups. The first group concerns remote sensing, with challenges such as camera calibration, image matching, and aerial triangulation. The second concerns autonomous drone navigation, where computer vision methods address challenges such as flight control, visual localization and mapping, target tracking, and obstacle detection. The third group is dedicated to using drone-captured images and videos in applications such as surveillance, agriculture and forestry, animal detection, disaster detection, and face recognition. Because most of the computer vision methods in these three categories are designed for real-world conditions, which cannot simply be recreated with drones on demand, suitable datasets are essential, and we focus on papers that provide databases for these purposes. Some recent surveys cover the first two groups, but none has aimed at exploring the available databases. This paper presents a complete review of the databases in the first two groups and of the works that used those databases to apply their methods. Vision-based intelligent applications and their databases are explored in the third group, and open problems and avenues for future research are discussed.