共查询到10条相似文献,搜索用时 93 毫秒
1.
C. Ribeiro 《Artificial Intelligence Review》2002,17(3):223-250
Reinforcement Learning (RL) is learning through directexperimentation. It does not assume the existence of a teacher thatprovides examples upon which learning of a task takes place. Instead, inRL experience is the only teacher. With historical roots on the study ofbiological conditioned reflexes, RL attracts the interest of Engineersand Computer Scientists because of its theoretical relevance andpotential applications in fields as diverse as Operational Research andIntelligent Robotics.Computationally, RL is intended to operate in a learning environmentcomposed by two subjects: the learner and a dynamic process. Atsuccessive time steps, the learner makes an observation of the processstate, selects an action and applies it back to the process. Its goal isto find out an action policy that controls the behavior of the dynamicprocess, guided by signals (reinforcements) that indicate how badly orwell it has been performing the required task. These signals are usuallyassociated to a dramatic condition – e.g., accomplishment of a subtask(reward) or complete failure (punishment), and the learner tries tooptimize its behavior by using a performance measure (a function of thereceived reinforcements). The crucial point is that in order to do that,the learner must evaluate the conditions (associations between observedstates and chosen actions) that led to rewards or punishments.Starting from basic concepts, this tutorial presents the many flavorsof RL algorithms, develops the corresponding mathematical tools, assesstheir practical limitations and discusses alternatives that have beenproposed for applying RL to realistic tasks. 相似文献
2.
介绍用模糊逻辑、专家系统和神经网络等人工智能技术描述电梯交通系统的动态特性,讨论了人工智能技术在现代电梯群控系统中的应用。 相似文献
3.
Conventional robot control schemes are basically model-based methods. However, exact modeling of robot dynamics poses considerable problems and faces various uncertainties in task execution. This paper proposes a reinforcement learning control approach for overcoming such drawbacks. An artificial neural network (ANN) serves as the learning structure, and an applied stochastic real-valued (SRV) unit as the learning method. Initially, force tracking control of a two-link robot arm is simulated to verify the control design. The simulation results confirm that even without information related to the robot dynamic model and environment states, operation rules for simultaneous controlling force and velocity are achievable by repetitive exploration. Hitherto, however, an acceptable performance has demanded many learning iterations and the learning speed proved too slow for practical applications. The approach herein, therefore, improves the tracking performance by combining a conventional controller with a reinforcement learning strategy. Experimental results demonstrate improved trajectory tracking performance of a two-link direct-drive robot manipulator using the proposed method. 相似文献
4.
5.
电梯群控系统的任务是有效地运送乘客,提高电梯运行效率,改善服务质量,根据大楼不同交通流状况识别不同的交通流模式,并采用最合适的调度方法分派电梯是提高群控性能的关键,实现了一种基于模式识别的智能多模式电梯群控调度方法,该方法可以在一天中根据不同的交通流状况,提供不同的群控策略,从而使电梯服务更优,仿真实验表明了这种电梯调度方法是有效的。 相似文献
6.
7.
8.
基于人工神经网络的自适应模糊电梯群控系统 总被引:1,自引:0,他引:1
针对电梯客流的随机性、离散性及电梯群控系统智能调度的复杂性。综合考虑了电梯运行的评价标准,结合了人工神经网络和模糊控制各自的优点,用神经网络的学习机制为模糊控制器自动提取并调整模糊规则及模糊隶属函数。提高了电梯群控的智能性。合理分配电梯应答,防止聚堆和忙闲不均情况的发生。大大减少了平均候梯时间和长候梯率。仿真实验及初步应用结果表明这种电梯调度方法是有效的。 相似文献
9.
10.
基于Canbus的电梯群控系统的设计 总被引:1,自引:0,他引:1
针对电梯和微机之间实时高速通讯的特点,结合现场总线技术尤其是CAN总线技术在国内的推广应用,完成了控制系统的设计。该系统是基于微机作为上位监控计算机,Pc1-841CAN卡等作为下位机的遵循CAN通信规程的通信系统。对系统的硬件结构和软件设计进行了深入的研究和系统的阐述,提出了实现方案。 相似文献