首页 | 本学科首页   官方微博 | 高级检索  
     


Optimal hysteresis for a class of deterministic deteriorating two-armed Bandit problem with switching costs
Authors:F Dusonchet [Author Vitae]  M-O Hongler [Author Vitae]
Affiliation:EPFL-DMT-IPM, Laboratoire de Production Microtechnique (LPM), Institut de Production et Robotique (IPR), CH-1015 Lausanne, Switzerland
Abstract:We derive the optimal policy for the dynamic scheduling of a class of deterministic, deteriorating, continuous time and continuous state two-armed Bandit problems with switching costs. Due to the presence of switching costs, the scheduling policy exhibits an hysteretic character. Using this exactly solvable class of models, we are able to explicitly observe the performance of a sub-optimal policy derived from a set of generalized priority indices (generalized Gittins’ indices) similar to those first introduced in a contribution of Asawa and Teneketzis (IEE Trans. Automat. Control 41 (1996) 328).
Keywords:Multi-armed Bandit process  Switching costs  Optimal switching curves  Hysteretic policy  Priority index policy
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号