Optimal hysteresis for a class of deterministic deteriorating two-armed Bandit problem with switching costs |
| |
Authors: | F. Dusonchet, M.-O. Hongler |
| |
Affiliation: | EPFL-DMT-IPM, Laboratoire de Production Microtechnique (LPM), Institut de Production et Robotique (IPR), CH-1015 Lausanne, Switzerland |
| |
Abstract: | We derive the optimal policy for the dynamic scheduling of a class of deterministic, deteriorating, continuous-time and continuous-state two-armed Bandit problems with switching costs. Due to the presence of switching costs, the scheduling policy exhibits a hysteretic character. Using this exactly solvable class of models, we are able to explicitly observe the performance of a sub-optimal policy derived from a set of generalized priority indices (generalized Gittins' indices) similar to those first introduced in a contribution of Asawa and Teneketzis (IEEE Trans. Automat. Control 41 (1996) 328). |
| |
Keywords: | Multi-armed Bandit process; Switching costs; Optimal switching curves; Hysteretic policy; Priority index policy |
This article is indexed in ScienceDirect and other databases. |