CFDP 1240

Stationary Multi Choice Bandit Problems


Publication Date: October 1999

Pages: 12


This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed bandit problem can be characterized in terms of the Gittins index of each arm. The index characterization remains valid when switching costs are introduced.