Skip to main content
Discussion Paper

Stationary Multi Choice Bandit Problems

This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed bandit problem can be characterized in terms of the Gittins index of each arm. The index characterization remains equally valid after the introduction of switching costs.