Stationary Multi Choice Bandit Problems

This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed bandit problem can be characterized in terms of the Gittins index of each arm. The index characterization remains equally valid after the introduction of switching costs.

Abstract

This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed bandit problem can be characterized in terms of the Gittins index of each arm. The index characterization remains equally valid after the introduction of switching costs.

Document
Control
Number(s)

CFDP 1240

Author(s)

Dirk Bergemann

Publication Link

Stationary Multi Choice Bandit Problems

Page Count

12

Publication Date

October 1999

Revision Date

May 2022