Bandit Problems

Abstract

We survey the literature on multi-armed bandit models and their applications in economics. The multi-armed bandit problem is a statistical decision model of an agent trying to optimize his decisions while improving his information at the same time. This classic problem has received much attention in economics as it concisely models the trade-off between exploration (trying out each arm to find the best one) and exploitation (playing the arm believed to give the best payoff).

Document Control Number(s)

CFDP 1551

Author(s)

Dirk Bergemann

Page Count

15

Publication Date

January 2006

Revision Date

May 2022