SLIDE 13 mon-logo Framework Lower Bound Algorithms Experiments Conclusion
Experiments with Bernoulli distributions
Experiment 5: Arithmetic progression, K = 15, µi = 0.5 − 0.025i, i ∈ {1, . . . , 15}. Experiment 7: Three groups of bad arms, K = 30, µ1 = 0.5, µ2:6 = 0.45, µ7:20 = 0.43, µ21:30 = 0.38.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4
Experiment 5, n=4000 Probability of error
1 : Unif 2−4 : HR 5 : SR 6−9 : UCB−E 10−14 : Ad UCB−E 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Experiment 7, n=6000 Probability of error
1 : Unif 2−4 : HR 5 : SR 6−9 : UCB−E 10−14 : Ad UCB−E
Jean-Yves Audibert & S´ ebastien Bubeck & R´ emi Munos Best Arm Identification in Multi-Armed Bandits