Improved Bandit Algorithms
Reduced Variance Payoff Estimation in Adversarial Bandit Problems
Levente Kocsis Csaba Szepesv´ ari
Computer and Automation Research Institute of the Hungarian Academy of Sciences Kende u. 13-17, Budapest 1111, Hungary E-mail: szcsaba@sztaki.hu
ECML-2005 Workshop on Reinforcement Learning in Non-stationary Environments Porto, 2005
- L. Kocsis and Cs. Szepesv´
ari Improved Bandit Algorithms